Being compilation-based means IREE does not have a traditional runtime that dispatches “ops” to their fat kernel implementations. What IREE provides is a toolbox for different deployment scenarios. It scales from running generated code on a particular API (such as emitting C code calling external DSP kernels), to a HAL (Hardware Abstraction Layer) that allows the same generated code to target multiple APIs (like Vulkan and Direct3D 12), to a full VM allowing runtime model loading for flexible deployment options and heterogeneous execution.
IREE aims to
- Support advanced models on mobile/edge devices. Dynamic shapes, dynamic flow control, dynamic multi-model dispatch, streaming models, tree-based search algorithms, and other are all good examples of exciting ML evolution. We are trying to build IREE from the ground-up to enable these models and run them efficiently on modern hardware, especially on mobile/edge devices.
- Demonstrate MLIR‘s ability to develop non-traditional ML compiler backends and runtimes. MLIR enables IREE’s holistic approach of focusing on the math being performed and how that math is scheduled rather than graphs of “ops”.
- Embrace standard-based ML via Vulkan. The graphics world is shifting towards favoring modern explicit APIs for performance and predictability and Vulkan is emerging as the “compatibility” layer. We would love to allow hardware vendors to be able to make ML efficient on their hardware without the need for bespoke runtimes and special access. We also would love to let developers and users utilize all the hardware available on as many platforms as possible.
Roadmap and Milestones
IREE is in the early stages of development and not yet ready for broad adoption. Check out the long-term design roadmap to get a sense of where we're headed.
We plan on a quarterly basis using OKRs. Review our latest objectives to get a sense of what we're up to in the near term.
We use GitHub Projects to track progress on IREE components and specific efforts. We use GitHub Milestones to track the work associated with plans for each quarter.
Build Status
CI System | Build System | Platform | Architecture | Component | Status |
---|
Kokoro | Bazel | Linux | x86 | Core |  |
Kokoro | Bazel | Linux | x86 | Bindings |  |
Kokoro | Bazel | Linux | x86-swiftshader | Integrations |  |
Kokoro | Bazel | Linux | x86-turing | Integrations |  |
Kokoro | CMake | Linux | x86-swiftshader | Core + Bindings |  |
Kokoro | CMake | Linux | x86-turing | Core + Bindings |  |
Kokoro | CMake | Android | arm64-v8a | Runtime (build only) |  |
BuildKite | CMake | Android | arm64-v8a | Runtime |  |
License
IREE is licensed under the terms of the Apache license. See LICENSE for more information.