| commit | 5c38bcc7068f486733584bd251c9596059c31b7f | [log] [tgz] |
|---|---|---|
| author | Lei Zhang <antiagainst@google.com> | Mon Jun 05 17:29:11 2023 -0400 |
| committer | GitHub <noreply@github.com> | Mon Jun 05 14:29:11 2023 -0700 |
| tree | 2019d84f5022c4c79eb2bace9aa94fa68f99fd21 | |
| parent | 2544efec2f33bcbd12d1a07e1b8eda354b7d0d3c [diff] |
[cuda] Implement basics for a CUDA HAL driver rewrite (#13942) This commit starts a CUDA HAL driver rewrite under `experimental/`. We create a new `cuda2/` directory to host the new code to avoid interrupting the current CodeGen development. This commit just brings in the basics for boot up a new HAL driver, including dynamic symbols management, error status management, and IREE HAL driver implementation. Most of the code is directly copied from existing HAL driver, with noticeable changes: * Split CUDA and NCCL dynamic symbols into separate structures for better organziation and allowing optionality. * Fleshed out CUDA error to IREE status conversions. * Better organized code blocks and improved error messages and various comments. Building this commmit with `-DIREE_EXTERNAL_HAL_DRIVERS=cuda2`, we can have `tools/iree-run-module --dump_devices` showing `cuda2` devices, in parallel to the existing CUDA one. Progress towards https://github.com/openxla/iree/issues/13245
IREE (Intermediate Representation Execution Environment, pronounced as “eerie”) is an MLIR-based end-to-end compiler and runtime that lowers Machine Learning (ML) models to a unified IR that scales up to meet the needs of the datacenter and down to satisfy the constraints and special considerations of mobile and edge deployments.
See our website for project details, user guides, and instructions on building from source.
IREE is still in its early phase. We have settled down on the overarching infrastructure and are actively improving various software components as well as project logistics. It is still quite far from ready for everyday use and is made available without any support at the moment. With that said, we welcome any kind of feedback on any communication channels!
See our website for more information.
IREE is licensed under the terms of the Apache 2.0 License with LLVM Exceptions. See LICENSE for more information.