- 7404ce9 [SHM] Add huge page and NUMA placement support (#23697) by Ben Vanik · 2 hours ago main
- 487d2d3 Add MPSC queue. (#23700) by Ben Vanik · 2 hours ago
- 2604130 Async proactor fixes: TSAN bridge and progress callback starvation (#23699) by Ben Vanik · 3 hours ago
- de74bde [Base] Add status copy allocation and payload inspection APIs (#23698) by Ben Vanik · 3 hours ago
- b4af1cd Replace libc printf with eyalroz/printf and add streaming status formatting. (#23694) by Ben Vanik · 4 hours ago
- 69b8c51 [docs] Clarify bazelisk use on linux (#23691) by Jakub Kuderski · 7 hours ago
- ed0d4f4 [VM] Fix ref leak from incorrect MOVE bit on branch block args (#23689) by Jorn Tuyls · 8 hours ago
- bef4552 Fixes for UBSan compatibility across the runtime (#23692) by Ben Vanik · 2 days ago
- 41f14d3 Cherry-pick ODS double-space fix and update tests (#23690) by Jakub Kuderski · 2 days ago
- 3a4c991 Async and base infrastructure for cross-process shared memory. (#23688) by Ben Vanik · 2 days ago
- 16155fd [VectorExt] Add TransferScatterOp definition, verifier, lit tests. (#23666) by Keshav Vinayak Jha · 2 days ago
- 0314c27 Fix Android shm_test and skip coordinated test on RISC-V by Ben Vanik · 2 days ago
- a91a278 Fix CI failures on Android, RISC-V/QEMU, and GCC by Ben Vanik · 2 days ago
- 36d64f8 [Async] Fix POSIX proactor short write handling in socket sends by Ben Vanik · 2 days ago
- 1135e41 Fix clang-format in primitive_test.cc by Ben Vanik · 2 days ago
- 37e4cb6 [Async] Fix axis failure propagation to semaphores by Ben Vanik · 2 days ago
- 11b205b [Async] Fix send data lifetime during CQE callback processing by Ben Vanik · 2 days ago
- bc2a1e8 [Testing] Add self-launching coordinated test harness by Ben Vanik · 4 days ago
- 4cd002d [Async] Add primitive handle dup/close and tests by Ben Vanik · 4 days ago
- 57ac620 [Async] Add shared (cross-process) buffer pool by Ben Vanik · 5 days ago
- 1f58d51 [Async] Add carrier freelist pool to IOCP proactor by Ben Vanik · 5 days ago
- c8dc247 [Base] Add memory sealing for shared memory regions by Ben Vanik · 5 days ago
- 5958763 [Async] Use NtAssociateWaitCompletionPacket for IOCP event waits by Ben Vanik · 2 days ago
- cc210c2 [Async] Fix test_base.h for Windows and macOS compatibility by Ben Vanik · 2 days ago
- 7d77ecf [Async] Add proactor inline progress callbacks by Ben Vanik · 2 days ago
- daaa3e1 [Async] Use doubly-linked list for IOCP active_carriers by Ben Vanik · 6 days ago
- fefad4f [Async] Add tail fast-path for timer list insertion by Ben Vanik · 6 days ago
- ce282a8 [Async] Fix cross-process notification epoch handling in proactors by Ben Vanik · 2 days ago
- 2cef494 [Async] Add cross-process shared notification support by Ben Vanik · 12 days ago
- 5833a21 [Base] Add lock-free SPSC queue for shared memory transport by Ben Vanik · 12 days ago
- c02e48b [Base] Add platform-abstracted shared memory primitives by Ben Vanik · 12 days ago
- 07fb538 Integrate llvm/llvm-project@ab10f0848b22417afc6352051b6e26c7704084d2 (#23683) by RattataKing · 2 days ago latest-snapshot
- 314b4e6 [Codegen] Add PipelineAttrInterface and PassPipelineAttr (#23590) by Quinn Dawkins · 2 days ago
- d8b6df3 Integrate llvm/llvm-project@a631c3f4077cf84c90f9dfa98a9bdbc1e568f055 (#23670) by RattataKing · 3 days ago
- 7de2750 [NFC][Codegen] Move distribution pattern infrastructure to VectorExt (#23669) by Kunwar Grover · 3 days ago
- 80d70d6 [GlobalOpt] Support Img2Col Transformation for Conv2D Including Quantized Types (#23278) by Phoebe Chen · 3 days ago
- 205b17f [Tokenizer] Add tiktoken format loader for OpenAI BPE vocabularies. (#23663) by Ben Vanik · 3 days ago
- c114a89 [PCF] Add tensor.collapse_shape fusion into pcf.generic/loop (#23491) by Quinn Dawkins · 3 days ago
- a02e85f [CPU] Propagate the reduction tile sizes to producers because of fusion. (#23660) by Han-Chung Wang · 4 days ago
- 098465b Integrate llvm/llvm-project@03773c3b06b2a9a023f8bab892f237fb3b980439 (#23651) by RattataKing · 4 days ago
- 8df2db9 [VectorDistribution] Relax layout size constraint (#23625) by Lukas Sommer · 4 days ago
- 04784a9 Rewriting the HAL CTS to support bazel and scale better. (#23644) by Ben Vanik · 4 days ago
- 92bbb28 [Async] Gracefully degrade io_uring slab registration on RLIMIT_MEMLOCK (#23654) by Ben Vanik · 4 days ago
- 927c775 [DispatchCreation] Fix fusion of scalar reduction with consumer (#23659) by Ian Wood · 4 days ago
- a385a68 Add samples/hal/hello: pure HAL buffer fill, copy, and readback. (#23645) by Ben Vanik · 4 days ago
- e002a09 [GPUHeuristics] Prefer larger MMA intrinsics for very large compute-bound GEMMs (#23641) by Alan Li · 5 days ago
- a657d73 Bump stablehlo to d0484f6 (#23649) by RattataKing · 5 days ago
- 2df49e5 [test] Enable e2e attention tests for arm64 targets. (#23635) by Han-Chung Wang · 5 days ago
- 1ea0b66 [Codegen] Remove attention transpose intrinsic hacks (#23633) by Kunwar Grover · 5 days ago
- 46b54e8 Integrate llvm/llvm-project@386a3afa553fbf2854ee9d002e83bd456942ad39 (#23639) by RattataKing · 5 days ago
- ce92d23 [docs] Update python versions listed on the website (#23647) by Jakub Kuderski · 5 days ago
- 2284319 [GPU] Make GPUVectorAlloc allocate shared memory based on layout analysis (#23631) by Kunwar Grover · 5 days ago
- c7072e9 [Codegen] Fix layout analysis fixup crashes (#23630) by Kunwar Grover · 5 days ago
- f63db8d [GPUHeuristics] Add min-based tile distribution for imbalanced M/N problems (#23619) by Vivian Zhang · 5 days ago
- a1ef7f9 A couple of fixes picked up in the fusilli tests using sanitizers. (#23617) by Andrew Woloszyn · 5 days ago
- 776210b [Python] Stop building cp313 wheels on MacOS and Windows (#23640) by Jakub Kuderski · 5 days ago
- 2e4ef0f [Torch] Add flag to enable shape refinement (#23632) by Ian Wood · 5 days ago
- ae93b81 [CMake][Python] Auto-enable Stable ABI (abi3) for CPython 3.12+ with GIL (#23629) by Jakub Kuderski · 5 days ago
- 39805c6 [Python][CI] Fix release validation installing wrong package versions (#23634) by Jakub Kuderski · 5 days ago
- 5fcbdbc [VMVX] Enable scf.forall distribution for VMVX pipelines. (#23615) by Han-Chung Wang · 5 days ago
- 771eac5 Initial plumbing for inner_tiled on CPU with data-tiled MMA attribute. (#23494) by Benoit Jacob · 6 days ago
- f2a2972 [Python][CMake] Set py_limited_api on CMakeExtension for abi3 wheels (#23628) by Jakub Kuderski · 6 days ago
- 84f18ff [CMake][Python] Bump minimum CMake version to 3.26 (#23607) by Jakub Kuderski · 6 days ago
- 4e8dde4 [VectorDistribute] Refactor VectorLayoutAnalysis into 2-phase forward/backward design (#23611) by Kunwar Grover · 6 days ago
- 34f93d2 [VectorDistribute] Consider all compute ops for thread tile size (#23394) by Lukas Sommer · 6 days ago
- 9c4aa35 Integrate llvm/llvm-project@783eeb2ccfcac136f2191bda705ecd4a1a553718 (#23616) by RattataKing · 6 days ago
- e95e5d3 [SPIRV] Enable small float support in SPIR-V pipeline. (#23391) by Han-Chung Wang · 7 days ago
- 06fa699 [CPU] Drop scf.for distribution option from CPU pipeline. (#23614) by Han-Chung Wang · 7 days ago
- b8eef48 [CI] Add typos pre-commit hook and dictionary. (6/6) (#23606) by Jakub Kuderski · 7 days ago
- e889208 [LLVMGPU] Remove old dead code for LLVMGPUVectorDistribute (#23596) by Kunwar Grover · 7 days ago
- a21fa38 [Python] Add Stable ABI (abi3) support for Python 3.12+ (#23600) by Jakub Kuderski · 7 days ago
- fb7e890 Fix low-frequency typos in compiler (non-Codegen). NFC. (4/6) (#23604) by Jakub Kuderski · 7 days ago
- d66e44e Fix low-frequency typos in runtime, docs, and build tools. NFC. (5/6) (#23605) by Jakub Kuderski · 7 days ago
- afb0265 Fix low-frequency typos in compiler/Codegen. NFC. (3/6) (#23603) by Jakub Kuderski · 7 days ago
- fbf5541 Fix medium-frequency typos. NFC. (2/6) (#23602) by Jakub Kuderski · 7 days ago
- 566c36f Fix high-frequency typos. NFC. (1/6) (#23601) by Jakub Kuderski · 7 days ago
- c18957b [Codegen][GPU] Clean up prefetch pipeline stages flag to support integer values (#23568) by Zhewen Yu · 9 days ago
- 4ac7074 Fix the ordering of the external stream. (#23598) by Andrew Woloszyn · 10 days ago
- eb76100 [CI] Bump Python version from 3.9 to 3.10 in build_package.yml (#23599) by RattataKing · 10 days ago
- 1cf18c7 [LDS] Add fallback for CoalescedGatherDMA lowering. (#23560) by Alan Li · 10 days ago
- 9c1a851 [LLVMGPU] Use inner_tiled in LLVMGPUVectorDistribute (#23595) by Kunwar Grover · 10 days ago
- 1fe030b Add --exclude-libs=ALL to libIREECompiler.so shared library (#23574) by Aaron St George · 10 days ago
- b082090 [CPU] Expose more options to CPUCodegenOptions. (#23586) by Han-Chung Wang · 10 days ago
- a314ffe [Python] Bump minimum required Python version to 3.10 (#23591) by RattataKing · 10 days ago
- 37b1b8d [NFC][LLVMGPU] Remove unused ROCDLLowerExecutableTargetPass (#23593) by Quinn Dawkins · 10 days ago
- 731f21e [ROCM][DT] Add gfx950 f8e4m3fn ukernel (#23581) by Zhewen Yu · 10 days ago
- 0254eef Revert "Bump the github-actions group with 2 updates" (#23592) by Sambhav Jain · 10 days ago
- 3e6d203 Revert "Reordered calls to set_external_stream_data_completed." (#23589) by Andrew Woloszyn · 11 days ago
- e4531e6 [CODEGEN] Pattern to hoist expand_shape & collapse_shape from scf.for loop (#23572) by YashDeshpande25 · 11 days ago
- d25e1ac Reordered calls to set_external_stream_data_completed. (#23583) by Andrew Woloszyn · 11 days ago
- 6423b51 Fix Vulkan driver crash from UNIMPLEMENTED `query_capabilities`. (#23582) by Ben Vanik · 12 days ago
- eacda0d Add `iree_hal_device_group_t` to own device topology lifecycle (#23576) by Ben Vanik · 12 days ago
- a2c8b6b [NFC] Move flags from iree/base/internal to iree/base/tooling. (#23578) by Ben Vanik · 12 days ago
- ccc5adf Integrate llvm/llvm-project@ffd341d670 (#23567) by Kunwar Grover · 12 days ago
- efadc51 [LLVMGPU] Remove fold unit extent dims from vector distribute (#22906) by Kunwar Grover · 12 days ago
- fd3db2d Clean up RISC-V toolchain files (#23457) by Han-Kuan Chen · 12 days ago
- 0dca45a [Async] Fix multishot CTS test flakes: use blocking waits for completions (#23577) by Ben Vanik · 12 days ago
- e0b5919 Add device topology infrastructure to HAL (#23573) by Ben Vanik · 12 days ago
- 2d25310 [Async] Fix macOS CTS test flakes: dangling stack ops, RST detection, kqueue event loss (#23570) by Ben Vanik · 12 days ago
- 51a03bc [CI] Disable internal linkage clang-tidy checks (#23569) by Jakub Kuderski · 13 days ago