1. 7404ce9 [SHM] Add huge page and NUMA placement support (#23697) by Ben Vanik · 2 hours ago main
  2. 487d2d3 Add MPSC queue. (#23700) by Ben Vanik · 2 hours ago
  3. 2604130 Async proactor fixes: TSAN bridge and progress callback starvation (#23699) by Ben Vanik · 3 hours ago
  4. de74bde [Base] Add status copy allocation and payload inspection APIs (#23698) by Ben Vanik · 3 hours ago
  5. b4af1cd Replace libc printf with eyalroz/printf and add streaming status formatting. (#23694) by Ben Vanik · 4 hours ago
  6. 69b8c51 [docs] Clarify bazelisk use on linux (#23691) by Jakub Kuderski · 7 hours ago
  7. ed0d4f4 [VM] Fix ref leak from incorrect MOVE bit on branch block args (#23689) by Jorn Tuyls · 8 hours ago
  8. bef4552 Fixes for UBSan compatibility across the runtime (#23692) by Ben Vanik · 2 days ago
  9. 41f14d3 Cherry-pick ODS double-space fix and update tests (#23690) by Jakub Kuderski · 2 days ago
  10. 3a4c991 Async and base infrastructure for cross-process shared memory. (#23688) by Ben Vanik · 2 days ago
  11. 16155fd [VectorExt] Add TransferScatterOp definition, verifier, lit tests. (#23666) by Keshav Vinayak Jha · 2 days ago
  12. 0314c27 Fix Android shm_test and skip coordinated test on RISC-V by Ben Vanik · 2 days ago
  13. a91a278 Fix CI failures on Android, RISC-V/QEMU, and GCC by Ben Vanik · 2 days ago
  14. 36d64f8 [Async] Fix POSIX proactor short write handling in socket sends by Ben Vanik · 2 days ago
  15. 1135e41 Fix clang-format in primitive_test.cc by Ben Vanik · 2 days ago
  16. 37e4cb6 [Async] Fix axis failure propagation to semaphores by Ben Vanik · 2 days ago
  17. 11b205b [Async] Fix send data lifetime during CQE callback processing by Ben Vanik · 2 days ago
  18. bc2a1e8 [Testing] Add self-launching coordinated test harness by Ben Vanik · 4 days ago
  19. 4cd002d [Async] Add primitive handle dup/close and tests by Ben Vanik · 4 days ago
  20. 57ac620 [Async] Add shared (cross-process) buffer pool by Ben Vanik · 5 days ago
  21. 1f58d51 [Async] Add carrier freelist pool to IOCP proactor by Ben Vanik · 5 days ago
  22. c8dc247 [Base] Add memory sealing for shared memory regions by Ben Vanik · 5 days ago
  23. 5958763 [Async] Use NtAssociateWaitCompletionPacket for IOCP event waits by Ben Vanik · 2 days ago
  24. cc210c2 [Async] Fix test_base.h for Windows and macOS compatibility by Ben Vanik · 2 days ago
  25. 7d77ecf [Async] Add proactor inline progress callbacks by Ben Vanik · 2 days ago
  26. daaa3e1 [Async] Use doubly-linked list for IOCP active_carriers by Ben Vanik · 6 days ago
  27. fefad4f [Async] Add tail fast-path for timer list insertion by Ben Vanik · 6 days ago
  28. ce282a8 [Async] Fix cross-process notification epoch handling in proactors by Ben Vanik · 2 days ago
  29. 2cef494 [Async] Add cross-process shared notification support by Ben Vanik · 12 days ago
  30. 5833a21 [Base] Add lock-free SPSC queue for shared memory transport by Ben Vanik · 12 days ago
  31. c02e48b [Base] Add platform-abstracted shared memory primitives by Ben Vanik · 12 days ago
  32. 07fb538 Integrate llvm/llvm-project@ab10f0848b22417afc6352051b6e26c7704084d2 (#23683) by RattataKing · 2 days ago latest-snapshot
  33. 314b4e6 [Codegen] Add PipelineAttrInterface and PassPipelineAttr (#23590) by Quinn Dawkins · 2 days ago
  34. d8b6df3 Integrate llvm/llvm-project@a631c3f4077cf84c90f9dfa98a9bdbc1e568f055 (#23670) by RattataKing · 3 days ago
  35. 7de2750 [NFC][Codegen] Move distribution pattern infrastructure to VectorExt (#23669) by Kunwar Grover · 3 days ago
  36. 80d70d6 [GlobalOpt] Support Img2Col Transformation for Conv2D Including Quantized Types (#23278) by Phoebe Chen · 3 days ago
  37. 205b17f [Tokenizer] Add tiktoken format loader for OpenAI BPE vocabularies. (#23663) by Ben Vanik · 3 days ago
  38. c114a89 [PCF] Add tensor.collapse_shape fusion into pcf.generic/loop (#23491) by Quinn Dawkins · 3 days ago
  39. a02e85f [CPU] Propagate the reduction tile sizes to producers because of fusion. (#23660) by Han-Chung Wang · 4 days ago
  40. 098465b Integrate llvm/llvm-project@03773c3b06b2a9a023f8bab892f237fb3b980439 (#23651) by RattataKing · 4 days ago
  41. 8df2db9 [VectorDistribution] Relax layout size constraint (#23625) by Lukas Sommer · 4 days ago
  42. 04784a9 Rewriting the HAL CTS to support bazel and scale better. (#23644) by Ben Vanik · 4 days ago
  43. 92bbb28 [Async] Gracefully degrade io_uring slab registration on RLIMIT_MEMLOCK (#23654) by Ben Vanik · 4 days ago
  44. 927c775 [DispatchCreation] Fix fusion of scalar reduction with consumer (#23659) by Ian Wood · 4 days ago
  45. a385a68 Add samples/hal/hello: pure HAL buffer fill, copy, and readback. (#23645) by Ben Vanik · 4 days ago
  46. e002a09 [GPUHeuristics] Prefer larger MMA intrinsics for very large compute-bound GEMMs (#23641) by Alan Li · 5 days ago
  47. a657d73 Bump stablehlo to d0484f6 (#23649) by RattataKing · 5 days ago
  48. 2df49e5 [test] Enable e2e attention tests for arm64 targets. (#23635) by Han-Chung Wang · 5 days ago
  49. 1ea0b66 [Codegen] Remove attention transpose intrinsic hacks (#23633) by Kunwar Grover · 5 days ago
  50. 46b54e8 Integrate llvm/llvm-project@386a3afa553fbf2854ee9d002e83bd456942ad39 (#23639) by RattataKing · 5 days ago
  51. ce92d23 [docs] Update python versions listed on the website (#23647) by Jakub Kuderski · 5 days ago
  52. 2284319 [GPU] Make GPUVectorAlloc allocate shared memory based on layout analysis (#23631) by Kunwar Grover · 5 days ago
  53. c7072e9 [Codegen] Fix layout analysis fixup crashes (#23630) by Kunwar Grover · 5 days ago
  54. f63db8d [GPUHeuristics] Add min-based tile distribution for imbalanced M/N problems (#23619) by Vivian Zhang · 5 days ago
  55. a1ef7f9 A couple of fixes picked up in the fusilli tests using sanitizers. (#23617) by Andrew Woloszyn · 5 days ago
  56. 776210b [Python] Stop building cp313 wheels on MacOS and Windows (#23640) by Jakub Kuderski · 5 days ago
  57. 2e4ef0f [Torch] Add flag to enable shape refinement (#23632) by Ian Wood · 5 days ago
  58. ae93b81 [CMake][Python] Auto-enable Stable ABI (abi3) for CPython 3.12+ with GIL (#23629) by Jakub Kuderski · 5 days ago
  59. 39805c6 [Python][CI] Fix release validation installing wrong package versions (#23634) by Jakub Kuderski · 5 days ago
  60. 5fcbdbc [VMVX] Enable scf.forall distribution for VMVX pipelines. (#23615) by Han-Chung Wang · 5 days ago
  61. 771eac5 Initial plumbing for inner_tiled on CPU with data-tiled MMA attribute. (#23494) by Benoit Jacob · 6 days ago
  62. f2a2972 [Python][CMake] Set py_limited_api on CMakeExtension for abi3 wheels (#23628) by Jakub Kuderski · 6 days ago
  63. 84f18ff [CMake][Python] Bump minimum CMake version to 3.26 (#23607) by Jakub Kuderski · 6 days ago
  64. 4e8dde4 [VectorDistribute] Refactor VectorLayoutAnalysis into 2-phase forward/backward design (#23611) by Kunwar Grover · 6 days ago
  65. 34f93d2 [VectorDistribute] Consider all compute ops for thread tile size (#23394) by Lukas Sommer · 6 days ago
  66. 9c4aa35 Integrate llvm/llvm-project@783eeb2ccfcac136f2191bda705ecd4a1a553718 (#23616) by RattataKing · 6 days ago
  67. e95e5d3 [SPIRV] Enable small float support in SPIR-V pipeline. (#23391) by Han-Chung Wang · 7 days ago
  68. 06fa699 [CPU] Drop scf.for distribution option from CPU pipeline. (#23614) by Han-Chung Wang · 7 days ago
  69. b8eef48 [CI] Add typos pre-commit hook and dictionary. (6/6) (#23606) by Jakub Kuderski · 7 days ago
  70. e889208 [LLVMGPU] Remove old dead code for LLVMGPUVectorDistribute (#23596) by Kunwar Grover · 7 days ago
  71. a21fa38 [Python] Add Stable ABI (abi3) support for Python 3.12+ (#23600) by Jakub Kuderski · 7 days ago
  72. fb7e890 Fix low-frequency typos in compiler (non-Codegen). NFC. (4/6) (#23604) by Jakub Kuderski · 7 days ago
  73. d66e44e Fix low-frequency typos in runtime, docs, and build tools. NFC. (5/6) (#23605) by Jakub Kuderski · 7 days ago
  74. afb0265 Fix low-frequency typos in compiler/Codegen. NFC. (3/6) (#23603) by Jakub Kuderski · 7 days ago
  75. fbf5541 Fix medium-frequency typos. NFC. (2/6) (#23602) by Jakub Kuderski · 7 days ago
  76. 566c36f Fix high-frequency typos. NFC. (1/6) (#23601) by Jakub Kuderski · 7 days ago
  77. c18957b [Codegen][GPU] Clean up prefetch pipeline stages flag to support integer values (#23568) by Zhewen Yu · 9 days ago
  78. 4ac7074 Fix the ordering of the external stream. (#23598) by Andrew Woloszyn · 10 days ago
  79. eb76100 [CI] Bump Python version from 3.9 to 3.10 in build_package.yml (#23599) by RattataKing · 10 days ago
  80. 1cf18c7 [LDS] Add fallback for CoalescedGatherDMA lowering. (#23560) by Alan Li · 10 days ago
  81. 9c1a851 [LLVMGPU] Use inner_tiled in LLVMGPUVectorDistribute (#23595) by Kunwar Grover · 10 days ago
  82. 1fe030b Add --exclude-libs=ALL to libIREECompiler.so shared library (#23574) by Aaron St George · 10 days ago
  83. b082090 [CPU] Expose more options to CPUCodegenOptions. (#23586) by Han-Chung Wang · 10 days ago
  84. a314ffe [Python] Bump minimum required Python version to 3.10 (#23591) by RattataKing · 10 days ago
  85. 37b1b8d [NFC][LLVMGPU] Remove unused ROCDLLowerExecutableTargetPass (#23593) by Quinn Dawkins · 10 days ago
  86. 731f21e [ROCM][DT] Add gfx950 f8e4m3fn ukernel (#23581) by Zhewen Yu · 10 days ago
  87. 0254eef Revert "Bump the github-actions group with 2 updates" (#23592) by Sambhav Jain · 10 days ago
  88. 3e6d203 Revert "Reordered calls to set_external_stream_data_completed." (#23589) by Andrew Woloszyn · 11 days ago
  89. e4531e6 [CODEGEN] Pattern to hoist expand_shape & collapse_shape from scf.for loop (#23572) by YashDeshpande25 · 11 days ago
  90. d25e1ac Reordered calls to set_external_stream_data_completed. (#23583) by Andrew Woloszyn · 11 days ago
  91. 6423b51 Fix Vulkan driver crash from UNIMPLEMENTED `query_capabilities`. (#23582) by Ben Vanik · 12 days ago
  92. eacda0d Add `iree_hal_device_group_t` to own device topology lifecycle (#23576) by Ben Vanik · 12 days ago
  93. a2c8b6b [NFC] Move flags from iree/base/internal to iree/base/tooling. (#23578) by Ben Vanik · 12 days ago
  94. ccc5adf Integrate llvm/llvm-project@ffd341d670 (#23567) by Kunwar Grover · 12 days ago
  95. efadc51 [LLVMGPU] Remove fold unit extent dims from vector distribute (#22906) by Kunwar Grover · 12 days ago
  96. fd3db2d Clean up RISC-V toolchain files (#23457) by Han-Kuan Chen · 12 days ago
  97. 0dca45a [Async] Fix multishot CTS test flakes: use blocking waits for completions (#23577) by Ben Vanik · 12 days ago
  98. e0b5919 Add device topology infrastructure to HAL (#23573) by Ben Vanik · 12 days ago
  99. 2d25310 [Async] Fix macOS CTS test flakes: dangling stack ops, RST detection, kqueue event loss (#23570) by Ben Vanik · 12 days ago
  100. 51a03bc [CI] Disable internal linkage clang-tidy checks (#23569) by Jakub Kuderski · 13 days ago