  1. c285d58 Copy sample code into samples/dynamic_shapes/README.md. (#19699) by Scott Todd · 8 weeks ago
  2. 3c95042 Re-enable MI250 workflows. (#19705) by saienduri · 8 weeks ago
  3. 27e7a90 [DT][Encoding] Use layouts to calculate storage size when it is present. (#19686) by Han-Chung Wang · 8 weeks ago
  4. a953763 Temporarily Disable MI250 workflow due to machine outage (#19702) by Akansha Bansal · 8 weeks ago
  5. c320935 Bump dawidd6/action-download-artifact from 3.1.4 to 7 in the github-actions group (#19692) by dependabot[bot] · 8 weeks ago
  6. 6fd0fd0 [LinalgExt] Implement PartialReductionOpInterface for OnlineAttentionOp (#19684) by Kunwar Grover · 8 weeks ago
  7. 3c963dd Update PyTorch sample notebooks using latest iree-turbine code. (#19658) by Scott Todd · 8 weeks ago
  8. 01c9f14 [LLVMGPUVectorDistribute] Add support for inter-subgroup multi_reduction (#19596) by Manupa Karunaratne · 8 weeks ago
  9. 21b0101 [GPU] Disable prefetching for loops with no computation (#19695) by Nirvedh Meshram · 8 weeks ago
  10. 8d1d867 [GPU] Add thread tile size inference for scatter (#19694) by Quinn Dawkins · 8 weeks ago
  11. 158c636 Revert "Increase default threshold of TileLargeTensor pass (#19671)" (#19693) by Nirvedh Meshram · 9 weeks ago
  12. 3e34e03 Bump the github-actions group with 8 updates (#19689) by dependabot[bot] · 9 weeks ago
  13. 3978ce6 Increase default threshold of TileLargeTensor pass (#19671) by Nirvedh Meshram · 9 weeks ago
  14. 2452b22 [Codegen][GPU] Let integer range optimization narrow GPU computations to i32 (#19473) by Krzysztof Drewniak · 9 weeks ago
  15. 2b29155 Update GH actions with Dependabot (#19663) by Marius Brehler · 9 weeks ago
  16. 9b35412 Run on schedule in iree-org only (#19685) by Marius Brehler · 9 weeks ago
  17. d90c505 Reshape propagation to enable broadcast(transpose) -> attention(q, kt, vt) fusion. (#19661) by MaheshRavishankar · 9 weeks ago
  18. cac7a96 Update IREE test suite to use iree-org/iree-test-suites@c47d13c (#19617) by MaheshRavishankar · 9 weeks ago
  19. 40c19e3 Better support multidevice placement with `stream.async.barrier` (#19651) by Rob Suderman · 9 weeks ago
  20. 88d5f59 Update PkgCI test_amd to use MI300x conductor cluster (#19517) by yamiyysu · 9 weeks ago
  21. ae50c5e [DOCS] Update VectorExt::NestedLayoutAttr docs (#19246) by Manupa Karunaratne · 9 weeks ago
  22. 1441caa Enable macOS Tracy CI build. (#19668) by Scott Todd · 9 weeks ago
  23. a583b25 [GPU] Teach GPUApplyTilingLevel PartialReduction tiling (#19682) by Kunwar Grover · 9 weeks ago
  24. 9f93691 [LLVMGPU] Use LLVMGPUDistribute for small input scatters (#19670) by Quinn Dawkins · 9 weeks ago
  25. f7a2157 Remove Upcasting schedule from TileAndFuse (#19669) by Nirvedh Meshram · 9 weeks ago
  26. 039b8b4 Using tracy::GetQueue instead of the sketchy static variable reference. (#19653) by Ben Vanik · 9 weeks ago
  27. 1d91bec Supporting file descriptors in iree_io_stream_open. (#19665) by Ben Vanik · 9 weeks ago
  28. 106371d Bump torch-mlir to f92c587cb6150e73078f32cf847dc3892be16f93 (#19659) by jinchen · 9 weeks ago
  29. a88555c Add macOS workflow running on M1 (#19656) by Marius Brehler · 9 weeks ago
  30. e64cb12 Increase strictness of global isel use for ROCM (#19247) by Tres · 9 weeks ago
  31. 2aca091 [Codegen][Nearly NFC] Move PropagateDispatchSizeBounds to Common/ (#19650) by Krzysztof Drewniak · 9 weeks ago
  32. 6245db1 [Stream] Attach layouts to tensor ops in encoding specialization pass. (#19649) by Han-Chung Wang · 9 weeks ago
  33. c793f90 [i1] Implement `packed_storage` layout encoding attribute (#19354) by lialan · 9 weeks ago
  34. 801e2c1 Expand runtime_tracing job to include Windows and macOS. (#19655) by Scott Todd · 9 weeks ago
  35. 7d21c5d Revert (2nd) of "Propagate reshapes through generics with reduction" (#19647) by MaheshRavishankar · 9 weeks ago
  36. b3ff1ed Rename `unroll_{m,n,k}` to `intrinsics_{m,n,k}` (#19652) by Benoit Jacob · 9 weeks ago
  37. 6d6bd6e [runtime] Fix runtime tracing compile failure on gcc (#19642) by Ian Wood · 9 weeks ago
  38. bb1c561 Erase all address spaces and get inlined ukernels (#19646) by Benoit Jacob · 9 weeks ago
  39. a7bac5d [Flow] Fix dispatch naming for dynamic shaped fusions (#19439) by Quinn Dawkins · 9 weeks ago
  40. 9055c9d [hip] Fix race in the cleanup of queue read operations. (#19645) by Andrew Woloszyn · 9 weeks ago
  41. 82e37d6 Fix (cross) compiling for 32-bit targets (#19644) by Marius Brehler · 9 weeks ago
  42. 02d145e [Stream] Implement SpecializeEncodings pass (1/n) (#19502) by Han-Chung Wang · 9 weeks ago
  43. 74f8d3c [LinalgExt] Scatter fusion by expansion 3/3 (#19588) by Ian Wood · 9 weeks ago
  44. 2347d9f Supporting (and renaming) IREE_HAL_WHOLE_BUFFER in binding table resolve. (#19640) by Ben Vanik · 9 weeks ago
  45. 126f0ac Add docs for updating release git tags manually. (#19637) by Scott Todd · 9 weeks ago
  46. af416b3 Bump version to 3.2.0 after releasing 3.1.0. (#19638) by Scott Todd · 9 weeks ago
  47. c484058 [GPU] Add barriers when resolving GPUMappedForall to fix race condition (#19635) by Nirvedh Meshram · 9 weeks ago
  48. 9b4906e [DispatchCreation] Drop fusion restriction for stride != 1 conv (#19634) by Quinn Dawkins · 9 weeks ago
  49. c75b686 [GPU][Codegen] Allowing mfma for narrow problem config sizes (#19615) by Zhuoran Yin · 9 weeks ago
  50. 7b9aa28 When dumping intermediates, dump how to reproduce the `.optimized.ll` (#19633) by Benoit Jacob · 9 weeks ago
  51. be75a30 Update minor Python versions used to build packages (#19632) by Marius Brehler · 9 weeks ago
  52. fb21dd6 Adding experimental Tracy API for TLS-less event recording. (#19625) by Ben Vanik · 9 weeks ago
  53. a5c3879 Reapply "Propagate reshapes through generics with reduction… (#18968) by Ian Wood · 9 weeks ago
  54. 80cbf6b [GPU] Add a pass to convert accumulating GEMMs to GEMMs (#19587) by Nirvedh Meshram · 9 weeks ago
  55. 550d88e [GPU] Add lowering configuration logic for scatter (#19624) by Quinn Dawkins · 9 weeks ago
  56. 349026b Add explicit tolerances to SDXL benchmark test times. (#19628) by Scott Todd · 9 weeks ago
  57. 9a83239 [GPU] Add chained reshape support for scf.forall expand destination pattern (#19597) by Nirvedh Meshram · 9 weeks ago
  58. 7047cc3 Rollup of minor runtime fixes/cleanup from the AMDGPU branch. (#19621) by Ben Vanik · 9 weeks ago
  59. aa06523 [NFC] Comment fixes in iree_bitcode_library. by Ben Vanik · 9 weeks ago
  60. 66723e4 Cleaning up null HAL driver options. by Ben Vanik · 9 weeks ago
  61. 2199c1d Adding iree_arena_block_pool_preallocate. by Ben Vanik · 9 weeks ago
  62. ea462c8 Removing some IREE_RETURN_AND_END_ZONE_IF_ERROR usage that was ugly. by Ben Vanik · 9 weeks ago
  63. 4a04c0a Adding minor iree/base/ time, string view, and memory utilities. by Ben Vanik · 9 weeks ago
  64. a8f7a32 Adding iree_hal_queue_affinity_* utilities. by Ben Vanik · 9 weeks ago
  65. c9fb739 Fixing HAL driver CTS test to not assume numerical indices exist. by Ben Vanik · 9 weeks ago
  66. 1ccabe5 Adding COMPILER_TARGET_DEVICE to iree_hal_cts_test_suite. by Ben Vanik · 9 weeks ago
  67. d517661 [runtime][python] Add debug sink to bindings (#19013) by Boian Petkantchin · 9 weeks ago
  68. c97b084 Including the .kd symbol suffix in AMDGPU executables. by Ben Vanik · 9 weeks ago
  69. d224220 Bump LLVM to llvm/llvm-project@21edac25f09faee23015c6a69d95fcbda287efe2 (#19616) by MaheshRavishankar · 9 weeks ago
  70. b245e6b Delete test_models job using SHARK-TestSuite/iree_tests. (#19614) by Scott Todd · 10 weeks ago
  71. 1445cef Set MLIR_LINK_MLIR_DYLIB to not link shared libMLIR (#19613) by Marius Brehler · 10 weeks ago
  72. 340ffbb [LinalgExt] Drop the unit dims on scatter ops 2/3 (#19450) by Ian Wood · 10 weeks ago
  73. 0820f10 [hip] Don't join the status in dispatch_thread. (#19583) by Andrew Woloszyn · 10 weeks ago
  74. cdf24b9 [Dispatch] Two fixes for CollapseDimensionsPass (#19598) by Ian Wood · 10 weeks ago
  75. 763406f [Codegen][Tuner] skip linking based on the default entry point attribute (#19603) by Bangtian Liu · 10 weeks ago
  76. c992d29 [runtime][hip] Fix format errors and conflicting types. (#19607) by Han-Chung Wang · 10 weeks ago
  77. e7d4fec Bump to LLVM with nanobind reverts (#19605) by Marius Brehler · 10 weeks ago
  78. 71d6de7 [Flow] Improve DeduplicateExecutables bucketing (#19601) by Ian Wood · 10 weeks ago
  79. 6b64fca [LinalgExt] Fusion support for LinalgExt ScatterOp 1/3 (#19560) by Ian Wood · 10 weeks ago
  80. 9cb984f [Codegen] Fix use after erase bug in optimize-tensor-extract-slices (#19460) by Kunwar Grover · 10 weeks ago
  81. 89afd4e Add missing definition of `build` method in `Flow::CallOp` (#19552) by Twice · 10 weeks ago
  82. 0c6d267 Fix typo in serializeExecutable of CUDA target (#19609) by Twice · 10 weeks ago
  83. c7086cf Bump LLVM to llvm/llvm-project@9f5cefe (#19600) by MaheshRavishankar · 10 weeks ago
  84. b5272df [docs] Update copyright year to 2025 in the website footer. (#19599) by Scott Todd · 10 weeks ago
  85. 9df82fd [LinalgExt] Add support for fusing scatter with producers (#19584) by Quinn Dawkins · 10 weeks ago
  86. 5a97523 [GPU] Disable insert/extract slice lowering from pack/unpack by default (#19590) by Zhuoran Yin · 10 weeks ago
  87. 1e935c4 [PJRT] Fix ninja not found error while re-building the plugin (#19553) by Twice · 10 weeks ago
  88. c2d408f Update IREE third-party/benchmark for RISC-V Compatibility (#19538) by Phoebe Chen · 10 weeks ago
  89. e6ac016 [Codegen][Tuner] verifier for the default tuning spec (#19525) by Bangtian Liu · 2 months ago
  90. 26b24f2 Bump LLVM to llvm/llvm-project@cbff02b (#19589) by MaheshRavishankar · 2 months ago
  91. fc6c518 [docs] Fix diagram rendering on C API reference page. (#19586) by Scott Todd · 2 months ago
  92. 88848b4 [runtime][hip] Do not error when peered memory access already enabled (#19582) by Boian Petkantchin · 2 months ago
  93. c203e6b [LLVMGPU] Prefer non-scatter ops as the configuration root (#19581) by Quinn Dawkins · 2 months ago
  94. fa325c5 [infra] Add `--force` flag to `git submodule update` in script. (#19550) by Scott Todd · 2 months ago
  95. f27feff Update LLVM to llvm/llvm-project@ac8bb735 (#19566) by Stanley Winata · 2 months ago
  96. a43d893 [Dispatch] Disable scatter fusion with producers (#19565) by Ian Wood · 3 months ago
  97. d746a57 Remove revert for https://github.com/llvm/llvm-project/pull/120115 (#19567) by Kunwar Grover · 3 months ago
  98. f1e1866 Update LLVM to llvm/llvm-project@b13592219c421820b (#19554) by Stanley Winata · 3 months ago
  99. 1f19761 Enable peering among all devices on the system. (#19555) by Andrew Woloszyn · 3 months ago
  100. 0184eee [Codegen][RoCDL] Add patterns for lowering bit-width emulation operations to LLVM (#19551) by MaheshRavishankar · 3 months ago