1. 206b60c [DispatchCreation] Extend multi-use producer fusion (#18551) by Ian Wood · 6 months ago
  2. c6056d1 [docs] Update and harmonize guids for deployment (#18762) by Marius Brehler · 6 months ago
  3. f9fa934 Integrates LLVM @ a758bcdbd92efb64a3482eb95d2769d74e33f5bb (#18783) by Vivian · 6 months ago
  4. 2945399 Revert `tensor.cast` to flow reshape conversion (#18772) by Ian Wood · 6 months ago
  5. 9f93073 [CMake] Don't update compile definitions for imported targets for MSCV (#18766) by Maksim Levental · 6 months ago
  6. 6295784 Bump torch-mlir to 45bb17e (#18782) by zjgarvey · 6 months ago
  7. 3ccd4f1 [Codegen] Replace LICM with a version that checks trip count (#18679) by Quinn Dawkins · 6 months ago
  8. a3d8ad6 [ROCM] Fix feature flags for gfx1100 and improve flag handling (#18781) by Kunwar Grover · 6 months ago
  9. afe18d2 [Codegen][GPU] Add tiling cleanup pattern to fuse pad without zero gaurd (#18748) by Kunwar Grover · 6 months ago
  10. 7622770 Integrate LLVM @ 7900daaa7ba57b5f9729bbbdb54f4e0599a45cd7 (#18773) by Vivian · 6 months ago
  11. ab88871 Add a default lowering config setting for custom_op. (#18737) by MaheshRavishankar · 6 months ago
  12. 6d8f0b2 [LLVMGPU] Add configuration tests for IGEMM, fix NCHW case (#18734) by Max191 · 6 months ago
  13. 2eddaea [IGEMM] Generate matmuls with expanded H and W dims (#18735) by Max191 · 6 months ago
  14. c3d74d0 [Flow] Fold flow reshape with mismatching dyn dims (#18680) by Ian Wood · 6 months ago
  15. 3e52344 [LLVMCPU] Enable tileDispatchUsingForall for multiTilingExpert (#18730) by Prashant Kumar · 6 months ago
  16. 1e6bbb8 [LinalgExt] Remove default implementation for getStaticLoopRanges (#18745) by Kunwar Grover · 6 months ago
  17. d7378bb [Codegen][GPU] Add pass for fallback distribution (#18726) by Quinn Dawkins · 6 months ago
  18. 1f3382d Bump torch-mlir to ab62f35373c3944b68e564214fd04fff39dd92fc (#18763) by saienduri · 6 months ago
  19. 1e155cc Use integer range and divisibility analysis to propagate int assumptions into dispatch executables. (#18755) by Stella Laurenzo · 6 months ago
  20. 0c69c9e Adding `iree_vm_context_fork` to fork a context. (#18751) by Ben Vanik · 6 months ago
  21. 25e5760 [Codegen] Remove unused arguements from attention op (#18743) by Kunwar Grover · 6 months ago
  22. 726112d [Codegen] Remove memref optimizations from OptimizeTensorInsertExtractSlices (#18732) by Kunwar Grover · 6 months ago
  23. 660f388 Enable analysis based integer optimizations. (#18756) by Stella Laurenzo · 6 months ago
  24. 90a2e1c [NFC][GPU] remove some dead code in setTileAndFuseLoweringConfig (#18754) by Nirvedh Meshram · 6 months ago
  25. 164a60e [ROCM] Disable mixed precision fma instructions that cause numeric issues (#18753) by Nirvedh Meshram · 6 months ago
  26. 9aa11e5 Bump torch-mlir to 2665ed343b19713ba5c1c555b2366a93de8b9d2b (#18750) by saienduri · 6 months ago
  27. ea52c36 Adding `IREE_STATUS_INCOMPATIBLE`. (#18752) by Ben Vanik · 6 months ago
  28. dd3f2a3 [Codegen] Remove old attention transformations (#18740) by Kunwar Grover · 6 months ago
  29. e50eda3 Integrate LLVM at f8b7a653 by Alan Li · 6 months ago
  30. fa670d6 [docs] Further improve Python API docs (#18667) by Marius Brehler · 6 months ago
  31. 5f3f863 Add `AggregatedOpInterface` to `iree_linalg_ext.custom_op` (#18700) by MaheshRavishankar · 6 months ago
  32. 598a60e [Im2col] Allow multiple batch, M, and K dimensions on im2col result (#18593) by Max191 · 6 months ago
  33. ce4f098 [Codegen] Use constant int matchers for interface reshape foldings (#18733) by Max191 · 6 months ago
  34. d019851 Bump torch-mlir to 94f54109134506005052632af96944ca24068f72. (#18715) by saienduri · 6 months ago
  35. 01b9521 Add `TilingInterface` implementation of `iree_linalg_ext.custom_op`. (#18676) by MaheshRavishankar · 6 months ago
  36. 2963600 Integrate LLVM at f59b151f by Alan Li · 6 months ago
  37. eb15493 e2e matmul test improvements (#18725) by Benoit Jacob · 6 months ago
  38. 5270093 Add an integer divisibility analysis. (#18727) by Stella Laurenzo · 6 months ago
  39. 1b719b3 [Codegen][GPU] Do not promote fill matmul operands (#18722) by Quinn Dawkins · 6 months ago
  40. 0e16a89 [Codegen][GPU] Disable consumer fusion for multi use cases (#18723) by Quinn Dawkins · 6 months ago
  41. 0f28d44 Use upstream dataflow tooling to build an arithmetic opt pass. (#18702) by Stella Laurenzo · 6 months ago
  42. 0889d13 Add the support to yield multiple results (#18717) by Prashant Kumar · 6 months ago
  43. 7fb28e0 [Codegen][GPU] Fix delinearized index order in forall resolution (#18724) by Quinn Dawkins · 6 months ago
  44. 4636257 [Encoding][NFC] Moving Encoding attr/enum to Encoding[Types|Attrs].* (#18711) by Han-Chung Wang · 6 months ago
  45. e8ff07e [Codegen][GPU] Stop using redundant transfer hoisting pattern on memrefs (#18721) by Quinn Dawkins · 6 months ago
  46. 6c095ed Revert "Revert "[CPU] Enable tileDispatchUsingForall for mmt4d and co… (#18718) by Prashant Kumar · 6 months ago
  47. 8f3f1e2 [docs] Remove call to `t()` in external param example. (#18706) by Vinayak Dev · 6 months ago
  48. e8ab5f8 Integrate LLVM at 634c57d7 by Alan Li · 6 months ago
  49. f6a9b6b [Codegen][GPU] Enable destination fusion for unit trip loops (#18674) by Quinn Dawkins · 6 months ago
  50. ad68964 [LLVMGPU] Pad to intrinsic shape in LLVMGPUPadAndVectorDistribute pipeline (#18632) by Kunwar Grover · 6 months ago
  51. 6001f9c Fix distribution logic when number of parallel loops is greater than 3 (#18714) by MaheshRavishankar · 6 months ago
  52. 5b0680d [Codegen] Add option to disable copy vectorization (#18673) by Quinn Dawkins · 6 months ago
  53. cc3b28f [Codegen][GPU] Improve loop fusion pattern verification (#18671) by Quinn Dawkins · 6 months ago
  54. 88cb0ab Add @bjacob and myself to Encoding dialect owners. (#18712) by Han-Chung Wang · 6 months ago
  55. 62f8d7e Revert "[CPU] Enable tileDispatchUsingForall for mmt4d and convolution pipelines. " (#18707) by Han-Chung Wang · 6 months ago
  56. 8c9f2cb Integrate with LLVM 20241007 by Alan Li · 6 months ago
  57. 691aa4f Bump llvm/torch-mlir@e9ed4af9ced23c201 (#18699) by Rob Suderman · 6 months ago
  58. 23b63cd [GPU][DT] Add e2e matmul tests for GPU data tiling (#18627) by Max191 · 6 months ago
  59. 65158ac Rework util.assume.* ops to util.assume.int and base on attributes. (#18703) by Stella Laurenzo · 6 months ago
  60. caacf6c Honor narrow dims in GPUMaterializeEncoding (#18622) by Benoit Jacob · 7 months ago
  61. bb5f2f5 [CPU] Enable tileDispatchUsingForall for mmt4d and convolution pipelines. (#18618) by Prashant Kumar · 7 months ago
  62. 126e334 [CPU][NFC] Collapsing methods that query a config is enabled. (#18686) by Han-Chung Wang · 7 months ago
  63. 4710c98 Integrate LLVM at 208f42fd (#18696) by Benoit Jacob · 7 months ago
  64. b89ba05 [CPU] Switching to linalg::LinalgOp for MaterializeContractionOp pattern (#18690) by Han-Chung Wang · 7 months ago
  65. 3801a5d [GPU][DT] Add support for materializing gemm with batch dimensions. (#18693) by Han-Chung Wang · 7 months ago
  66. 58dac26 [Encoding] Introduce getElementTypesArray helper in EncodingAttr. (#18692) by Han-Chung Wang · 7 months ago
  67. f7a02a0 [DT] Use createOrFold to simplify the logic of creating tensor.cast op. (#18691) by Han-Chung Wang · 7 months ago
  68. 067ba0e [GPU][NFC] Updates comments/style/TODO/debug_message for GPU data-tiling (#18688) by Han-Chung Wang · 7 months ago
  69. 79e979f Legalize newly added compiler hints ops. (#18689) by Stella Laurenzo · 7 months ago
  70. a9f5628 Limit transposition of narrow-N to only CPUMaterializeEncoding (#18684) by Benoit Jacob · 7 months ago
  71. c7a9af7 Integrate Stablehlo at d40285e (#18685) by Benoit Jacob · 7 months ago
  72. f39884d Cherry-pick stablehlo build fix for MSVC. (#18683) by Scott Todd · 7 months ago
  73. 747081a [Codegen][GPU] Force linalg ops to always use the same vector size (#18654) by Quinn Dawkins · 7 months ago
  74. 0e8a573 Adding vm.cast.f32.si64 and vm.cast.f32.ui64 ops (#18642) by Nirvedh Meshram · 7 months ago
  75. a6043e2 Adding a null HAL driver. (#18675) by Ben Vanik · 7 months ago
  76. 0bbc432 [VectorDistribution]Add distribution pattern and test mlir file for vector.gather (#18660) by Bangtian Liu · 7 months ago
  77. 88153eb [Codegen][GPU] Fuse into destinations for parallel tiling (#18666) by Quinn Dawkins · 7 months ago
  78. 24ee841 [Codegen][GPU] Fix alloc creation for dynamic outputs in loop fusion (#18665) by Quinn Dawkins · 7 months ago
  79. 42ca044 Integrate LLVM at e8137503 (#18678) by Benoit Jacob · 7 months ago
  80. 718b4fd [docs][pytorch] Add examples for compiling with external weights. (#18658) by Vinayak Dev · 7 months ago
  81. 206c1f2 [Codegen] Allow vectorizing linalg.copy ops on memrefs (#18672) by Quinn Dawkins · 7 months ago
  82. 903ab0a Integrate LLVM at 9fa55ec3 (#18670) by Benoit Jacob · 7 months ago
  83. cd48b10 [NFC] Delete dead ops after cloning (#18669) by Ian Wood · 7 months ago
  84. 7a2705d Bump stablehlo to `f7f8e4e35` and drop LLVM local reverts (#18668) by Benoit Jacob · 7 months ago
  85. d341128 [ExternalInterfaces] Make fill non-hoistableLeafOp, hoist linalg init operands (#18634) by Max191 · 7 months ago
  86. 66c3397 [docs] Update Python API docs (#18662) by Marius Brehler · 7 months ago
  87. 916bb88 Stopped threads from holding a reference to themselves. (#18636) by Andrew Woloszyn · 7 months ago
  88. 84ac47b [LLVMGPU] Switch LLVMGPUVectorDistribute to use iree_gpu.lowering_config (#18651) by Kunwar Grover · 7 months ago
  89. 462ecb6 [torch] Materialize all derivable bounds and divisor information in the IR. (#18646) by Stella Laurenzo · 7 months ago
  90. 8de9856 [LinalgExt] Add Interfaces for implementing fusion support for `iree_linalg_ext.custom_op`. (#18647) by MaheshRavishankar · 7 months ago
  91. 451ef71 [Codegen] Add pass for unrolling annotated for loops (#18641) by Quinn Dawkins · 7 months ago
  92. 9c39a29 [Codegen][GPU] Fix forall hositing arg use in single trip loops (#18657) by Nirvedh Meshram · 7 months ago
  93. c86b621 Switch build_package to arm-hosted runner. (#18656) by Scott Todd · 7 months ago
  94. 0a71ea5 [Codegen][GPU] Add iree_gpu.multi_mma op to PartitionableLoopsInterface (#18653) by Max191 · 7 months ago
  95. e45c570 Switch linux_arm64_clang workflow to arm-hosted runner. (#18643) by Scott Todd · 7 months ago
  96. 839f7f6 [GPU] Use shared memory for data tiled multi_mma ops (#18625) by Max191 · 7 months ago
  97. 4d31d89 [LinalgExt] Avoid rank reduction in Im2Col lowering pattern (#18637) by Quinn Dawkins · 7 months ago
  98. 0d65b6e Integrate LLVM at `a86e966a` (#18644) by Benoit Jacob · 7 months ago
  99. a7d84f9 [ROCm] Fix known target info for MI300A (#18648) by Jakub Kuderski · 7 months ago
  100. 20a7638 [ROCm] Always require `--iree-hip-target` (#18645) by Jakub Kuderski · 7 months ago