  1. 1fc6e5b Add CDNA3 MFMA BF16 intrinsics. (#18892) by Benoit Jacob · 5 months ago
  2. 3b751a4 [LLVMCPU] Enable tileDispatchUsingForall as default (#18777) by Prashant Kumar · 5 months ago
  3. 8ce8bed Simplifications in e2e matmul tests (#18889) by Benoit Jacob · 5 months ago
  4. 225baf2 Add e2e tests for F8E5M2FNUZ and F8E4M3FNUZ data-tiled MFMA on CDNA3 (#18888) by Benoit Jacob · 5 months ago
  5. e3f2d47 Bump torch-mlir to 140cad5 and update TorchOnnxToTorch conversion pipeline (#18867) by Vivek Khandelwal · 5 months ago
  6. 0c6a151 Warn when --iree-llvmcpu-target-cpu defaults to "generic". (#18682) by Benoit Jacob · 5 months ago
  7. a488d38 Add region to `linalg_ext.attention` (#18728) by Rob Suderman · 5 months ago
  8. 2945399 Revert `tensor.cast` to flow reshape conversion (#18772) by Ian Wood · 5 months ago
  9. d019851 Bump torch-mlir to 94f54109134506005052632af96944ca24068f72. (#18715) by saienduri · 5 months ago
  10. eb15493 e2e matmul test improvements (#18725) by Benoit Jacob · 5 months ago
  11. 691aa4f Bump llvm/torch-mlir@e9ed4af9ced23c201 (#18699) by Rob Suderman · 5 months ago
  12. 23b63cd [GPU][DT] Add e2e matmul tests for GPU data tiling (#18627) by Max191 · 6 months ago
  13. d341128 [ExternalInterfaces] Make fill non-hoistableLeafOp, hoist linalg init operands (#18634) by Max191 · 6 months ago
  14. 84ac47b [LLVMGPU] Switch LLVMGPUVectorDistribute to use iree_gpu.lowering_config (#18651) by Kunwar Grover · 6 months ago
  15. 14728a7 Bump torch-mlir to 9938abf25e1e7526ca7f43a8c49e9078c14fc55c (#18615) by Vivek Khandelwal · 6 months ago
  16. 0b29f7b [GPU][DT] Add support for GPU data-tiling E2E tests. (#18591) by Han-Chung Wang · 6 months ago
  17. 9ee061d [LinalgExt] Masked Attention Implementation (#18525) by rohan-tan-bhowmik · 6 months ago
  18. 337d49c [LinalgExt] Use f32 for accumulation for online_attention (#18456) by Kunwar Grover · 6 months ago
  19. 27b0829 Bump llvm/llvm-project@030c6da7af826b641db005be925b20f956c3a6bb (#18512) by Rob Suderman · 6 months ago
  20. f013a69 Bump torch-mlir to 2960538 (#18417) by zjgarvey · 7 months ago
  21. 3ceffb6 Revert "Bump torch-mlir to 98e08023bbf71e00ab81e980eac9f7c96f1f24b4" (#18405) by jinchen · 7 months ago
  22. dfaf12c Bump torch-mlir to 98e08023bbf71e00ab81e980eac9f7c96f1f24b4 (#18388) by jinchen · 7 months ago
  23. 9f7b25e Move einsum_inner_prod test for rocm as well (#18379) by Chris Sosa · 7 months ago
  24. e2a2b2b Removing descriptor set layouts from HAL IR and simplifying bindings. by Ben Vanik · 7 months ago
  25. 758ef19 Dropping WGSLReplacePushConstantsPass. by Ben Vanik · 7 months ago
  26. 7050033 [Codegen][GPU] Add support for WMMA_I32_16x16x16_I8 (#18372) by Quinn Dawkins · 7 months ago
  27. 7e8691d Add HIP Backend test support for attention FP16 (#18333) by erman-gurses · 7 months ago
  28. d7e8bcc [ONNX Regression Suite] Move einsum_inner_prod test (#18357) by Nirvedh Meshram · 7 months ago
  29. d2e74cd Reapply "[Flow] Convert from tensor.cast to flow.tensor.reshape" (#18351) by Nirvedh Meshram · 7 months ago
  30. 7a7bfe1 [Flow] Move first part of Flow transforms to new pipeline (#18290) by Ian Wood · 7 months ago
  31. f8d8e60 Bump torch-mlir (#18345) by Rob Suderman · 7 months ago
  32. 588732c Re-land e2e attention test suite. (#18320) by erman-gurses · 7 months ago
  33. c44d29b [compiler] Make cuda/hip/vulkan target cl options consistent (#17710) by Lei Zhang · 7 months ago
  34. 0247962 Revert "Add e2e test suite for the Attention - CPU Backend" (#18302) by Scott Todd · 7 months ago
  35. 95d5562 [vulkan] Update default RDNA GPU subgroup size to 32 (#18207) by Nithin Meganathan · 7 months ago
  36. 2d629c6 Add e2e test suite for the Attention - CPU Backend (#17751) by erman-gurses · 7 months ago
  37. 9862e99 Switch ONNX ops test suite to use iree-org/iree-test-suites. (#18223) by Scott Todd · 7 months ago
  38. 9b05f17 Delete all in-tree benchmark infrastructure code. (#18144) by Scott Todd · 7 months ago
  39. df3d588 Erase shape_assertion ops (#18167) by Jacques Pienaar · 7 months ago
  40. e9e24f8 [GPU] Follow the official naming convention for WMMA attributes. (#18147) by Han-Chung Wang · 8 months ago
  41. 31bfc93 [GPU] Fix e2e matmul generator to extract the input element type. (#18140) by Han-Chung Wang · 8 months ago
  42. ca24b96 [GPU] Updates mfma/wmma attribute names in the matmul test generator. (#18134) by Han-Chung Wang · 8 months ago
  43. 2193406 Attaching pipeline layout to hal.interface.binding.subspan & co. (#18098) by Ben Vanik · 8 months ago
  44. f004bf1 Delete obsolete tests/microbenchmarks/ folder. (#18051) by Scott Todd · 8 months ago
  45. 6145b65 Simplify tests/e2e/tosa_ops. (#17850) by Scott Todd · 8 months ago
  46. 2e6dbfa Simplify tests/e2e/tensor_ops. (#17854) by Scott Todd · 8 months ago
  47. c05323f New AssignTargetDevices pass to replace the legacy one. by Ben Vanik · 10 months ago
  48. ae00c4f Revert "Revert "[LLVMGPU][ROCm] Add MFMA_F32_16x16x4_F32 instruction"… (#17921) by Prashant Kumar · 8 months ago
  49. 76cad82 [LinalgExt] Retire `LinalgExt::ReverseOp` (#17866) by lialan · 8 months ago
  50. 8b44f61 Enable Python bindings builds/tests in 'runtime' CI builds. (#15878) by Scott Todd · 8 months ago
  51. 30e2c20 Integrate llvm-project @266a5a9cb9daa96c1eeaebc18e10f5a37d638734 (#17911) by Avinash Sharma · 8 months ago
  52. 6a82eb5 Add F8_16x16x32_F32 support for MFMA (#17792) by Stanley Winata · 8 months ago
  53. 2ed3f92 Add nop pass to different backend. by Alan Li · 9 months ago
  54. 02c2000 Revert "[LLVMGPU][ROCm] Add MFMA_F32_16x16x4_F32 instruction" (#17894) by Scott Todd · 8 months ago
  55. d65c6d4 [LLVMGPU][ROCm] Add MFMA_F32_16x16x4_F32 instruction (#17847) by Prashant Kumar · 8 months ago
  56. 9d2d766 [LinalgExt] Adding IndexingMaps to linalg_ext.attentionOp (#17864) by Stanley Winata · 8 months ago
  57. 3b2c85b Drop unmaintained transform dialect tests (#17858) by Quinn Dawkins · 8 months ago
  58. 645c966 Disable failing Pixel6/Vulkan linalg_ext reverse test again. (#17861) by Scott Todd · 8 months ago
  59. e794ce8 Simplify tests/e2e/linalg_ext_ops. (#17856) by Scott Todd · 8 months ago
  60. d174e8b Disable failing Pixel6/Vulkan stablehlo_ops tests again. (#17851) by Scott Todd · 8 months ago
  61. 78c0051 Simplify tests/e2e/stablehlo_ops. (#17843) by Scott Todd · 8 months ago
  62. dcba7c5 [LLVMGPU][ROCm] Plumb through i8, i8 -> i32 MFMA intrinsics (#17764) by Jakub Kuderski · 9 months ago
  63. 695e193 [Codegen] Change CompilationInfoAttr to take a lowering config interface (#17752) by Quinn Dawkins · 9 months ago
  64. 794a3ca Update dotprod microbenchmark artifacts by mariecwhite · 9 months ago
  65. 024c48b Add check tests for more tensor dialect ops. (#17726) by Scott Todd · 9 months ago
  66. a6c5ebf Remove attention transform dialect e2e tests (#17682) by Kunwar Grover · 9 months ago
  67. 71c07fa [CPU] Signal errors if there are large vectors. (#17620) by Han-Chung Wang · 9 months ago
  68. cda3ccb [GPU] Enable tensor.pack e2e tests for rocm backend. (#17587) by Han-Chung Wang · 9 months ago
  69. 5404ad7 Add versioned, automatically installed 'buildifier' pre-commit hook. (#17589) by Scott Todd · 10 months ago
  70. 29472a1 [CPU] Reland "Data tiling: transpose narrow-N into narrow-M" (#17545) by lialan · 10 months ago
  71. 62efaee Format files across the project using pre-commit. (#17534) by Scott Todd · 10 months ago
  72. ab8f668 Revert "Data tiling: transpose narrow-N into narrow-M" (#17503) by Benoit Jacob · 10 months ago
  73. 16bdaa9 Data tiling: transpose narrow-N into narrow-M (#17446) by lialan · 10 months ago
  74. f7ca45d [ArmSME][test] Enable TransposeMatmulPass and peeling for e2e matmuls (#17452) by Benjamin Maxwell · 10 months ago
  75. 29d0ceb Enable a test suite for convolution + winograd. (#17447) by Han-Chung Wang · 10 months ago
  76. 4132d2e [runtime][hal][hip] Implement collectives via RCCL (#17270) by Boian Petkantchin · 10 months ago
  77. 4f8ee51 Moving demotion/promotion passes to input conversion. (#17422) by Ben Vanik · 10 months ago
  78. a3b7e12 Integrate both llvm-project@2083e97e (+1 :leftwards_arrow_with_hook:, +1 :cherries:) and torch-mlir@bce800a3 (#17330) by Benoit Jacob · 10 months ago
  79. 6a9b175 Add test_amd_w7900 CI job including ROCM matmul tests (#17298) by erman-gurses · 10 months ago
  80. 3ca0a49 Moving OutlineConstantsPass to flow and adding parameter support. (#17303) by Ben Vanik · 11 months ago
  81. 098b0c4 Add test_amd_mi250 CI job including ROCm matmul tests. (#17293) by erman-gurses · 11 months ago
  82. c1cfbfc Add WMMA to matmul test suite for LLVMGPUVectorDistribute (#17285) by Kunwar Grover · 11 months ago
  83. 71a9945 Adding `--iree-opt-import-parameters=` and cleaning up export. (#16828) by Ben Vanik · 11 months ago
  84. 7b144fc Cleanup: Drop import yaml in generate script (#17253) by Nancy Yuen · 11 months ago
  85. 313c4d7 [LLVMGPU] Remove redundant fields from mma_schedule (#17195) by Kunwar Grover · 11 months ago
  86. aa1769e Moving the LocalDevice impl out of LLVM-CPU/VMVX. by Ben Vanik · 11 months ago
  87. 3dde925 [VectorDistribution] Add distribution pattern for vector::MultiDimReductionOp (#17076) by Kunwar Grover · 11 months ago
  88. e8f4948 [DT] Teach encoding about padding. (#17077) by Han-Chung Wang · 11 months ago
  89. 0ec9166 Add f32_to_i2 and i2_to_f32 e2e tests. (#17074) by Han-Chung Wang · 11 months ago
  90. 3f51a55 Replace openxla/iree with iree-org/iree across the project. (#17110) by Scott Todd · 11 months ago
  91. 3d626b5 [Preprocessing] NFC: Finish migrating passes to use new tablegen (#17047) by Quinn Dawkins · 11 months ago
  92. 39091a7 [Flow] Switch to new pass generation tablegen definitions (#17046) by Quinn Dawkins · 11 months ago
  93. 080657f Fix failing transform dialect CUDA tests. (#17042) by MaheshRavishankar · 11 months ago
  94. c2abb93 Disable TD tests on CUDA backends due to failure. (#17041) by MaheshRavishankar · 11 months ago
  95. 55fafcf Forking dynamic behavior from flow.tensor.constant. (#17034) by Ben Vanik · 11 months ago
  96. 954cb36 Move Codegen pass pipelines to nest on `FunctionOpInterface`. (#16665) by MaheshRavishankar · 11 months ago
  97. 76e9cfe Allowing flow.tensor.constant to be used for constants. (#17024) by Ben Vanik · 11 months ago
  98. d42e457 Add e2e tests for FA2 (#16953) by erman-gurses · 12 months ago
  99. 3cce7fc Add the conv2d test generator (for NCHW-FCHW layout). (#16849) by Prashant Kumar · 12 months ago
  100. 2c88e49 [LLVMGPU] Wmma layout for LLVMGPU vector distribute pipeline (#16928) by Stanley Winata · 12 months ago