1. 2dffc9e Fix MSVC compilation following #18682 (#18830) by Benoit Jacob · 5 months ago
  2. 4f33005 Skip ROCM/test/opt_pass_plugin on Windows while broken. (#18823) by Scott Todd · 5 months ago
  3. 0c6a151 Warn when --iree-llvmcpu-target-cpu defaults to "generic". (#18682) by Benoit Jacob · 5 months ago
  4. 1500641 Various tweaks to numeric optimizations found while looking at programs. (#18765) by Stella Laurenzo · 5 months ago
  5. a488d38 Add region to `linalg_ext.attention` (#18728) by Rob Suderman · 5 months ago
  6. 8568efa [GPU] Adding support for opt pass plugins during AMDGPU executable serialization (#18347) by Jose Manuel Monsalve Diaz · 5 months ago
  7. a3d8ad6 [ROCM] Fix feature flags for gfx1100 and improve flag handling (#18781) by Kunwar Grover · 5 months ago
  8. 7622770 Integrate LLVM @ 7900daaa7ba57b5f9729bbbdb54f4e0599a45cd7 (#18773) by Vivian · 5 months ago
  9. 164a60e [ROCM] Disable mixed precision fma instructions that cause numeric issues (#18753) by Nirvedh Meshram · 5 months ago
  10. 5270093 Add an integer divisibility analysis. (#18727) by Stella Laurenzo · 5 months ago
  11. 65158ac Rework util.assume.* ops to util.assume.int and base on attributes. (#18703) by Stella Laurenzo · 5 months ago
  12. 7a2705d Bump stablehlo to `f7f8e4e35` and drop LLVM local reverts (#18668) by Benoit Jacob · 6 months ago
  13. 462ecb6 [torch] Materialize all derivable bounds and divisor information in the IR. (#18646) by Stella Laurenzo · 6 months ago
  14. a7d84f9 [ROCm] Fix known target info for MI300A (#18648) by Jakub Kuderski · 6 months ago
  15. 20a7638 [ROCm] Always require `--iree-hip-target` (#18645) by Jakub Kuderski · 6 months ago
  16. eef4623 [LLVMGPU][ROCm] Move kernel annotation before serialization (#18573) by Jakub Kuderski · 6 months ago
  17. 9ee061d [LinalgExt] Masked Attention Implementation (#18525) by rohan-tan-bhowmik · 6 months ago
  18. 0f15c8d [LLVMGPU][ROCm] Add validation on finalized llvm bitcode (#18552) by Jakub Kuderski · 6 months ago
  19. 7d823d2 [torch] Add dynamic support for `tm_tensor.attention` (#18527) by Rob Suderman · 6 months ago
  20. 861695b Integrates/llvm 20240910 (#18480) by Nirvedh Meshram · 6 months ago
  21. 767e288 Undo revert of https://github.com/llvm/llvm-project/pull/104668 (#18451) by MaheshRavishankar · 7 months ago
  22. 705ccce Change lowering to make `AttentionOp`'s outermost iterators parallel (#18047) by Ian Wood · 7 months ago
  23. 1acbedc Bumping HAL API version to 0.5. by Ben Vanik · 7 months ago
  24. e2a2b2b Removing descriptor set layouts from HAL IR and simplifying bindings. by Ben Vanik · 7 months ago
  25. 758ef19 Dropping WGSLReplacePushConstantsPass. by Ben Vanik · 7 months ago
  26. e044271 Converting Metal target to support executable-create2. by Ben Vanik · 7 months ago
  27. 9bbc926 Removing legacy pipeline layout and dispatch binding model. by Ben Vanik · 7 months ago
  28. 387b772 Converting Vulkan target to support executable-create2. by Ben Vanik · 7 months ago
  29. 21946fe Converting local CPU target to support executable-create2. by Ben Vanik · 7 months ago
  30. f51d4ea Converting CUDA target to support executable-create2. by Ben Vanik · 7 months ago
  31. 894dfbe Converting HIP target to support executable-create2. by Ben Vanik · 7 months ago
  32. f43f59b Renaming [spirv|wgsl]_executable_def to [vulkan|webgpu]. by Ben Vanik · 7 months ago
  33. 4349df7 Renaming rocm executable -> hip executable. by Ben Vanik · 7 months ago
  34. 5c06d4b Factoring out common debug info from GPU executable flatbuffers. by Ben Vanik · 7 months ago
  35. cd5037e Fixing ROCM/CUDA BlockSizeDef -> BlockSize. (tables have Def, structs don't) by Ben Vanik · 7 months ago
  36. c90a885 Fixing some incorrect TODOs referencing #18154. by Ben Vanik · 7 months ago
  37. 7050033 [Codegen][GPU] Add support for WMMA_I32_16x16x16_I8 (#18372) by Quinn Dawkins · 7 months ago
  38. 10aa470 [LLVMGPU][ROCm] Disable kernarg preloading on pre-CDNA3 targets (#18343) by Jakub Kuderski · 7 months ago
  39. bd78854 [torch] Support `torch.aten.view.dtype` conversion to `flow` (#18346) by Rob Suderman · 7 months ago
  40. c44d29b [compiler] Make cuda/hip/vulkan target cl options consistent (#17710) by Lei Zhang · 7 months ago
  41. cea581f Move LinalgQuantized* passes to GlobalOptimization (#18287) by Quinn Dawkins · 7 months ago
  42. 551cd54 [TOSA] Switch to tablegen pass generation (#18227) by Marius Brehler · 7 months ago
  43. 878a99b [torch] Switch to tablegen pass generation (#18226) by Marius Brehler · 7 months ago
  44. 300af39 [codegen] Add max_workgroup_counts to TargetWgpAttr (#17771) by Krzysztof Drewniak · 7 months ago
  45. 8dc6820 Adding simplified HAL dispatch methods. (#18189) by Ben Vanik · 7 months ago
  46. 50f18f1 [NFC][Encoding] Outline encodings in lit tests (#18165) by Max191 · 7 months ago
  47. df3d588 Erase shape_assertion ops (#18167) by Jacques Pienaar · 7 months ago
  48. 643f719 Add canonicalization pass for torch import (#18150) by Rob Suderman · 8 months ago
  49. e9e24f8 [GPU] Follow the official naming convention for WMMA attributes. (#18147) by Han-Chung Wang · 8 months ago
  50. de679c9 Creating reusable command buffers in stream->hal lowering. (#18100) by Ben Vanik · 8 months ago
  51. 82012e6 [GPU][NFC] Follow the official convention to define mfma/wmma attributes (#18127) by Han-Chung Wang · 8 months ago
  52. e9ee5fa [VectorExt] Move VectorExt from llvm-external-projects to Codegen/Dialect (#18082) by Kunwar Grover · 8 months ago
  53. 6c45bef [runtime][HIP] Retire ROCm HAL backend (#17029) by Nithin Meganathan · 8 months ago
  54. 9aaae34 [GlobalOpt] Transition SetEncoding to use round_dims_to and stop creating tensor.pad (#17931) by Max191 · 8 months ago
  55. e900692 Updating various tests to the latest changes. by Ben Vanik · 8 months ago
  56. 866c0c0 Changing stream conversion to use a value/op affinity analysis. by Ben Vanik · 10 months ago
  57. c05323f New AssignTargetDevices pass to replace the legacy one. by Ben Vanik · 10 months ago
  58. 31074e7 Adding iree.abi.affinity arg/result attrs on the native ABI. by Ben Vanik · 10 months ago
  59. 7e35749 Wiring up AssignTargetDevices and associated passes. by Ben Vanik · 1 year ago
  60. ddad4ec Adding #hal.device.select and related attributes. by Ben Vanik · 1 year, 1 month ago
  61. 437fbe2 Adding an IntegerSet utility and making PackConstants use it. (#18013) by Ben Vanik · 8 months ago
  62. ae00c4f Revert "Revert "[LLVMGPU][ROCm] Add MFMA_F32_16x16x4_F32 instruction"… (#17921) by Prashant Kumar · 8 months ago
  63. 3e3d9da Integrate llvm-project @97c0dbe1ad6dacbcca84e63e9d726b85b65af4fe (#17946) by Avinash Sharma · 8 months ago
  64. 76cad82 [LinalgExt] Retire `LinalgExt::ReverseOp` (#17866) by lialan · 8 months ago
  65. 37a3db2 Integrate llvm-project @9372a3b70cf3969dac2d1a14cf41358205944e60 (#17926) by Max191 · 8 months ago
  66. 30e2c20 Integrate llvm-project @266a5a9cb9daa96c1eeaebc18e10f5a37d638734 (#17911) by Avinash Sharma · 8 months ago
  67. 781be38 Add torch-fuse-quantized-ops pass to the torch-to-iree pipeline (#17908) by zjgarvey · 8 months ago
  68. 6a82eb5 Add F8_16x16x32_F32 support for MFMA (#17792) by Stanley Winata · 8 months ago
  69. 02c2000 Revert "[LLVMGPU][ROCm] Add MFMA_F32_16x16x4_F32 instruction" (#17894) by Scott Todd · 8 months ago
  70. d65c6d4 [LLVMGPU][ROCm] Add MFMA_F32_16x16x4_F32 instruction (#17847) by Prashant Kumar · 8 months ago
  71. 9d2d766 [LinalgExt] Adding IndexingMaps to linalg_ext.attentionOp (#17864) by Stanley Winata · 8 months ago
  72. 429aafd [Codegen] Improve ROCm-specific LLVM translations (#17742) by Krzysztof Drewniak · 8 months ago
  73. dd7d4a1 Integrate llvm-project @31015240d366e4bf6f114856caa6e9ce90742b7f (#17799) by Quinn Dawkins · 9 months ago
  74. 4cdc8e4 [rocm] Add --iree-rocm-legacy-sync flag (default true). (#17786) by Stella Laurenzo · 9 months ago
  75. dcba7c5 [LLVMGPU][ROCm] Plumb through i8, i8 -> i32 MFMA intrinsics (#17764) by Jakub Kuderski · 9 months ago
  76. 0d2c780 Ensure IREE GPU dialect is registered for all GPU targets (fixes #17736) (#17737) by Andrea 🦈 · 9 months ago
  77. 7b58c71 Integrates/llvm 20240621 (#17723) by Nirvedh Meshram · 9 months ago
  78. ac418d1 Integrate llvm/llvm-project@27ac46e6bea2 (#17662) by Lei Zhang · 9 months ago
  79. 90f29a6 Reland "[spirv] Switch to use common target description" (#17699) by Lei Zhang · 9 months ago
  80. d792d24 Revert "[spirv] Switch to use common target description" (#17698) by Scott Todd · 9 months ago
  81. 7b9fb12 [spirv] Switch to use common target description (#17623) by Lei Zhang · 9 months ago
  82. 3428231 [LLVMCPU] Populate ArmSVE to LLVM conversion patterns (#17665) by Benjamin Maxwell · 9 months ago
  83. 5404ad7 Add versioned, automatically installed 'buildifier' pre-commit hook. (#17589) by Scott Todd · 10 months ago
  84. 6291224 Bump llvm-project@534590144f7c7ec34b8e5e95aba3e4f214b074eb (#17572) by Rob Suderman · 10 months ago
  85. 62efaee Format files across the project using pre-commit. (#17534) by Scott Todd · 10 months ago
  86. 5b243a8 [Backend][ROCM] Add gfx1150 support. (#17508) by Stanley Winata · 10 months ago
  87. 006af5d [GPU] Support specifying LLVMGPU backend target features (#17451) by Lei Zhang · 10 months ago
  88. f6a38ac [GPU] Thread through a common target description (#17217) by Lei Zhang · 10 months ago
  89. dc61fcc Register ShapeDialect in StableHLO plugin. (#17444) by Scott Todd · 10 months ago
  90. 4f8ee51 Moving demotion/promotion passes to input conversion. (#17422) by Ben Vanik · 10 months ago
  91. b4fc0b4 Implementing the f64 VM extension and flipping the flag by default. (#17416) by Ben Vanik · 10 months ago
  92. b716704 Update git-clang-format ref and clang-format version. (#16792) by Scott Todd · 10 months ago
  93. 9f0282b Fixes double-free in ReorderBroadcastInDimOpAndElementwiseOp. (#17394) by Ben Vanik · 10 months ago
  94. a78cee1 Add support for serializing the textual representation of LLVM IR. (#17193) by Phoebe Chen · 10 months ago
  95. 4f27e64 Generalize overriding llvm func attr flags in translation info (#17365) by Kunwar Grover · 10 months ago
  96. d2dd9e2 Replacing hal.tensor.export storage for hal.tensor.alias. (#17339) by Ben Vanik · 10 months ago
  97. a3b7e12 Integrate both llvm-project@2083e97e (+1 :leftwards_arrow_with_hook:, +1 :cherries:) and torch-mlir@bce800a3 (#17330) by Benoit Jacob · 10 months ago
  98. 035da66 [NFC] Fixing stray space and unneeded modules in some lit tests. (#17338) by Ben Vanik · 10 months ago
  99. 8f6ecc5 Hide ExecutableVariantOp from TargetBackend pipeline factory methods. (#17255) by Ben Vanik · 11 months ago
  100. 01cdb0c [CodeGen] Clean up header includes and build dependencies (#17209) by Lei Zhang · 11 months ago