1. 794a3ca Update dotprod microbenchmark artifacts by mariecwhite · 9 months ago
  2. 8686edd [Im2col] Add decomposition pass for iree_linalg_ext.im2col (#17728) by Max191 · 9 months ago
  3. e07f317 Fix build_package.yml by giving uploads unique names. (#17746) by Scott Todd · 9 months ago
  4. 2401be2 [CPU] Tile outer parallel dims with 1 before lowering to ukernels. (#17731) by Han-Chung Wang · 9 months ago
  5. 9da0309 [LLVMGPU] Set prefetching on translation info (#17744) by Kunwar Grover · 9 months ago
  6. c62fc9d Add docs for Encoding, IREECodegen, and IREEGPU dialects. (#17743) by Scott Todd · 9 months ago
  7. ebc9f0f [LinalgExt] Add im2col tiling implementation (#17671) by Max191 · 9 months ago
  8. 71028b2 [Codegen] Teach Subset hoisting about insertion ops with loop invariant tensors (#17713) by Kunwar Grover · 9 months ago
  9. 36aec8a Change to mi250 runner for sdxl benchmarking. (#17741) by saienduri · 9 months ago
  10. ee65a67 Bump most GitHub Actions 'uses' to their latest tagged releases. (#17705) by Scott Todd · 9 months ago
  11. 450db0c Change `EncodingRole` to `IntegerAttr` (#17708) by lialan · 9 months ago
  12. 0d9e587 [NestedLayout] Make layout use strides instead of basis (#17284) by Kunwar Grover · 9 months ago
  13. 5fcdddb [GlobalOpt] Disable pack->expand_shape propagation. (#17739) by Han-Chung Wang · 9 months ago
  14. 22cf0b0 [Flow] Change the definition of "dequantization" recognizer. (#17711) by MaheshRavishankar · 9 months ago
  15. 0d2c780 Ensure IREE GPU dialect is registered for all GPU targets (fixes #17736) (#17737) by Andrea 🦈 · 9 months ago
  16. 1c53595 Add developer doc for model development debugging. (#17732) by Scott Todd · 9 months ago
  17. 247de36 [Python API] Fix python api for bytecode (#17343) by maxbartel · 9 months ago
  18. 1f69b85 [LinalgExt] Add iree_linalg_ext.im2col op and verifier (#17644) by Max191 · 9 months ago
  19. e41e71c [CPU] Limit the use of [8, 32, 16] gemm vector sizes to CPUs w/ avx512f feature (#17727) by Han-Chung Wang · 9 months ago
  20. fe571e4 [LLVMCPU][ArmSME] Rework how Arm streaming mode is set on dispatches (#17646) by Benjamin Maxwell · 9 months ago
  21. 024c48b Add check tests for more tensor dialect ops. (#17726) by Scott Todd · 9 months ago
  22. 9eb62c4 Fix cmake generators on Windows and enable in pre-commit. (#17619) by Scott Todd · 9 months ago
  23. 7b58c71 Integrates/llvm 20240621 (#17723) by Nirvedh Meshram · 9 months ago
  24. ac418d1 Integrate llvm/llvm-project@27ac46e6bea2 (#17662) by Lei Zhang · 9 months ago
  25. f427965 Add extra info to error message in transfer_read operation with element and thread count info (#17695) by RattataKing · 9 months ago
  26. 9fd55d2 [Codegen][GPU] Update greedy tile + fuse pipeline to generate mfma (#17617) by Quinn Dawkins · 9 months ago
  27. d01fb23 add indexing maps for `iree_linalg_ext.scatter`'s out operand (#17704) by Ian Wood · 9 months ago
  28. 643a7cd [Flow] move tensor lowerings out of FormDispatchWorkgroupsPass (#17282) by Ian Wood · 9 months ago
  29. 12d43e8 [Codegen][GPU] Allow serial tiling of online_attention op (#17702) by Kunwar Grover · 9 months ago
  30. 90f29a6 Reland "[spirv] Switch to use common target description" (#17699) by Lei Zhang · 9 months ago
  31. 7c41049 Fixing broken fill builtins that were double offsetting. (#17696) by Ben Vanik · 9 months ago
  32. 1997902 Bump actions/checkout to v4.1.7. (#17703) by Scott Todd · 9 months ago
  33. d792d24 Revert "[spirv] Switch to use common target description" (#17698) by Scott Todd · 9 months ago
  34. 7b9fb12 [spirv] Switch to use common target description (#17623) by Lei Zhang · 9 months ago
  35. 6f17869 Only set one narrow M/N at a time (#17647) by lialan · 9 months ago
  36. 3461314 Drop tile sizes specific to the ukernels-disabled case. (#17631) by lialan · 9 months ago
  37. 2b3c46c [GPUDistributionPatterns] Propagate predicate attribute for cmpf op (#17664) by Avinash Sharma · 9 months ago
  38. 1f954b2 [LLVMGPU] Generalize AMDGPUChainedMatmul pass to multiple dimensions (#17684) by Kunwar Grover · 9 months ago
  39. 3835c8b Run w7900 tests in serial instead of parallel. (#17686) by Scott Todd · 9 months ago
  40. a6c5ebf Remove attention transform dialect e2e tests (#17682) by Kunwar Grover · 9 months ago
  41. 7b782a8 [LinalgExt] Reland: Add online_attention op (#17681) by Kunwar Grover · 9 months ago
  42. eede9f2 Re-enable test_amd_w7900 CI job. (#17675) by Scott Todd · 9 months ago
  43. 42ac742 Revert "Skip tests failing on latest GitHub Actions runner image." (#17667) by Scott Todd · 9 months ago
  44. 1ea21d1 Fix hip dynamic_symbols_test to check min version. (#17674) by Scott Todd · 9 months ago
  45. c5d4b96 Allow flags to be set with greater flexibility (#17659) by Dave Liddell · 10 months ago
  46. 3428231 [LLVMCPU] Populate ArmSVE to LLVM conversion patterns (#17665) by Benjamin Maxwell · 10 months ago
  47. 045bf32 Change calculation of reassociation indicies in ConvertConvToChannelsLast.cpp (#17668) by Ian Wood · 10 months ago
  48. 5f07787 Drop @dcaballe from CODEOWNERS (#17672) by Diego Caballero · 10 months ago
  49. dc10693 Enable Workgroup Reordering Based on Translation Info Config Entries (#17645) by Bangtian Liu · 10 months ago
  50. eecd4b8 [doc] Add tip for stack trace symbols with Python and ASan. (#17666) by Scott Todd · 10 months ago
  51. b4321ea Update CI to include benchmarking changes in Test Suite. (#17655) by saienduri · 10 months ago
  52. 97fbe5f Update nvidia docker image. (#17661) by Jacques Pienaar · 10 months ago
  53. 2ff4102 Revert "[LinalgExt] Add online_attention op" (#17658) by Scott Todd · 10 months ago
  54. 71c07fa [CPU] Signal errors if there are large vectors. (#17620) by Han-Chung Wang · 10 months ago
  55. 0a561c4 [Codegen][GPU] Make operand promotion pattern work with generics (#17650) by Quinn Dawkins · 10 months ago
  56. abf0087 [LinalgExt] Add online_attention op (#17536) by Kunwar Grover · 10 months ago
  57. 52b21f8 [GPUHeuristic] Modify schedule generator to consider distribution of tranfer_read layout anchor (#17636) by Stanley Winata · 10 months ago
  58. c1e542d Change macos runner to regular (#17634) by Jacques Pienaar · 10 months ago
  59. 6e1d80a [Flow] Make the output indexing_map of elementwise ops identity. (#17583) by Ian Wood · 10 months ago
  60. db7974c [util] Add serialization support for `f64` resources (#17640) by Markus Böck · 10 months ago
  61. cda3ccb [GPU] Enable tensor.pack e2e tests for rocm backend. (#17587) by Han-Chung Wang · 10 months ago
  62. d7744b7 [Codegen][GPU] Loosen dim mapping restrictions on forall fusion (#17612) by Max191 · 10 months ago
  63. 8ab07d2 [Codegen][LLVMGPU][NFC] Cleanup contract distribution pattern for LayoutAttr (#17581) by Kunwar Grover · 10 months ago
  64. 363e088 [Vecdist][GPU] Distribute LayoutConflict to roundtrip to shared memory. (#17618) by Stanley Winata · 10 months ago
  65. a625a02 Add myself as CODEOWNER to a few directories (#17633) by Kunwar Grover · 10 months ago
  66. 088aef8 [LLVMGPU] Generalize VectorContractOpInfo based on indexing maps (#17625) by Kunwar Grover · 10 months ago
  67. 1943bc6 Remove AVX-512 tile sizes for non-ukernel case. (#17628) by lialan · 10 months ago
  68. cda9fd1 Rebuild docker images (#17568) by Jacques Pienaar · 10 months ago
  69. f4cfb55 Enable end-of-file-fixer and trailing-whitespace hooks. (#17630) by Scott Todd · 10 months ago
  70. f062b19 [LLVMGPU] Fix linear dim selection in GPUApplyTilingLevel (#17611) by Max191 · 10 months ago
  71. 2baf6c3 Remove pytype lint check. (#17551) by Scott Todd · 10 months ago
  72. d68a859 Optimize bazel_to_cmake and check_path_lengths pre-commit hooks. (#17599) by Scott Todd · 10 months ago
  73. 8aae294 Refresh some docs and samples from SHARK-Turbine to iree-turbine. (#17577) by Scott Todd · 10 months ago
  74. 51b4007 Add notes on pre-commit to contributing guide. (#17558) by Scott Todd · 10 months ago
  75. 59fbf3a Redirect submodules from shark-infra to iree-org. (#17614) by Scott Todd · 10 months ago
  76. 7fc1a67 Update Github runner to 2.317.0 (#17616) by Nancy Yuen · 10 months ago
  77. ae04c67 [Codegen][LLVMGPU] Add pass pipeline for greedy tile + fuse (#17559) by Quinn Dawkins · 10 months ago
  78. 3b5d269 Enable the `mmt4d` ukernel by default on `x86_64` and on `arm_64` (outside of SVE/SME). (#17502) by Benoit Jacob · 10 months ago
  79. 6d9475e Fixing iree_vm_ref_wrap_retain. (#17610) by Ben Vanik · 10 months ago
  80. 93fc09a Update Github runner to 2.316.1 (#17609) by Nancy Yuen · 10 months ago
  81. 2f89c7d Skip tests failing on latest GitHub Actions runner image. (#17595) by Scott Todd · 10 months ago
  82. 58feff3 [CPU] Add support for unpack ukernel preparation (#17498) by Prashant Kumar · 10 months ago
  83. 5404ad7 Add versioned, automatically installed 'buildifier' pre-commit hook. (#17589) by Scott Todd · 10 months ago
  84. 6a43e05 add fused op to linalgext annotation (#17474) by Ian Wood · 10 months ago
  85. 29472a1 [CPU] Reland "Data tiling: transpose narrow-N into narrow-M" (#17545) by lialan · 10 months ago
  86. 87fec7e Bump llvm-project@53ddc87454669c0d595c0e3d3174e35cdc4b0a61 (#17588) by Han-Chung Wang · 10 months ago
  87. 9a33952 [cuda][hip] Fix a resource leak when using deferred command buffers. (#17582) by Andrew Woloszyn · 10 months ago
  88. 5a44639 [CodeGen] Fix `in_bounds` attribute bug in tensor.extract_slice folding patterns. (#17563) by lialan · 10 months ago
  89. b44581a [LLVMGPU][ROCM][Layoutv1] Landing Implementation of WMMA on layoutV1 (#17580) by Stanley Winata · 10 months ago
  90. 9d60462 Bump torch-mlir to d59d0b6e5a88252d1d7e9b380e5488f49fadf87f (#17578) by jinchen · 10 months ago
  91. aef06ed [iree][global] Control the demotion of ops (#17515) by Prashant Kumar · 10 months ago
  92. 65bbc4b Update internal time library to allow user defined now function (#17576) by CindyLiu · 10 months ago
  93. c03d81d Enable magiclink extension for mkdocs, linkifying bare URLs. (#17575) by Scott Todd · 10 months ago
  94. 6291224 Bump llvm-project@534590144f7c7ec34b8e5e95aba3e4f214b074eb (#17572) by Rob Suderman · 10 months ago
  95. a5bd834 Fix conversion of pathlib.Path to str (#17573) by patosgui · 10 months ago
  96. bb7fc1f Document example CMake configurations and update options page. (#17569) by Scott Todd · 10 months ago
  97. 7388d75 [CPU][ArmSME] Enable transposes for f32 and f64 (#17440) by Cullen Rhodes · 10 months ago
  98. 0467f48 Re-enable a100 tests and benchmarks. (#17567) by Scott Todd · 10 months ago
  99. 3803de5 Ignore "build" dirs in check_path_lengths. (#17562) by Scott Todd · 10 months ago
  100. f9451d6 Bump llvm-project@45964eb9b88c46045e4e84beb4e2135cdeed6855 (#17554) by Rob Suderman · 10 months ago