  1. 5b243a8 [Backend][ROCM] Add gfx1150 support. (#17508) by Stanley Winata · 10 months ago
  2. aa0bc40 [Codegen][GPU] Add pass to fuse and hoist scf.forall ops (#17505) by Quinn Dawkins · 10 months ago
  3. 29e70ab Update onnx package version minimum to 1.16.0. (#17504) by Scott Todd · 10 months ago
  4. a6a56a9 Add `LinalgFusionInterface` to support fusion for linalg_ext ops (added `scatter` and `reverse`) (#17428) by Ian Wood · 10 months ago
  5. 3d1364e [Codegen][GPU] Add pattern to lower iree_gpu.multi_mma to intrinsics (#17457) by Quinn Dawkins · 10 months ago
  6. ab8f668 Revert "Data tiling: transpose narrow-N into narrow-M" (#17503) by Benoit Jacob · 10 months ago
  7. e33ca89 [LinalgExt] Split TileAndDecomposeAttention (#17468) by Kunwar Grover · 10 months ago
  8. 322d688 [Codegen][GPU] Add pattern to drop lead unit dims of multi_mma ops (#17456) by Quinn Dawkins · 10 months ago
  9. 117cb43 Test 'console' provider in 'tracing' job. (#16454) by Scott Todd · 10 months ago
  10. 16bdaa9 Data tiling: transpose narrow-N into narrow-M (#17446) by lialan · 10 months ago
  11. 6c75aa1 [Codegen][GPU] Allow iree_gpu.tensor_barrier to take vectors (#17479) by Quinn Dawkins · 10 months ago
  12. 1750e2b Integrate LLVM at 855eef2abd81cb8c7543d4748353d5e378fdd4c2 (#17501) by Benoit Jacob · 10 months ago
  13. 051c361 NFC: Make a few loop transformations more accessible (#17489) by Quinn Dawkins · 10 months ago
  14. 9e3d27a Upgrade to nanobind 2.0. (#17497) by Stella Laurenzo · 10 months ago
  15. cad02f9 [Codegen][GPU] Add unrolling pattern for iree_gpu.multi_mma (#17454) by Quinn Dawkins · 10 months ago
  16. 46c6bf5 [CPU] Add support for pack ukernel preparation. (#17472) by Han-Chung Wang · 10 months ago
  17. 3d6a8ee Bump Tracy to https://github.com/wolfpld/tracy/commit/cf2344111. (#17488) by Scott Todd · 10 months ago
  18. abdf550 Update IREE onnx import to be in sync with Torch-MLIR (#17476) by saienduri · 10 months ago
  19. a842527 [Codegen][GPU] Drop dead PassDetail.h file (#17490) by Quinn Dawkins · 10 months ago
  20. 440c870 Bump torch-mlir to 5bb1a65 on 2024-05-23 (#17483) by zjgarvey · 10 months ago
  21. 63dff03 [Codegen][GPU] Add iree_gpu.tensor_barrier op (#17478) by Quinn Dawkins · 10 months ago
  22. 31e1a30 [Codegen][GPU] Add dictionary based lowering config attribute (#17463) by Quinn Dawkins · 10 months ago
  23. 3a2617f [runtime][hip][cuda] Fix semaphore multi-wait, action GPU events and cleanup (#17213) by Boian Petkantchin · 10 months ago
  24. ea7d01e Pin nanobind to 1.9.2 to defer 2.0.0 API changes. (#17481) by Scott Todd · 10 months ago
  25. fe3fb24 Allow passwordless sudo in docker images (#17473) by Boian Petkantchin · 10 months ago
  26. 008add9 [CodeGen][NFC] Rename DecomposeBatchMmt4DOps to CPUPrepareUkernels. (#17471) by Han-Chung Wang · 10 months ago
  27. 30e0238 Bump LLVM to llvm/llvm-project@bd3f5a4bd3d9d7ee8ae801c24c5081073b20abd4 (#17470) by MaheshRavishankar · 10 months ago
  28. 9fe159d [LinalgExt] Generalize attention tiling interface implementation (#17408) by Kunwar Grover · 10 months ago
  29. 900ec67 Split benchmark jobs into their own independent workflow file. (#17400) by Scott Todd · 10 months ago
  30. f7ca45d [ArmSME][test] Enable TransposeMatmulPass and peeling for e2e matmuls (#17452) by Benjamin Maxwell · 10 months ago
  31. 1316c92 [Codegen] NFC: Move the lowering config to an attribute interface (#17439) by Quinn Dawkins · 10 months ago
  32. e36b355 Update integrate branch and title regexes for new naming. (#17464) by Scott Todd · 10 months ago
  33. db8b536 Bump torch-mlir to b870729efe5929b1ee6ff1c7b27d4d1857cdacc7 on 2024-05-21 (#17460) by zjgarvey · 10 months ago
  34. 02c660c Log compile and run commands on successful model tests too. (#17290) by Scott Todd · 10 months ago
  35. de5760d Bump LLVM to llvm/llvm-project@1727594 (#17459) by MaheshRavishankar · 10 months ago
  36. 7813fd3 [CPU] Fix a distribution bug and limiting distribution tile sizes. (#17436) by Han-Chung Wang · 10 months ago
  37. d4aa849 [CPU] Add support for pack/unpack ukernels enablement on llvm-cpu path. (#17427) by Han-Chung Wang · 10 months ago
  38. 9f59514 Add AVX-512 pack ukernel tile function for `16x2xbf16`. (#17432) by Benoit Jacob · 10 months ago
  39. 01b020e Re-enable w7900 jobs. (#17445) by saienduri · 10 months ago
  40. 6c5198d Folding no-op stream.async.update ops away. (#17458) by Ben Vanik · 10 months ago
  41. 006af5d [GPU] Support specifying LLVMGPU backend target features (#17451) by Lei Zhang · 10 months ago
  42. a36773a [Codegen][GPU] Add vectorization pattern for iree_gpu.multi_mma (#17453) by Quinn Dawkins · 10 months ago
  43. f6a38ac [GPU] Thread through a common target description (#17217) by Lei Zhang · 10 months ago
  44. 62a996b [Codegen] Add lane distribution for scf.forall (#17373) by Quinn Dawkins · 10 months ago
  45. 080b1fa [Codegen][GPU] Add a contraction like operation for mma intrinsics (#17374) by Quinn Dawkins · 10 months ago
  46. 29d0ceb Enable a test suite for convolution + winograd. (#17447) by Han-Chung Wang · 10 months ago
  47. e0f3c05 [Codegen][GPU] Change iree_gpu.shuffle_tensor to take a region for the read (#17425) by Quinn Dawkins · 10 months ago
  48. dc61fcc Register ShapeDialect in StableHLO plugin. (#17444) by Scott Todd · 10 months ago
  49. 2a2a4d0 Update various deps to their latest commits. (#17442) by Scott Todd · 10 months ago
  50. a3b74bc [CPU][ArmSME] Update tiling to use all SME accumulators (#16389) by Benjamin Maxwell · 11 months ago
  51. 4132d2e [runtime][hal][hip] Implement collectives via RCCL (#17270) by Boian Petkantchin · 11 months ago
  52. f849e2f Integrate LLVM at `502ccd81` (clean) (#17429) by Ingo Müller · 11 months ago
  53. 98973b3 Add tip for adding new signing key to github (#17420) by Kunwar Grover · 11 months ago
  54. 6d95f8c Integrate LLVM at `74a87548` (clean) (#17423) by Ingo Müller · 11 months ago
  55. 4f8ee51 Moving demotion/promotion passes to input conversion. (#17422) by Ben Vanik · 11 months ago
  56. dece30e [CPU] Do not decompose pack/unpack ops on x86 backends. (#17366) by Prashant Kumar · 11 months ago
  57. 218b934 Support GGUF version 2 as well as 3. (#17319) by Scott Todd · 11 months ago
  58. c1fdd75 Introduce new logo assets. (#17424) by Scott Todd · 11 months ago
  59. f2fcbbf [iree][global] Add conv2d op to demote to bf16 pass (#17410) by Prashant Kumar · 11 months ago
  60. 3b5b70a Integrate LLVM at `1650f1b3` (clean) (#17418) by Ingo Müller · 11 months ago
  61. b4fc0b4 Implementing the f64 VM extension and flipping the flag by default. (#17416) by Ben Vanik · 11 months ago
  62. b716704 Update git-clang-format ref and clang-format version. (#16792) by Scott Todd · 11 months ago
  63. 8fcab13 [Flow] Improve annotation name for conv (#17417) by MaheshRavishankar · 11 months ago
  64. a19fa24 Add tips on signing commits for the DCO check. (#17412) by Scott Todd · 11 months ago
  65. 356e2b7 [Codegen] Add op for flattening warp and thread ids of forall ops (#17368) by Quinn Dawkins · 11 months ago
  66. b17410c Integrate LLVM at `fb9a028b` (clean) (#17411) by Ingo Müller · 11 months ago
  67. 90db41a [LLVMGPU] Add Winograd pipeline for LLVMGPU (#17302) by Max191 · 11 months ago
  68. 05d5710 Integrate LLVM at `c5e67b86` (+1 local revert) (#17409) by Ingo Müller · 11 months ago
  69. 4021109 [Winograd] Add filtering by annotations for Winograd rewrites (#17332) by Max191 · 11 months ago
  70. 0260947 [GlobalOpt] Simplify the logic used to pick the groups. (#17405) by MaheshRavishankar · 11 months ago
  71. bf0fbf0 Fix typo in community/blog/posts/mmt4d.md (#17406) by Bruce Lai · 11 months ago
  72. 9a294eb [Winograd] Use output_tile_size for more static output transform tiling (#17200) by Max191 · 11 months ago
  73. 748db31 Fuse Generic Ops Generated by `gather` Lowering (#17341) by Ian Wood · 11 months ago
  74. 428adf2 [LLVMGPU] Add debug prints for vector distribution config (#17404) by Jakub Kuderski · 11 months ago
  75. ecc6983 Drop Tracy from CI benchmarks. (#17383) by Scott Todd · 11 months ago
  76. 78f5e8d Integrate torch-mlir@ec6d7aa onnx.resize op (#17358) by Chi_Liu · 11 months ago
  77. 2a8d681 [CPU] Remove CPUDoubleTilingPeelingExpert (#17329) by Andrzej Warzyński · 11 months ago
  78. b0f5521 [GitHub] Add Jakub to codeowners for SPIR-V/Vulkan and LLVMGPU/ROCm (#17399) by Jakub Kuderski · 11 months ago
  79. 3bac7ec Add math expand patterns pass (#17395) by jinchen · 11 months ago
  80. 9f0282b Fixes double-free in ReorderBroadcastInDimOpAndElementwiseOp. (#17394) by Ben Vanik · 11 months ago
  81. 29a12f3 [Preprocessing] Remove `input=none` option from TransposeMatmulPass (#17364) by Benjamin Maxwell · 11 months ago
  82. a78cee1 Add support for serializing the textual representation of LLVM IR. (#17193) by Phoebe Chen · 11 months ago
  83. 8d8d18c [LinalgExt] Simplify Attention unit tests (#17393) by Kunwar Grover · 11 months ago
  84. a8404a8 [LLVMGPU] Preserve config dictionary during MapNestedForallToGpuThreadsOp application (#17381) by Kunwar Grover · 11 months ago
  85. 2ed4778 Integrate LLVM at `a1d43c14d` (+1 revert) (#17380) by Benoit Jacob · 11 months ago
  86. 06eb43d Use coalesce loops (#17314) by MaheshRavishankar · 11 months ago
  87. 01ef465 Bump LLVM to llvm/llvm-project@04ce103 (#17352) by MaheshRavishankar · 11 months ago
  88. 07d6508 Drop `--retries=2` from pytest to fix `--timeout` behavior. (#17384) by Scott Todd · 11 months ago
  89. 4bada64 Switch docs and samples from 'tf-nightly' to 'tensorflow'. (#17382) by Scott Todd · 11 months ago
  90. bf93db7 Switch TensorFlow test requirement off of tf-nightly. (#17378) by Scott Todd · 11 months ago
  91. 4f27e64 Generalize overriding llvm func attr flags in translation info (#17365) by Kunwar Grover · 11 months ago
  92. ab0258d Switch pkgci CPU ONNX tests to use standard GitHub runner. (#17375) by Scott Todd · 11 months ago
  93. 2a701d5 [LLVMGPU] Add translation_info config knobs to disable passes (#17340) by Jakub Kuderski · 11 months ago
  94. 309831a Disable all w7900 jobs until the runner is stable. (#17371) by Scott Todd · 11 months ago
  95. 45ca23e [CPU] Take native_vector_size into accounts for attention op tiling. (#17349) by Han-Chung Wang · 11 months ago
  96. a8930d7 Disable test_amd_w7900 job in ci.yml while runner is unstable. (#17369) by Scott Todd · 11 months ago
  97. 4cc52f7 Bump jinja2 from 2.11.3 to 3.1.4 in /build_tools/benchmarks/reporting (#17288) by dependabot[bot] · 11 months ago
  98. 3625c60 Revert "Add math expand patterns pass" (#17367) by Scott Todd · 11 months ago
  99. d657082 [LLVMGPU] Switch GPU passes to tablegen definitions. NFC. (#17361) by Jakub Kuderski · 11 months ago
  100. a9ca8e6 Add math expand patterns pass (#17324) by jinchen · 11 months ago