1. 65680c6 [VectorDistribution] Add patterns for distributing transfer_read/transfer_write (#16115) by Kunwar Grover · 1 year, 3 months ago
  2. d71c147 Refresh website branding. (#16151) by Scott Todd · 1 year, 3 months ago
  3. ee9b206 [ROCM][Ukernel] Fix index types. (#16154) by Stanley Winata · 1 year, 3 months ago
  4. c859e29 Fix web and Colab sample CI builds. (#16155) by Scott Todd · 1 year, 3 months ago
  5. 0f481d8 Remove run_shark_tank.yml and supporting code. (#15825) by Scott Todd · 1 year, 3 months ago
  6. dc0eaf7 Fix: `--iree-llvmcpu-loop-*` command-line options were not having any effect. (#16153) by Benoit Jacob · 1 year, 3 months ago
  7. 58cd112 Enforce CUDA >= 12 and fix its CMake search procedure (#16142) by Boian Petkantchin · 1 year, 3 months ago
  8. 3aab7b7 Add e2e tests for packing on i8 types. (#16148) by Han-Chung Wang · 1 year, 3 months ago
  9. 7d736b5 Ukernels: simplify the architecture-specific bitcode build. (#16126) by Benoit Jacob · 1 year, 3 months ago
  10. 6ab1ed8 Preserve reflection attrs on functions when wrapping for the native ABI. (#16129) by Ben Vanik · 1 year, 3 months ago
  11. 91803de Allow specifying multiple --device= flags in tooling. (#16132) by Ben Vanik · 1 year, 3 months ago
  12. 4b1b8e2 Fixing task worker utilization tracing plot. (#16131) by Ben Vanik · 1 year, 3 months ago
  13. e178f45 [stablehlo] Implement product of householder reflectors (#15555) by Rob Suderman · 1 year, 3 months ago
  14. 390eeb0 Fixing copypasta in codegen/flow tests that included ABI ops. (#16140) by Ben Vanik · 1 year, 3 months ago
  15. 1cb0f99 [VectorExt] Fix LayoutIterator iteration for step != 0 (#16133) by Kunwar Grover · 1 year, 3 months ago
  16. e3db254 Simplify how mmt4d ukernels deal with the K=0 case. (#16137) by Benoit Jacob · 1 year, 3 months ago
  17. 51c30ab e2e microkernel pipeline + argmax ukernel on ROCM backend. (#15943) by Stanley Winata · 1 year, 3 months ago
  18. 6847e37 [CPU] Remove special distribution tile sizes from setting matmul config. (#15968) by Han-Chung Wang · 1 year, 3 months ago
  19. dd17612 Add `IREE_ENABLE_WERROR_FLAG` CMake option. (#16121) by Rechie Kho · 1 year, 3 months ago
  20. ddccda0 [HIP] Add macro for HIP build deps update (#16123) by Nithin Meganathan · 1 year, 3 months ago
  21. 510c77e [VectorDistribution] Preserve fastmath flags on elementwise ops (#16118) by Jakub Kuderski · 1 year, 3 months ago
  22. 063c46e [CPU] Propagate "scalability" when using peeling for vectorisation (#16058) by Andrzej Warzyński · 1 year, 3 months ago
  23. 5ac75d8 [VectorDistribution[ Fix bugs in vector distribution (#16116) by Kunwar Grover · 1 year, 3 months ago
  24. e8b2400 [CPU] Add clamping behavior back because of mid-air collision on #16041 (#16113) by Han-Chung Wang · 1 year, 3 months ago
  25. f47b76f [CPU] Remove legacy logics from matmul peeling expert. (#16041) by Han-Chung Wang · 1 year, 3 months ago
  26. 863d302 QOL fixes for portable and cross-compiling builds. (#16111) by Stella Laurenzo · 1 year, 3 months ago
  27. 914f306 [CodeGen] Switching to upstream eliminateCommonSubExpressions method. (#16105) by Han-Chung Wang · 1 year, 3 months ago
  28. c27ed41 [GlobalOpt][CPU] Move to using indexing maps for data tiling encodings instead of named op enums (#15984) by Max191 · 1 year, 3 months ago
  29. 17e9529 [spirv][vulkan] Refine device query to be more descriptive (#16101) by Lei Zhang · 1 year, 3 months ago
  30. 1b1e769 [LinalgExt] Delete dead codes. (#16104) by Han-Chung Wang · 1 year, 3 months ago
  31. 7ed7b96 [CPU] Break LLVMCPUVectorLowering pass to several small passes. (#16094) by Han-Chung Wang · 1 year, 3 months ago
  32. e32a502 Bump LLVM to llvm/llvm-project@f5145f4dc819 (#16073) by Han-Chung Wang · 1 year, 3 months ago
  33. 869e505 Disable const-eval for parameters unit test (#16089) by Max191 · 1 year, 3 months ago
  34. 3031ae6 Unifying helpers for size/shape-aware dim lookup. (#16095) by Ben Vanik · 1 year, 3 months ago
  35. 73fe86e Disable CUDA2 by default. (#16102) by Ben Vanik · 1 year, 3 months ago
  36. e2e126c [EmitC] Remove const casts in conversion (#15679) by Simon Camphausen · 1 year, 3 months ago
  37. 92f3a7f [CPU] Refine the logic to control vectorization pre-processing (#16078) by Andrzej Warzyński · 1 year, 3 months ago
  38. 0a69776 Rename and refactor HoistRedundantVectorTransfers (#16079) by Andrzej Warzyński · 1 year, 3 months ago
  39. 42e0a4b [spirv] Fix executable linking test to match real queries (#16100) by Lei Zhang · 1 year, 3 months ago
  40. c3518b2 [CPU] Unifly LLVMCPU cmd flags and variable names in KernelDisatpch.cpp (#16091) by Han-Chung Wang · 1 year, 3 months ago
  41. 02c5215 Revert "[Codegen] Re-Enable transform dialect configuration strategy sample (#15787)" (#16097) by Quinn Dawkins · 1 year, 3 months ago
  42. 562098f [VectorDistribution] Add infrastructure to support vector distribution based on layout (#16009) by Kunwar Grover · 1 year, 3 months ago
  43. 3b534c4 [Codegen] Re-Enable transform dialect configuration strategy sample (#15787) by Quinn Dawkins · 1 year, 3 months ago
  44. 8fb2680 Disable loop unrolling in LLVM IR optimization passes (#16092) by Benoit Jacob · 1 year, 3 months ago
  45. 4786ebc Remove SwiftShader Docker images and software Vulkan testing. (#15837) by Scott Todd · 1 year, 3 months ago
  46. dc81beb [CPU] Do not fuse ukernel ops into tiling loops. (#16054) by Han-Chung Wang · 1 year, 3 months ago
  47. baa911e Revert "Move Android benchmarks from Pixel 6 to Pixel 8" (#16090) by Jerry Wu · 1 year, 3 months ago
  48. 171e31c [cuda] Move to hal/drivers and wire up BUILD files (#14620) by Lei Zhang · 1 year, 3 months ago
  49. 74d1f01 [cuda] Break cyclic retain between device and device event pool (#16088) by Lei Zhang · 1 year, 3 months ago
  50. 381a16c [cuda] Fix deadlock when advancing deferred queue in driver thread (#15673) by Lei Zhang · 1 year, 3 months ago
  51. a7a7ad6 Add vm.buffer.hash and util.buffer.hash ops (#16003) by Quinn Dawkins · 1 year, 3 months ago
  52. 4b2aaaf Move Android benchmarks from Pixel 6 to Pixel 8 (#15796) by Jerry Wu · 1 year, 3 months ago
  53. 21d0153 Adding a robots.txt to iree.dev. (#16085) by Ben Vanik · 1 year, 3 months ago
  54. 198f271 [CPU] Fix multiconfig bug with tensor.pack op (#16082) by Max191 · 1 year, 3 months ago
  55. 81a47a7 Switch JAX pjrt-plugin link. (#15923) by Scott Todd · 1 year, 3 months ago
  56. c8ecc1c Reland "[spirv][vulkan] Enable device query generation and execution" (#16075) by Lei Zhang · 1 year, 3 months ago
  57. 2605fa1 Cherry-pick llvm/llvm-project@f5145f4dc819 to fix out-of-bounds access (#16074) by Han-Chung Wang · 1 year, 3 months ago
  58. 282ab77 Revert "[spirv][vulkan] Enable device query generation and execution" (#16077) by Han-Chung Wang · 1 year, 3 months ago
  59. 182a8f3 [HIP] Adds graph command buffer & descriptor set and pipeline layout (#15910) by Nithin Meganathan · 1 year, 3 months ago
  60. 852684a [spirv][vulkan] Enable device query generation and execution (#15977) by Lei Zhang · 1 year, 3 months ago
  61. b55ba25 Fixing/silencing some warnings that have crept in over time. (#16072) by Ben Vanik · 1 year, 3 months ago
  62. 776789e [GlobalOpt] Add a pass to simplify tensor pack/unpack ops. (#15993) by Han-Chung Wang · 1 year, 3 months ago
  63. c1edb82 [CodeGen] Implement MemoryEffectsOpInterface for ukernel ops. (#16053) by Han-Chung Wang · 1 year, 3 months ago
  64. 6aa310c [CPU] Move checking stack allocation cmd flag to Passes.cpp (#16062) by Han-Chung Wang · 1 year, 3 months ago
  65. bd2c92d [Stream] Update more op folders to verify matching types (#16070) by Quinn Dawkins · 1 year, 3 months ago
  66. d21a99c Check for source location resolution function in dynamic modules (#16065) by Quinn Dawkins · 1 year, 3 months ago
  67. 46b06d0 [minor code simplification] Implement algorithm without stack (#15999) by James Newling · 1 year, 3 months ago
  68. b16cee3 Bump LLVM to llvm/llvm-project@054b5fc0fd41 (#16055) by Han-Chung Wang · 1 year, 3 months ago
  69. ccd576c [LLVMGPU] Add AMDGPUToArith conversion patterns to ROCDL lowering (#16067) by Quinn Dawkins · 1 year, 3 months ago
  70. f0c8380 [LinalgExt] Retire RewriteForallToScfForOp transform op. (#16064) by Han-Chung Wang · 1 year, 4 months ago
  71. 6647a5b [CPU] Skip tiling if the compute op is not a TilingInterface op. (#16052) by Han-Chung Wang · 1 year, 4 months ago
  72. 124d562 Bump StableHLO to f8dcebfa1ec166806974f6ae0dfb902d36b47238 (#16049) by Jacques Pienaar · 1 year, 4 months ago
  73. d6dad12 ukernel: unroll the s16u4 VNNI ukernel, and drop the unused N0=16 variant (#16047) by Benoit Jacob · 1 year, 4 months ago
  74. ef344ac Bump LLVM to llvm/llvm-project@6b65d79 and deps (2023-12-29) (#16012) by Kunwar Grover · 1 year, 4 months ago
  75. b3200c8 [CPU] Enable mmt4d distribution for large reduction size cases. (#16037) by Han-Chung Wang · 1 year, 4 months ago
  76. 8869777 Add crosscompile utility binaries back to iree-dist tarball (#16034) by CindyLiu · 1 year, 4 months ago
  77. db83cc4 [LinalgExt] Delete fuse_producer transform op. (#16044) by Han-Chung Wang · 1 year, 4 months ago
  78. 0f0e0e7 [CodeGen] Carry over lowering_config when decomposing batch_mmt4d ops. (#16043) by Han-Chung Wang · 1 year, 4 months ago
  79. 6ac9b7e [LinalgExt] Expose attention tile size parameter (#16030) by harsh-nod · 1 year, 4 months ago
  80. 6711155 Add folding arithmetic extensions (#15953) by erman-gurses · 1 year, 4 months ago
  81. c4739bc [LinalgExt] Delete LinalgExt tiling patterns and passes. (#15921) by Han-Chung Wang · 1 year, 4 months ago
  82. ded4145 [LinalgExt] Switch tiling LinalgExt tests to use transform dialect. (#15904) by Han-Chung Wang · 1 year, 4 months ago
  83. 957af54 [LinalgExt] Switch distribution tests to use transform dialect. (#15922) by Han-Chung Wang · 1 year, 4 months ago
  84. e4aa589 [CodeGen] Add aflag to allow potentially to remove unnecessary code to improve performance. (#15862) by Lubomir Litchev · 1 year, 4 months ago
  85. ce282c8 [stablehlo] Add missing nullptr check for unregistered dialects (#16032) by Jakub Kuderski · 1 year, 4 months ago
  86. f7b108f Bump website copyright to 2024. (#16028) by Scott Todd · 1 year, 4 months ago
  87. 41da229 [CPU][NFC] Retire LLVMCPUTensorPad pass. (#16027) by Han-Chung Wang · 1 year, 4 months ago
  88. 80efa38 [GlobalOpt] Add f32->bf16 demotion cases for transposed matmuls (#16022) by Max191 · 1 year, 4 months ago
  89. b92ceb4 Remove SYSTEM scope from transitive includes. (#16018) by Stella Laurenzo · 1 year, 4 months ago
  90. 8a87bf1 Disable -Waddress warnings on GCC. by Stella Laurenzo · 1 year, 4 months ago
  91. 3e0583f Some CMake package install ergonomics. (#16015) by Stella Laurenzo · 1 year, 4 months ago
  92. 895645b [VectorLayoutAnalysis] Add transfer functions for vector.contract (#15996) by Kunwar Grover · 1 year, 4 months ago
  93. f9cdcfd [python] Expose python bindings for scf in iree.compiler.dialects (#16013) by Kunwar Grover · 1 year, 4 months ago
  94. e7384a1 [VectorLayoutAnalysis] Add debug printing (#16007) by Kunwar Grover · 1 year, 4 months ago
  95. c35d8e9 Standardizes CMake setup of C directory trees behind a macro. (#16011) by Stella Laurenzo · 1 year, 4 months ago
  96. 1ae94a5 [ROCM] Expose amdgpu-waves-per-eu opt hint (#16010) by harsh-nod · 1 year, 4 months ago
  97. 15c306f Build functioning dev packages for IREECompiler and IREERuntime. (#16008) by Stella Laurenzo · 1 year, 4 months ago
  98. b0e8f3c [VectorLayoutAnalysis] Fix bug in scf.for transfer functions (#15989) by Kunwar Grover · 1 year, 4 months ago
  99. 4592b8f [torch] Bump torch-mlir to d560698e3d610ecdc56667c713e2338c47bf4f44. (#16006) by Stella Laurenzo · 1 year, 4 months ago
  100. ccbe33f [VectorExt] Add layout iterator classes (#16004) by harsh-nod · 1 year, 4 months ago