1. abe9aed [NFC] Fixing typo double and (#16904) by Jose Manuel Monsalve Diaz · 1 year, 1 month ago
  2. e8f8888 [Flow][Transforms] Add dynamic dim capture support to `scf.for` (#16889) by Markus Böck · 1 year, 1 month ago
  3. e942406 [cmake] Require runtime tracing for compiler tracing (#16922) by Jakub Kuderski · 1 year, 1 month ago
  4. 5acacb7 [Codegen] Fix layout analysis for vector.transpose (#16820) (#16921) by Quinn Dawkins · 1 year, 1 month ago
  5. bfdbd16 Cherrypick llvm/llvm-project@c43932ebdc40. (#16920) by Scott Todd · 1 year, 1 month ago
  6. c46daa6 Fix for iree-run-module with --input=not-a-path.bin (#16919) by James Newling · 1 year, 1 month ago
  7. e44cf32 Move external test suite configs out of experimental. (#16907) by Scott Todd · 1 year, 1 month ago
  8. ff820d6 Re-land "start testing real weight models ..." (#16918) by Scott Todd · 1 year, 1 month ago
  9. aacdd33 Tighten up the `lower_to_ukernel_ops.mlir` test (#16883) by Benoit Jacob · 1 year, 1 month ago
  10. daee1dd Ukernels: replace boilerplate by tables. (#16879) by Benoit Jacob · 1 year, 1 month ago
  11. ab1a65b Integrate LLVM at llvm/llvm-project@a9d1fead9614 (#16891) by Scott Todd · 1 year, 1 month ago
  12. d6357b4 [TD][Preprocessing] Speed up `match.cast_compatible_dag_from_root` (#16914) by Jakub Kuderski · 1 year, 1 month ago
  13. 831fcfa [docs][website] Fix up indexing (#16912) by Jakub Kuderski · 1 year, 1 month ago
  14. 7bda2ec Bump torch-mlir to HEAD (e2343cf4ce9a13e8fa09d6c5ade6524fa7cf2b02). (#16911) by Stella Laurenzo · 1 year, 1 month ago
  15. c160cb4 [LLVMGPU] Send skinny matmuls to the gpu reduction pipeline (#16898) by Jakub Kuderski · 1 year, 1 month ago
  16. cd1068b Revert "Start testing real weight models from external test suite." (#16910) by Scott Todd · 1 year, 1 month ago
  17. de65adf [docs][website] Add subsection on profiling with perf and pprof (#16908) by Jakub Kuderski · 1 year, 1 month ago
  18. 03749e7 Fix conv preprocessing filtering logic (#16897) by Jakub Kuderski · 1 year, 1 month ago
  19. 8ab68b6 Start testing real weight models from external test suite. (#16801) by Scott Todd · 1 year, 1 month ago
  20. 61a1f2e Mark regression tests as passing that now pass. (#16900) by Stella Laurenzo · 1 year, 1 month ago
  21. 07a854c [CPU][ArmSME] Add `-arm-sme-vector-legalization` to ArmSME pipeline (#16881) by Benjamin Maxwell · 1 year, 1 month ago
  22. e3ced3a [CodeGen][NFC] Remove unused encoding utils. (#16892) by Han-Chung Wang · 1 year, 1 month ago
  23. b96adf6 Bump torch-mlir to HEAD (17eeac880af409c6c0473c5930a2c08e25209f4c). (#16896) by Stella Laurenzo · 1 year, 1 month ago
  24. 2cdf145 fix build errors when tracing mode = 1 (#16884) by Okwan Kwon · 1 year, 1 month ago
  25. aa72368 Bump TF to nightly dev20240207 (#16871) by Julian Walker · 1 year, 1 month ago
  26. f3b6bcd Address comments by mariecwhite · 1 year, 1 month ago
  27. 76515a7 Add i8*i4 matmul microbenchmark by mariecwhite · 1 year, 1 month ago
  28. 5f2743b [hip] Move into hal/drivers and build by default (#16706) by Lei Zhang · 1 year, 1 month ago
  29. f19780a Add AMDGPU dialect to registerMlirDialects. (#16859) by Han-Chung Wang · 1 year, 1 month ago
  30. 565225e [CPU] Add data-tiling for s8s4s32 Arm64 ukernels by mariecwhite · 1 year, 1 month ago
  31. 767a611 [tracing] fix build errors when tracing mode = 1 (#16853) by Okwan Kwon · 1 year, 1 month ago
  32. 9e95c38 [Flow] Fix exponential blowup when optimizing dynamic `tensor.dim`s (#16847) by Markus Böck · 1 year, 1 month ago
  33. e7eef08 Adds a nullptr check around optional implementation in dynamic modules. (#16845) by Stella Laurenzo · 1 year, 1 month ago
  34. 253881e Fix verifier on stream.async.call to allow call to unknown lifetime. (#16844) by Stella Laurenzo · 1 year, 1 month ago
  35. ed59bed Enabling external HIP HSACO support ala CUDA external PTX support. (#16830) by Ben Vanik · 1 year, 1 month ago
  36. 21e067b Integrate LLVM at llvm/llvm-project@1a6ec906fb37 (#16753) by Han-Chung Wang · 1 year, 1 month ago
  37. 77cc34b Bump numpy dep up to the 2.0 prerelease. (#16800) by Scott Todd · 1 year, 1 month ago
  38. 2eae9b3 [CPU] Remove 8x8x16 i8mm microkernel by mariecwhite · 1 year, 1 month ago
  39. a2ed5d1 Trace allocate/deallocate in rocm_allocator. (#16822) by Scott Todd · 1 year, 1 month ago
  40. 92fe572 [CPU] Add s8s4s32 i8mm ukernel (#16678) by mariecwhite · 1 year, 1 month ago
  41. b61a918 Enable LTO optimization by default for runtime releases. (#16811) by Stella Laurenzo · 1 year, 1 month ago
  42. ee32fc7 [rocm] Fix crash when executable source information is missing (#16805) by Lei Zhang · 1 year, 1 month ago
  43. 2395046 [rocm] Excises (almost) dependence on /opt/rocm from the compiler. (#16803) by Stella Laurenzo · 1 year, 1 month ago
  44. d05b4a1 Splitting stack trace support out of status.c. (#16791) by Ben Vanik · 1 year, 1 month ago
  45. 94a0108 Improving fixupGlobalMutability in IREE::VM::GlobalInitializationPass. (#16783) by Ben Vanik · 1 year, 1 month ago
  46. 15d9039 Embedding executable source contents in binaries for tracing. (#16757) by Ben Vanik · 1 year, 1 month ago
  47. 05ff3e2 Don't link `opencl.bc` when compiling for ROCm. (#16778) by Scott Todd · 1 year, 1 month ago
  48. 3b30ab4 Revert "Ukernels: enable limited debug information that is useful in profilers like Tracy." (#16779) by Benoit Jacob · 1 year, 1 month ago
  49. cdff01f Ukernels: enable limited debug information that is useful in profilers like Tracy. (#15756) by Benoit Jacob · 1 year, 1 month ago
  50. e074a44 [ROCm] Add MI300 and MI300A target chips to doc (#16767) by Boian Petkantchin · 1 year, 1 month ago
  51. 20913f8 Read LLVM_VERSION_MAJOR as a directory property (#16771) by Benoit Jacob · 1 year, 1 month ago
  52. d2542cd Update tracy docs regarding `--iree-hal-dump-executable-sources-to=` (#15814) by Benoit Jacob · 1 year, 1 month ago
  53. 50aa9a3 Update 'iree-samples' -> 'iree-experimental' after rename. (#16761) by Scott Todd · 1 year, 1 month ago
  54. e9ee873 Static link when building RISC-V Linux benchmark tools (#16752) by Jerry Wu · 1 year, 1 month ago
  55. 8711c81 [EmitC] Fix some of the TODOs introduced in #16357 (#16759) by Simon Camphausen · 1 year, 1 month ago
  56. 858cce6 [LLVMGPU] Fix fused elementwise broadcasts in mfma pipeline (#16756) by Quinn Dawkins · 1 year, 1 month ago
  57. 5497435 Fixing iree-run-mlir error messages. (#16749) by Ben Vanik · 1 year, 1 month ago
  58. 26924e4 Adding `iree.tensor.trace` support for printf debugging. (#16746) by Ben Vanik · 1 year, 1 month ago
  59. e12ab47 Replace `unaryOperator` by EmitC LogicalNotOp (#16730) by Marius Brehler · 1 year, 1 month ago
  60. 0b8a096 Use EmitCBuilder for VariableOp (#16740) by Marius Brehler · 1 year, 1 month ago
  61. e9f7ecd [cuda][hip] Drop name_literal reference like Vulkan side (#16742) by Lei Zhang · 1 year, 1 month ago
  62. 3c296a5 Disabling inlining on the torch async function. (#16739) by Ben Vanik · 1 year, 1 month ago
  63. de398c3 Let RISCV_LINKER_FLAGS_EXE can be assigned by user (#16738) by Yun Hsiang · 1 year, 1 month ago
  64. a3603c6 [ROCM] Fix build with runtime tracing enabled (#16737) by Quinn Dawkins · 1 year, 1 month ago
  65. 18d73c7 [rocm] Fix IREE_ROCM_TRACE_ZONE symbol (#16736) by Lei Zhang · 1 year, 1 month ago
  66. c8081fd Adding legacy ROCM tracing zones. (#16735) by Ben Vanik · 1 year, 1 month ago
  67. 0077030 [pkgci] Enable on sdxl feature branch. by Stella Laurenzo · 1 year, 1 month ago
  68. 12fae0e Update JAX and TFLite MLIR artifacts for benchmarking by mariecwhite · 1 year, 1 month ago
  69. 7f9d97b Add optional attribute to set MFMA read layout (#16733) by harsh-nod · 1 year, 1 month ago
  70. 331801c bump torch to 80c7bc3f7ae12413836a2f610a6491794b4dbb08 (#16717) by Daniel Garvey · 1 year, 1 month ago
  71. 3baa82b [CUDA] Fix CUDA transform tests for generalized mmt (#16729) by Jakub Kuderski · 1 year, 1 month ago
  72. 27b837c Replace `binaryOperator` by EmitC ops (#16728) by Marius Brehler · 1 year, 1 month ago
  73. 69432b6 Delete compiler/src/iree/compiler/Dialect/Flow/Transforms/SubsetInsertionOpInterfaceImpl.cpp by Ben Vanik · 1 year, 1 month ago
  74. fc684f4 [EmitC] Fix error message in compiler driver (#16727) by Simon Camphausen · 1 year, 1 month ago
  75. 7c2b48c [LLVMGPU][SPIR-V] Run named op generalization early in configuration pipeline (#16726) by Jakub Kuderski · 1 year, 1 month ago
  76. d153b1c [LLVMGPU] Add shared memory prefetching (#16723) by Kunwar Grover · 1 year, 1 month ago
  77. 50714ae Retry failed pytest cases to try limiting flakes. (#16718) by Scott Todd · 1 year, 1 month ago
  78. b7c4152 [LinalgExt] Add scaling to attention op (#16679) by Kunwar Grover · 1 year, 1 month ago
  79. 46326ef Integrate LLVM at llvm/llvm-project@abfac56 (#16710) by Jakub Kuderski · 1 year, 1 month ago
  80. 4691fc5 Use subgroup size when doing shuffles (#16698) by harsh-nod · 1 year, 1 month ago
  81. 7da8af6 Cleanup a few docs/references to old paths in HAL/Target. (#16716) by Scott Todd · 1 year, 1 month ago
  82. a283044 [hip] Make graph command buffer as default for initialization (#16707) by Lei Zhang · 1 year, 1 month ago
  83. de02c1d Disable external test suite on ROCm while flaky. (#16705) by Scott Todd · 1 year, 1 month ago
  84. c2a3245 Convert LLVMCPU compiler target to a plugin. (#16704) by Scott Todd · 1 year, 1 month ago
  85. b5af996 Prefer broadcasting RHS over LHS in AVX-512 multiply-accumulate instructions (#16709) by Benoit Jacob · 1 year, 1 month ago
  86. 47dfaa5 If negative tolerance in numerical test, then emit error (#16694) by James Newling · 1 year, 1 month ago
  87. 7884dc8 Revert adding unit dim folding to GlobalOps (#16708) by Max191 · 1 year, 1 month ago
  88. f513fe2 Add timeouts to pytest cases in pkgci/iree_tests. (#16703) by Scott Todd · 1 year, 1 month ago
  89. 0545746 [hip] Mark device local + host visible as low performance (#16701) by Lei Zhang · 1 year, 1 month ago
  90. b027da4 Convert VulkanSPIRV compiler target into a plugin. (#16699) by Scott Todd · 1 year, 1 month ago
  91. a86b8bf [Preprocessing] Change nesting of FoldUnitExtentDims (#16697) by Max191 · 1 year, 1 month ago
  92. 2f1d32d [linalg] Add the lowering of quantized_batch_matmul op. (#16615) by Prashant Kumar · 1 year, 1 month ago
  93. 6d03805 [CPU] Centralize pipeline lowering options and apply them consistently. (#16690) by Han-Chung Wang · 1 year, 2 months ago
  94. c344e26 Cleanup compiler plugin directory and include paths. (#16691) by Scott Todd · 1 year, 2 months ago
  95. c87eafe Update external test suite version pin and XFAIL sets. (#16675) by Scott Todd · 1 year, 2 months ago
  96. bb9409f [VectorDistribution] Emit diagnostics for invalid layouts (#16688) by Jakub Kuderski · 1 year, 2 months ago
  97. c07d110 [GlobalOpt][Flow] Add GlobalOp folding to FoldUnitExtentDims (#16611) by Max191 · 1 year, 2 months ago
  98. e612e91 Use createOrFold for linalg_ext dim queries. (#16685) by Ben Vanik · 1 year, 2 months ago
  99. f812ce2 Drop lists of VulkanSPIRV flags. (#16680) by Scott Todd · 1 year, 2 months ago
  100. 3bdb45b Use correct 'webgpu-spirv' flag name in samples. (#16681) by Scott Todd · 1 year, 2 months ago