1. cc2ccc3 Use our private copy of Vulkan header files explicitly. (#13346) by Scott Todd · 2 years ago
  2. 0b77bfd Unify the two plugin examples to use the same micro-kernel. (#13324) by MaheshRavishankar · 2 years ago
  3. 935b110 [hal][cts] Add more tests for drivers device creation APIs (#12064) by Lei Zhang · 2 years ago
  4. 1cbb0fe benchmark: doubling batch count, set bytes processed (#13269) by bjacob · 2 years ago
  5. 3cff105 Update tflite bindings to accept both identifier forms (#13195) by Jacques Pienaar · 2 years ago
  6. e312348 [StableHLO] Port lowering to LinalgExt (#13331) by Jakub Kuderski · 2 years ago
  7. b8a8f5c Update python bindings for `iree-benchmark-module` to use `--module=-` (#13345) by Kojo Acquah · 2 years ago
  8. 72c6169 Making iree_hal_channel_provider_t a ref object and exposing on devices. (#13317) by Ben Vanik · 2 years ago
  9. 8f03e8d [StableHLO] Port control flow legalization (#13337) by Jakub Kuderski · 2 years ago
  10. 50a6c17 Add prefetches to fix performance regression on ARM Cortex-X2 (#13342) by bjacob · 2 years ago
  11. 5d15caf Removing a verifier check on equality between threadblock K shape and problem K shape (#13310) by Manish Gupta · 2 years ago
  12. 6c2f27d [StableHLO] Port shape computation legalization (#13328) by Jakub Kuderski · 2 years ago
  13. 475af42 [spirv][vulkan] Add f16 e2e matmul tests (#13327) by Jakub Kuderski · 2 years ago
  14. d318c54 Rework iree-run-mlir to operate against the IREE compiler C API. (#12715) by Stella Laurenzo · 2 years ago
  15. c792591 nonfatal failures in ukernel tests (#13316) by bjacob · 2 years ago
  16. 4208fee Refactor OOBI to save framework agnostic results (#13206) by Phoenix Meadowlark · 2 years ago
  17. cca2173 [CI] Fix buildkite pipeline (#13322) by Jakub Kuderski · 2 years ago
  18. 9f09ae4 Add trailer to skip default benchmarking on LLVM integration PR (#13314) by Jerry Wu · 2 years ago
  19. af5f4c3 [spirv][vulkan] Do not run nvidia gpu tests on buildkite (#13319) by Jakub Kuderski · 2 years ago
  20. 3912546 Add a sample that uses the iree_codegen.ukernel.generic operation to call a microkernel supplied as a plugin. (#13132) by MaheshRavishankar · 2 years ago
  21. 02f85ea Moving MPI library loading to hal/utils. (#13152) by Calin Cascaval · 2 years ago
  22. 32a92b6 Post GCS links in CI summary (#13273) by Jerry Wu · 2 years ago
  23. ad65e56 Always run benchmarks on LLVM integration PR (#13298) by Jerry Wu · 2 years ago
  24. c6ba2a8 polish ukernel test cpu features (#13266) by bjacob · 2 years ago
  25. dc1684d [spirv][vulkan] Run e2 i8 matmul tests in CI (#13312) by Jakub Kuderski · 2 years ago
  26. 5cd6e1d Update sample Colab notebooks following TF importer changes. (#13311) by Scott Todd · 2 years ago
  27. ebf8490 Integrate llvm/llvm-project@be9c91843bab (#13296) by Lei Zhang · 2 years ago
  28. f0fac24 Move benchmark config generation to build_e2e_test_artifacts (#13291) by Jerry Wu · 2 years ago
  29. 7a86df4 [SPIRV] Add ability to specify transform dialect codegen spec file (#13267) by Quinn Dawkins · 2 years ago
  30. e0dc18e Remove --split-input-file from iree-run-mlir. (#13302) by Ben Vanik · 2 years ago
  31. 6852a34 [StableHLO][NFC] Port remaining op tests (#13297) by Jakub Kuderski · 2 years ago
  32. 6677e81 [spirv][vulkan] Enable integer dot prod on desktop GPUs (#13279) by Jakub Kuderski · 2 years ago
  33. ddb3223 [NFC] Organize `compiler/Utils/TracingUtils.cpp`. (#13087) by Scott Todd · 2 years ago
  34. 26f9cdf Unify typed VMVX entry points into untyped functions. (#13270) by bjacob · 2 years ago
  35. 4f5d55b Handle supported ImportOptions in tf.py and fix Windows compatibility. (#13287) by Scott Todd · 2 years ago
  36. eca606e [spirv][webgpu] Expand extended multiplication ops (#13274) by Jakub Kuderski · 2 years ago
  37. 3537df7 Stop preserving `"encoding"` attribute on `pack` op. (#13293) by bjacob · 2 years ago
  38. 550b0c4 Swap tile sizes adjustment about pack/unpack to a proper order. (#13272) by Han-Chung Wang · 2 years ago
  39. b476624 Add support for computing dispatch workgroup count using program slices. (#13038) by MaheshRavishankar · 2 years ago
  40. 34e0770 Update pjrt-plugin link in jax.md. (#13284) by Scott Todd · 2 years ago
  41. 5ad8800 Update mmperf (#13221) by mariecwhite · 2 years ago
  42. 31de207 Disable failing ukernel tests on Windows (#13276) by bjacob · 2 years ago
  43. 8cf1db2 Using FindThreads instead of hardcode `-lpthread`. (#13118) by Rechie Kho · 2 years ago
  44. 49d0123 ukernels: pack: move the generation of i64 padding_value to codegen (#13264) by bjacob · 2 years ago
  45. 1fd449b ukernels: fold type enums into flags (#13260) by bjacob · 2 years ago
  46. 9a8e63e Ukernel interface: take offsets, reorder fields (#13235) by bjacob · 2 years ago
  47. 47da9cf Disable TF Bert-Large due to #13211 (#13212) by mariecwhite · 2 years ago
  48. 5f16489 ukernels: let `pack` take `padding_value` by value (#13233) by bjacob · 2 years ago
  49. 6bd3211 ukernels: drop the unused `i8` case in `unpack`. (#13231) by bjacob · 2 years ago
  50. 4bca308 Ukernels: separate public vs internal headers (#13230) by bjacob · 2 years ago
  51. 772a335 [runtime] Fix std::array parameter unpacking (#13222) by Eugene Zhulenev · 2 years ago
  52. 52861ca Integrate llvm-project at 9e9b1effac34 and bump dependencies (#13232) by Kojo Acquah · 2 years ago
  53. c6c713c [StableHLO] Add initial input conversion pipeline and e2e tests (#13223) by Jakub Kuderski · 2 years ago
  54. 786e1a0 Add missing canonicalization to LLVMCPUVectorLowering (#13169) by Diego Caballero · 2 years ago
  55. 84d938e Make python dependency in lit test configurable (#13224) by Tori Baker · 2 years ago
  56. ddbb3dd Relax the application conditions for the matmul tensorcore transform … (#13228) by Nicolas Vasilache · 2 years ago
  57. c59cf83 NFC - Improve debug messages around transform dialect matching (#13226) by Nicolas Vasilache · 2 years ago
  58. 585d387 Plumb compiler version info through to built Python wheels. (#13214) by Stella Laurenzo · 2 years ago
  59. e199a08 Split mhlo.optimization_barrier lowering into several util.optimization_barrier ops if there are multiple inputs (#13210) by NatashaKnk · 2 years ago
  60. 69236fa Remove outdated python version requirement for Pytype (#13209) by Phoenix Meadowlark · 2 years ago
  61. 43ceb15 Update matmul tensorcore strategy to properly trigger pipelining with… (#13194) by Nicolas Vasilache · 2 years ago
  62. fe604d3 [StableHLO][NFC] Trim down stablehlo to linalg conversion deps (#13180) by Jakub Kuderski · 2 years ago
  63. 978754a tidy up elementwise ukernels (#13204) by bjacob · 2 years ago
  64. 4dd4eac [llvmgpu] check if operands can be distributable for shmem promotion (#13201) by Okwan Kwon · 2 years ago
  65. ab37989 Integrate llvm-project at 571eaead173b and bump dependencies (#13200) by Kojo Acquah · 2 years ago
  66. e1a4a2b Remove the `matmul` ukernel (#13175) by bjacob · 2 years ago
  67. e6b7fae port `pack` and `unpack` to `ukernel.generic` (#13173) by bjacob · 2 years ago
  68. 0d82968 Use git directly instead of `actions/checkout` on Windows (#13186) by Geoffrey Martin-Noble · 2 years ago
  69. 0ab01b6 Use GCS for Windows ccache (#13183) by Geoffrey Martin-Noble · 2 years ago
  70. de2ecca Remove Pixel 4 GPU benchmarks (#13188) by Jerry Wu · 2 years ago
  71. eafc042 Fix ASAN issue casting to uint32 (#13193) by Tori Baker · 2 years ago
  72. 0e9c462 Add lowering for BF16 arith/math operations to F32 with truncf/extf wrappers (#12841) by Rob Suderman · 2 years ago
  73. 302a289 Generalize matmul tensorcore strategy to work with arbitrary unaligned f32 tens… (#13192) by Nicolas Vasilache · 2 years ago
  74. 3730042 Refactor strategy towards generalization (#13190) by Nicolas Vasilache · 2 years ago
  75. a806149 Disable models due to recent regression: https://github.com/openxla/iree/issues/13189 by mariecwhite · 2 years ago
  76. 7758993 Remove pull_request by mariecwhite · 2 years ago
  77. af46cd8 Add Bert-Large TF models in various batch sizes by mariecwhite · 2 years, 1 month ago
  78. ed174dc Fix batch_matmul + transpose using nvgpu.mma.sync.1688.f32 (#13075) (#13170) by Manish Gupta · 2 years ago
  79. ab5bcbd Add compiler C API `ireeCompilerGetRevision`. (#13184) by Stella Laurenzo · 2 years ago
  80. 8a1182c Put ccache hint in docs on one line. (#13185) by Stella Laurenzo · 2 years ago
  81. 20e014b Enable Shared Memory Swizzle Optimization for Batched Matmul (#13060) by Manish Gupta · 2 years ago
  82. 4880d00 [Dev Container] - Adding VSCode Dev Container Capability (#12921) by Jonathan DEKHTIAR · 2 years ago
  83. 8548216 Fix exportTargetAttr construction in InlineExecutablesPass. (#13144) by Scott Todd · 2 years ago
  84. 6a7f69f Fixing vm::ref operator& after type consistency changes. (#13178) by Ben Vanik · 2 years ago
  85. b8527c6 NFC - Parameterize the matmul tensor core strategy towards generaliza… (#13177) by Nicolas Vasilache · 2 years ago
  86. 04d2ba5 Updating architecture diagram with "plugins" in a few places. (#12615) by Ben Vanik · 2 years ago
  87. d2f5a7c Update `experimental/web/sample_dynamic/` after runtime changes. (#13174) by Scott Todd · 2 years ago
  88. 56c4eff Update TensorFlow package in `samples` Dockerfile to 2.12.0. (#13171) by Scott Todd · 2 years ago
  89. e4e2398 Add CI workflow to run android tests on mobile phones (#13024) by Jerry Wu · 2 years ago
  90. 81cf28c Simplifying iree-run-mlir by making it run only a single function. (#13149) by Ben Vanik · 2 years ago
  91. 0d5f837 [xla] Drop arg & res attributes on XLA import (#13154) by Jacques Pienaar · 2 years ago
  92. 9040c43 Opt in to `-Wc++20-extensions` warning in clang. (#13158) by Scott Todd · 2 years ago
  93. 48d7a42 Replace `ukernel.mmt4d` by `ukernel.generic`. (#13143) by bjacob · 2 years ago
  94. 479900a [mhlo] Remove redundant preprocessing pattern. (#13153) by Jacques Pienaar · 2 years ago
  95. abbe3da [StableHLO][NFC] Do not alias the stablehlo namespace (#13150) by Jakub Kuderski · 2 years ago
  96. cc783ff [StableHLO] Port convolution to linalg lowering (#13151) by Jakub Kuderski · 2 years ago
  97. 73f29cf Appease Windows compiler (#13155) by Nicolas Vasilache · 2 years ago
  98. cf3d786 Add a transform dialect CUDA strategy for unaligned f32 matmul that are still divisible by 4 (#13098) by Thomas · 2 years ago
  99. 8d1a790 Integrate llvm-project at c2c49f464435 and bump dependencies (#13135) by Kojo Acquah · 2 years ago
  100. 0b92d79 Enable passing tests on CPU. (#13147) by Han-Chung Wang · 2 years ago