  1. 52861ca Integrate llvm-project at 9e9b1effac34 and bump dependencies (#13232) by Kojo Acquah · 2 years ago
  2. c6c713c [StableHLO] Add initial input conversion pipeline and e2e tests (#13223) by Jakub Kuderski · 2 years ago
  3. 786e1a0 Add missing canonicalization to LLVMCPUVectorLowering (#13169) by Diego Caballero · 2 years ago
  4. 84d938e Make python dependency in lit test configurable (#13224) by Tori Baker · 2 years ago
  5. ddbb3dd Relax the application conditions for the matmul tensorcore transform … (#13228) by Nicolas Vasilache · 2 years ago
  6. c59cf83 NFC - Improve debug messages around transform dialect matching (#13226) by Nicolas Vasilache · 2 years ago
  7. 585d387 Plumb compiler version info through to built Python wheels. (#13214) by Stella Laurenzo · 2 years ago
  8. e199a08 Split mhlo.optimization_barrier lowering into several util.optimization_barrier ops if there are multiple inputs (#13210) by NatashaKnk · 2 years ago
  9. 69236fa Remove outdated python version requirement for Pytype (#13209) by Phoenix Meadowlark · 2 years ago
  10. 43ceb15 Update matmul tensorcore strategy to properly trigger pipelining with… (#13194) by Nicolas Vasilache · 2 years ago
  11. fe604d3 [StableHLO][NFC] Trim down stablehlo to linalg conversion deps (#13180) by Jakub Kuderski · 2 years ago
  12. 978754a tidy up elementwise ukernels (#13204) by bjacob · 2 years ago
  13. 4dd4eac [llvmgpu] check if operands can be distributable for shmem promotion (#13201) by Okwan Kwon · 2 years ago
  14. ab37989 Integrate llvm-project at 571eaead173b and bump dependencies (#13200) by Kojo Acquah · 2 years ago
  15. e1a4a2b Remove the `matmul` ukernel (#13175) by bjacob · 2 years ago
  16. e6b7fae port `pack` and `unpack` to `ukernel.generic` (#13173) by bjacob · 2 years ago
  17. 0d82968 Use git directly instead of `actions/checkout` on Windows (#13186) by Geoffrey Martin-Noble · 2 years ago
  18. 0ab01b6 Use GCS for Windows ccache (#13183) by Geoffrey Martin-Noble · 2 years ago
  19. de2ecca Remove Pixel 4 GPU benchmarks (#13188) by Jerry Wu · 2 years ago
  20. eafc042 Fix ASAN issue casting to uint32 (#13193) by Tori Baker · 2 years ago
  21. 0e9c462 Add lowering for BF16 arith/math operations to F32 with truncf/extf wrappers (#12841) by Rob Suderman · 2 years ago
  22. 302a289 Generalize matmul tensorcore strategy to work with arbitrary unaligned f32 tens… (#13192) by Nicolas Vasilache · 2 years ago
  23. 3730042 Refactor strategy towards generalization (#13190) by Nicolas Vasilache · 2 years ago
  24. a806149 Disable models due to recent regression: https://github.com/openxla/iree/issues/13189 by mariecwhite · 2 years ago
  25. 7758993 Remove pull_request by mariecwhite · 2 years ago
  26. af46cd8 Add Bert-Large TF models in various batch sizes by mariecwhite · 2 years, 1 month ago
  27. ed174dc Fix batch_matmul + transpose using nvgpu.mma.sync.1688.f32 (#13075) (#13170) by Manish Gupta · 2 years ago
  28. ab5bcbd Add compiler C API `ireeCompilerGetRevision`. (#13184) by Stella Laurenzo · 2 years ago
  29. 8a1182c Put ccache hint in docs on one line. (#13185) by Stella Laurenzo · 2 years ago
  30. 20e014b Enable Shared Memory Swizzle Optimization for Batched Matmul (#13060) by Manish Gupta · 2 years ago
  31. 4880d00 [Dev Container] - Adding VSCode Dev Container Capability (#12921) by Jonathan DEKHTIAR · 2 years ago
  32. 8548216 Fix exportTargetAttr construction in InlineExecutablesPass. (#13144) by Scott Todd · 2 years ago
  33. 6a7f69f Fixing vm::ref operator& after type consistency changes. (#13178) by Ben Vanik · 2 years ago
  34. b8527c6 NFC - Parameterize the matmul tensor core strategy towards generaliza… (#13177) by Nicolas Vasilache · 2 years ago
  35. 04d2ba5 Updating architecture diagram with "plugins" in a few places. (#12615) by Ben Vanik · 2 years ago
  36. d2f5a7c Update `experimental/web/sample_dynamic/` after runtime changes. (#13174) by Scott Todd · 2 years ago
  37. 56c4eff Update TensorFlow package in `samples` Dockerfile to 2.12.0. (#13171) by Scott Todd · 2 years ago
  38. e4e2398 Add CI workflow to run android tests on mobile phones (#13024) by Jerry Wu · 2 years ago
  39. 81cf28c Simplifying iree-run-mlir by making it run only a single function. (#13149) by Ben Vanik · 2 years ago
  40. 0d5f837 [xla] Drop arg & res attributes on XLA import (#13154) by Jacques Pienaar · 2 years ago
  41. 9040c43 Opt in to `-Wc++20-extensions` warning in clang. (#13158) by Scott Todd · 2 years ago
  42. 48d7a42 Replace `ukernel.mmt4d` by `ukernel.generic`. (#13143) by bjacob · 2 years ago
  43. 479900a [mhlo] Remove redundant preprocessing pattern. (#13153) by Jacques Pienaar · 2 years ago
  44. abbe3da [StableHLO][NFC] Do not alias the stablehlo namespace (#13150) by Jakub Kuderski · 2 years ago
  45. cc783ff [StableHLO] Port convolution to linalg lowering (#13151) by Jakub Kuderski · 2 years ago
  46. 73f29cf Appease Windows compiler (#13155) by Nicolas Vasilache · 2 years ago
  47. cf3d786 Add a transform dialect CUDA strategy for unaligned f32 matmul that are still divisible by 4 (#13098) by Thomas · 2 years ago
  48. 8d1a790 Integrate llvm-project at c2c49f464435 and bump dependencies (#13135) by Kojo Acquah · 2 years ago
  49. 0b92d79 Enable passing tests on CPU. (#13147) by Han-Chung Wang · 2 years ago
  50. 1859381 [StableHLO] Port rng to linalg lowering (#13137) by Jakub Kuderski · 2 years ago
  51. 9461d3b Adding support for loading VM modules from dynamic libraries. (#13112) by Ben Vanik · 2 years ago
  52. 62153df Generalize `ukernel.generic` with `strided_outer_dims` attribute (#13138) by bjacob · 2 years ago
  53. 169e6a4 Update mmperf docker image by mariecwhite · 2 years ago
  54. 26a145a Update benchmark documentation to store artifacts in build directory (#13141) by Phoenix Meadowlark · 2 years ago
  55. 5f81fd3 [StableHLO] Port reduce_window to linalg lowering (#13128) by Jakub Kuderski · 2 years ago
  56. fe53bc6 [NVIDIA Container] Fix `PATH` to include NVIDIA binaries (#13126) by Jonathan DEKHTIAR · 2 years ago
  57. 27179e2 Use MPI for NCCL unique ID exchange by default (#12902) by Okwan Kwon · 2 years ago
  58. c205355 fatal non-numerical errors in e2e matmul test (#13139) by bjacob · 2 years ago
  59. 0c3a30e Revert "Reorder ukernel operands to match what `ukernel.generic_raw` can generate" (#13136) by bjacob · 2 years ago
  60. 6f3c0a4 [codeGen] Added math.exp2 expansion. (#13124) by Balaji V. Iyer · 2 years ago
  61. 4b09dc5 Add warpDims argument to buildMapToBlockAndThreads (#13134) by Nicolas Vasilache · 2 years ago
  62. 5abc05f Integrate llvm-project at de088dd3a0aa and bump dependencies (#13130) by Kojo Acquah · 2 years ago
  63. 63fee48 Adding support for passing processor_id/data to dynamic imports. (#13109) by Ben Vanik · 2 years ago
  64. 1b1571d [StableHLO][NFC] Port select_and_scatter tests (#13122) by Jakub Kuderski · 2 years ago
  65. e4c0a11 [StableHLO][NFC] Port reduction tests (#13121) by Jakub Kuderski · 2 years ago
  66. 7171c45 [TransformExtensions] Add a `pad(extract_slice(fill))` -> `fill` pattern to `tiling_canonicalization` (#13093) by qcolombet · 2 years ago
  67. e19fc8e Adding a local executable plugin mechanism. (#12625) by Ben Vanik · 2 years ago
  68. 51fbedf Update KernelConfig.cpp to properly tile batch dim for convolutions. (#13108) by Kojo Acquah · 2 years ago
  69. 70985b5 Expand the CUDA header file pattern for building with CUDA 12.x. (#13106) by pjannaty · 2 years ago
  70. e68e0ba [ci] Make vendor/arch definitions in benchmarks better (#13099) by Lei Zhang · 2 years ago
  71. 3f1c154 Reorder ukernel operands to match what `ukernel.generic_raw` can generate (#13103) by bjacob · 2 years ago
  72. f98b5d8 Compute reduction across batch dims (#13091) by harsh-nod · 2 years ago
  73. b798319 Fix MSVC warning: wrong pointer type in `_mm_prefetch` (#13102) by bjacob · 2 years ago
  74. e2ce623 `ukernel.generic_raw` op passing only buffer and offset for memref operands (#13053) by bjacob · 2 years ago
  75. ff19816 Integrate LLVM at llvm/llvm-project@75f1f158 (#13094) by Thomas · 2 years ago
  76. c582ec7 Make boot disks bigger instead of futzing with ramdisk and local ssd (#13079) by Geoffrey Martin-Noble · 2 years ago
  77. 544f785 [TransformExtensions] Fix the vector_to_mma_conversion op (#13092) by qcolombet · 2 years ago
  78. 245a4f2 Some of the recently added tests require sm80 (#13077) by Tori Baker · 2 years ago
  79. 1b06f74 Allow more general accumulators when distributing reduction + broadcast + transpose (#13039) by harsh-nod · 2 years ago
  80. 186cdd4 [ROCm] Fix package name prefix and package root dir (#13088) by Boian Petkantchin · 2 years ago
  81. 644820c Add support to emit masked cp.async with 2-D masks (#13084) by Nicolas Vasilache · 2 years ago
  82. 09630d6 Finally moving VM type registration to iree_vm_instance_t. (#12650) by Ben Vanik · 2 years ago
  83. 8be370c [StableHLO] Port patterns to handle scalar op to arith lowering (#13083) by Jakub Kuderski · 2 years ago
  84. df166ed Adding flow.tensor.alloc op for unique allocations. (#13081) by Ben Vanik · 2 years ago
  85. 5c005b7 [StableHLO] Re-enable linalg.map lowering tests for non-pointwise ops (#13082) by Jakub Kuderski · 2 years ago
  86. 87ef655 Add a structured op matcher for 2d convolutions (#12805) by Quinn Dawkins · 2 years ago
  87. 40ecb03 [StableHLO] Port more non-pointwise linalg lowering tests (#13080) by Jakub Kuderski · 2 years ago
  88. 586f19e Add codegen side plumbing for calling micro kernels from codegen backends. (#12831) by MaheshRavishankar · 2 years ago
  89. e635b2f Disable gather vectorization (#13067) by Jacques Pienaar · 2 years ago
  90. 60b8ef4 [StableHLO][NFC] Enable FileCheck variable scope in linalg lowering tests (#13063) by Jakub Kuderski · 2 years ago
  91. e670ee5 [StableHLO] Add missing pointwise op tests (#13066) by Jakub Kuderski · 2 years ago
  92. d904673 [docs] Fix the cmake command for the tracy convenience targets (#13041) by qcolombet · 2 years ago
  93. 0eeae4f [LLVMGPU] Add conversion of masked transfer read op to async copy (#13070) by Thomas · 2 years ago
  94. 4ce5fb6 Switch to upstream the TrackingListener (#12775) by Matthias Springer · 2 years ago
  95. 27b4b5b [runtime] Add iree::vm::make_ref helper (#12985) by Eugene Zhulenev · 2 years ago
  96. 8af7ef7 Integrate LLVM at llvm/llvm-project@da68d216 (#13074) by Thomas · 2 years ago
  97. 76f0aa1 Remove iree-tools-xla from the release build. (#13071) by Stella Laurenzo · 2 years ago
  98. 8ba879a Delete unused iree-import-xla path (#13068) by Jacques Pienaar · 2 years ago
  99. 035bfc5 Use pinned iree-tools-tflite in benchmarks (#13057) by Jerry Wu · 2 years ago
  100. fe5409a Add transform dialect op to reorder transpose (#13045) by harsh-nod · 2 years ago