1. 4b9fd17 Add verifiers for types of Input ops global load/store (#8908) by Jacques Pienaar · 3 years ago
  2. a6fbb76 Enable compilation tests for microbenchmarks (#8904) by Han-Chung Wang · 3 years ago
  3. 68fc8a0 Merge pull request #8896 from cathyzhyi/tmtensor-wip by Yi Zhang · 3 years ago
  4. eb5de2b Turn on IREE_BUILD_TORCH_MLIR_SUPPORT for standalone compiler-api build by Yi Zhang · 3 years ago
  5. 2014511 drop OPT_FLAGS parameter, no longer used (#8897) by bjacob · 3 years ago
  6. 81a9f30 Vectorize Linalg ops for dynamic cases. (#8888) by Han-Chung Wang · 3 years ago
  7. abe06bb Cherry pick b40e901333b903fd71f17c3314d3e40f8abde074 for llvm-project. (#8901) by MaheshRavishankar · 3 years ago
  8. 41ff121 Refactor run_benchmarks_on_android.py (#8859) by Jerry Wu · 3 years ago
  9. 68fbba0 Update Tracy to 6998546f to pull in @bjacob's fixes (#8898) by Lei Zhang · 3 years ago
  10. 59c4c87 Undo use of upstream pass for first level tile + distribute. (#8885) by MaheshRavishankar · 3 years ago
  11. 27675c2 Add a pattern to remove constant dest dependancy. (#8844) by Han-Chung Wang · 3 years ago
  12. e734c61 [spirv] Add a pipeline to use workgroup memory (#8425) by Lei Zhang · 3 years ago
  13. ae38fb9 Update -mlir-print-ir-* flag names in comments and scripts. (#8884) by Scott Todd · 3 years ago
  14. 7ee7cd5 Make the dylib CTS support IREE_BYTECODE_MODULE_FORCE_SYSTEM_DYLIB_LINKER (#8890) by bjacob · 3 years ago
  15. 75fa2e7 Merge pull request #8840 from cathyzhyi/tmtensor-wip by Yi Zhang · 3 years ago
  16. 871e9f0 Fixup python package metadata for PyPI publishing. (#8886) by Scott Todd · 3 years ago
  17. c35fee8 Make torch-mlir-dialects optional by Yi Zhang · 3 years ago
  18. 6b38136 Restructure launch_configuration tests into couple files. (#8882) by Han-Chung Wang · 3 years ago
  19. 417649d Integrate llvm-project at 95f0f69f1ff8eff34a00a47a236c2f91a2392c70 (#8860) by MaheshRavishankar · 3 years ago
  20. e05a68d Rearranging iree_notification_t code to make it easier to follow. (#8877) by Ben Vanik · 3 years ago
  21. b4c0963 Update README.md (#8880) by Diego Caballero · 3 years ago
  22. 07057c8 Documentation updates for testing and build scripts. (#8720) by Scott Todd · 3 years ago
  23. 844e208 [vulkan] Append entry point name to executable creation Tracy zone (#8875) by Lei Zhang · 3 years ago
  24. b07c0b9 Merge pull request #8873 from google/benvanik-join-workers by Ben Vanik · 3 years ago
  25. 9ca7563 Make benchmark artifact generation a bit more flexible. (#8846) by Scott Todd · 3 years ago
  26. 519815e Merge pull request #8853 from google/benvanik-benchmark-executables by Ben Vanik · 3 years ago
  27. 4afc558 Enable CUDA on remainder of eligible bots. (#8868) by Stella Laurenzo · 3 years ago
  28. 22112e0 Force push latest-snapshot (#8869) by powderluv · 3 years ago
  29. 5943902 Always build CUDA on CI. (#8867) by Stella Laurenzo · 3 years ago
  30. 87b8cbf Update docker images to pick up iree_cuda_deps in base. (#8865) by Stella Laurenzo · 3 years ago
  31. 9d54563 Add a sript to fetch and trim the CUDA toolkit deps needed to build. (#8862) by Stella Laurenzo · 3 years ago
  32. 4f03b4c Add BenchmarkDriver to run common benchmark flow. (#8654) by Jerry Wu · 3 years ago
  33. 6af56a2 [ci] Publish source MLIR modules after benchmark compilation (#8858) by Lei Zhang · 3 years ago
  34. 2453fe5 matmul test, remove native_vector[] from config (#8857) by Thomas · 3 years ago
  35. d3f2fc3 Fix the format of driver and target_arch. (#8829) by Jerry Wu · 3 years ago
  36. 7cf793f Fix data race: different mutexes were guarding the same data (#8856) by bjacob · 3 years ago
  37. d30bd2b Order worker initialization vs other threads stealing work (#8854) by bjacob · 3 years ago
  38. fc22cb3 Fix memory handling problem in matmul_test and renable cuda tests (#8797) by Thomas · 3 years ago
  39. f8e11fd Adding dispatch support to iree-benchmark-module. by Ben Vanik · 3 years ago
  40. 7a0a746 Adding -iree-hal-dump-executable-benchmarks-to= flag. by Ben Vanik · 3 years ago
  41. 37f0e7d Make MapElementTypeToDType consistent with the enum (#8810) by Sean Silva · 3 years ago
  42. b788b1b Fix tracing macro for IREE_HAL_CUDA_ALLOCATOR_ID (#8852) by nirvedhmeshram · 3 years ago
  43. 697fe38 Merge pull request #8837 from matthias-springer/fix_bufferization_inparallel2 by Matthias Springer · 3 years ago
  44. 795bb53 Fix bufferization of in_parallel by Matthias Springer · 3 years ago
  45. 74f7772 Modify elementwise fusion for changes in D123236. (#8841) by MaheshRavishankar · 3 years ago
  46. d07d59c Add tests and benchmarks for transpose op targeting AVX2 (#8750) by Diego Caballero · 3 years ago
  47. fed9cca Add tracing macro for IREE_HAL_CUDA_ALLOCATOR_ID (#8845) by nirvedhmeshram · 3 years ago
  48. 24267c6 Proper fixes for padding in IREE CodegenStrategy (#8843) by Han-Chung Wang · 3 years ago
  49. 1ddd917 Fixes for CompilationInfoAttr::get methods. (#8842) by Han-Chung Wang · 3 years ago
  50. 58ed257 Handle offsets/strides correctly while rewriting destructive updates. (#8793) by MaheshRavishankar · 3 years ago
  51. 6b93fc7 Merge pull request #8838 from google/benvanik-buffer-utils by Ben Vanik · 3 years ago
  52. 9e0536e Generalize bazel_to_cmake iree-dialects conversions. (#8800) by Scott Todd · 3 years ago
  53. 8df3a79 Remove retired RISC-V LLVM option (#8830) by CindyLiu · 3 years ago
  54. 5c16bbb Update SwiftShader to d15c4248 (2022-04-07) by Lei Zhang · 3 years ago
  55. 2d5bb6f Adding dedicated tracy allocation tracking for CUDA. by Ben Vanik · 3 years ago
  56. f9f670c Updating iree.natvis to the latest enums. by Ben Vanik · 3 years ago
  57. 210f4f9 Adding utility for reusing the subspan buffer implementation. by Ben Vanik · 3 years ago
  58. 435e309 Adding IREE_ASSERT_REF_COUNT_ZERO helper. by Ben Vanik · 3 years ago
  59. 2444059 Adding IREE_HOST_SIZE_MAX and IREE_DEVICE_SIZE_MAX. by Ben Vanik · 3 years ago
  60. ae8ecce Specially handle width-sensitive arith cast ops. (#8809) by Sean Silva · 3 years ago
  61. 6eac24d Add peeling support to fusion. (#8835) by Nicolas Vasilache · 3 years ago
  62. 08dfc09 Add support for non-string padding values. (#8834) by Nicolas Vasilache · 3 years ago
  63. 766a515 hide output of adb push (#8806) by bjacob · 3 years ago
  64. 210c1f5 redirect capture output (#8828) by bjacob · 3 years ago
  65. 7a3bbd3 Add support for tile interchange. (#8664) by Han-Chung Wang · 3 years ago
  66. 793b9cf Add workgroup swizzling for better cache reuse (#8789) by nirvedhmeshram · 3 years ago
  67. 1b9e84c Adopt TransformDialectExtension and add iree_bufferize + iree_set_num_workgroups_to_one transform ops (#8821) by Nicolas Vasilache · 3 years ago
  68. eb0d678 Improve compilation time and execution time for quantized matmul on ARM (#8815) by Han-Chung Wang · 3 years ago
  69. 7094918 Bump LLVM to 50de659adcc1 (#8819) by Nicolas Vasilache · 3 years ago
  70. 9f81fa0 [spirv] Fix SPIRVTileAndDistribute flow (#8808) by Lei Zhang · 3 years ago
  71. 05000ad [spirv] Remove obsolete pass for distributing copy (#8813) by Lei Zhang · 3 years ago
  72. fd0f3d1 Add Mobilenet V3 UINT8 to presubmits (#8812) by mariecwhite · 3 years ago
  73. b2076c8 Bump tolerance to 1e-5 and enable passing tests. (#8792) by Han-Chung Wang · 3 years ago
  74. 030e197 [spirv] Fix fp16 vectorization flow (#8799) by Lei Zhang · 3 years ago
  75. fa9b0e1 Expose tm_tensor input type to Python (#8803) by Sean Silva · 3 years ago
  76. d2fa8a2 Add Mobilebert Quant to Presubmits (#8796) by mariecwhite · 3 years ago
  77. e4f2143 Cherry pick MLIR "[mlir][vector] Fold extract(broadcast) of same rank" (#8804) by Lei Zhang · 3 years ago
  78. f4b23d7 Remove redundant linalg.fill op in mhlo.concatenate -> Linalg lowering. (#8795) by Han-Chung Wang · 3 years ago
  79. 7814a0a Delete most get_started/ docs and clean up what's left. (#8801) by Scott Todd · 3 years ago
  80. af961d7 Allow using capstone-next for the tracy profiler. (#8760) by bjacob · 3 years ago
  81. 773e654 Update releases to use bazel 5.1.0. by Stella Laurenzo · 3 years ago
  82. d52f47e [NFC] Apply some cleanups to HAL::EntryPointOp (#8787) by Nicolas Vasilache · 3 years ago
  83. 68ee844 Integrate llvm-project and bump dependencies. (#8786) by Han-Chung Wang · 3 years ago
  84. 3148a51 Adding a comment and print to iree-benchmark-module. (#8788) by Ben Vanik · 3 years ago
  85. daec83b Disabling CUDA e2e matmul tests pending #8784. (#8785) by Ben Vanik · 3 years ago
  86. 6954424 Update bug_report.md (#8783) by Han-Chung Wang · 3 years ago
  87. 8934c94 Fixes configurations for matvec and dot. (#8775) by Han-Chung Wang · 3 years ago
  88. 89c1f8a [vulkan] Improve iree-run-module with GUI (#8781) by Lei Zhang · 3 years ago
  89. 107b2a8 Remove default assignee for issues (#8776) by Geoffrey Martin-Noble · 3 years ago
  90. 4a0b85b [spirv] Invoke TransposeOp canonicalization after vectorization (#8726) by Lei Zhang · 3 years ago
  91. b47b7c3 Add BenchmarkSuite to load benchmarks. (#8753) by Jerry Wu · 3 years ago
  92. 90c0649 Integrate llvm-project and bump dependencies. (#8777) by Han-Chung Wang · 3 years ago
  93. a9ce257 Merge pull request #8768 from google/benvanik-limit-host-visibility by Ben Vanik · 3 years ago
  94. 406d7b3 Merge pull request #8738 from google/benvanik-generic-type-demotion by Ben Vanik · 3 years ago
  95. 0339fa0 Removing IREE_HAL_BUFFER_USAGE_ALL and tightening up host visibility. by Ben Vanik · 3 years ago
  96. 91f017e Copy iree-dialects/ to integrations/tensorflow (#8774) by Han-Chung Wang · 3 years ago
  97. dc6bbbd Allow multiple nested InParallelOp -> HAL rewrite. (#8771) by Nicolas Vasilache · 3 years ago
  98. 80bd1c5 [flow] Verify the dynamic dims for all ShapeAwareOp's (#8773) by Sean Silva · 3 years ago
  99. 380e154 Forward tensor.insert_slice coming from in_parallel lowering to flow.… (#8757) by Nicolas Vasilache · 3 years ago
  100. fde35ef Fix for InitializeEmptyTensors (#8772) by Sean Silva · 3 years ago