  1. fe9cb17 Touch up Python build instructions. (#11424) by Scott Todd · 2 years, 4 months ago
  2. fcae6d7 Remove strip_overloads from native_training example. by Daniel Ellis · 2 years, 4 months ago
  3. 150d591 Cherry pick llvm @b2bba5b65c9f90f9c75da35fcedec08a01640d80 (#11469) by Guray Ozen · 2 years, 4 months ago
  4. d3b3cc1 Fix bug in tile_to_foreach_thread mapping computation (#11312) by Matthias Springer · 2 years, 4 months ago
  5. d52c5f3 Avoid mutating stream partition op infos while testing. (#11465) by Ben Vanik · 2 years, 4 months ago
  6. 034fe6a Cherry pick D139308 (#11454) by MaheshRavishankar · 2 years, 4 months ago
  7. 849e227 [vulkan] Enable MobileBERT i8 benchmark on Mali (#11464) by Jakub Kuderski · 2 years, 4 months ago
  8. cca23d8 Integrate llvm/llvm-project@279d294d26c3 (#11461) by Lei Zhang · 2 years, 4 months ago
  9. 295b4cb Followup transform dialect integrate cleanups (#11458) by Nicolas Vasilache · 2 years, 4 months ago
  10. 1f70517 NFC: copy over external dialects to integrations/tensorflow (#11459) by Lei Zhang · 2 years, 4 months ago
  11. 8e0022d Split `iree-dispatch-linalg-on-tensors-pass` into two (#11313) by Guray Ozen · 2 years, 4 months ago
  12. 4d4230b [LLVMGPU] Add basic support for target feature checks (#11453) by Thomas · 2 years, 4 months ago
  13. 52c2e35 Relands "Switch e2e tests and benchmarks to `--iree-flow-enable-data-tiling` on `llvm-cpu`" (#11443) by Han-Chung Wang · 2 years, 4 months ago
  14. 48ad5f8 Delete build_configurations Buildkite pipeline (#11450) by Geoffrey Martin-Noble · 2 years, 4 months ago
  15. eb1e2c4 Change VM dialect to generate and use prefixed accessors (#11444) by Lei Zhang · 2 years, 4 months ago
  16. b892aa0 Enable compilation statistics tracking for Android benchmarks (#11436) by Jerry Wu · 2 years, 4 months ago
  17. 0573f4f Transform dialect bringup (#11422) by Nicolas Vasilache · 2 years, 4 months ago
  18. 01cf67a Add e2e tests for winograd (#11428) by harsh-nod · 2 years, 4 months ago
  19. b792e2b Fix Winograd constant folding for F16 weights (#11433) by harsh-nod · 2 years, 4 months ago
  20. 8f59ef4 Removing notes file that accidentally made its way into main. by Ben Vanik · 2 years, 4 months ago
  21. ad23611 Adding plumbing and samples of custom CUDA/SPIR-V/CPU dispatch code. (#11287) by Ben Vanik · 2 years, 4 months ago
  22. 05f29e2 Supporting omitted subgroup sizes when none are needed (from PR #11406). by Ben Vanik · 2 years, 4 months ago
  23. b4bf5f7 Splitting iree-sample-deps from iree-test-deps. by Ben Vanik · 2 years, 4 months ago
  24. 5efe034 Adding examples of custom CUDA/SPIR-V/CPU dispatch code. by Ben Vanik · 2 years, 4 months ago
  25. b71aa53 Adds support for HAL executable object linkage. by Ben Vanik · 2 years, 4 months ago
  26. aca8cc6 Fixing LLVM target lookup to use the executable target as truth. by Ben Vanik · 2 years, 4 months ago
  27. 27be42f Adding collectives HAL operations and compiler support. (#11342) by Ben Vanik · 2 years, 4 months ago
  28. 0066c7a Adding initial stream.async.collective op. by Ben Vanik · 2 years, 4 months ago
  29. a0ab5e5 Adding stream.channel.default memoization pass. by Ben Vanik · 2 years, 4 months ago
  30. 233ebb4 Adding stream.cmd.collective & co ops. by Ben Vanik · 2 years, 5 months ago
  31. 8b92a9f Adding hal.command_buffer.collective & co runtime imports. by Ben Vanik · 2 years, 5 months ago
  32. 2165fd2 Adding hal.command_buffer.collective & co ops. by Ben Vanik · 2 years, 5 months ago
  33. 9a1ab32 Adding iree_hal_channel_t and the iree_hal_command_buffer_collective API. by Ben Vanik · 2 years, 4 months ago
  34. be4c5fc Integrate llvm/llvm-project@ca23b7ca476fb (#11429) by Lei Zhang · 2 years, 4 months ago
  35. b3fa021 Revert "Switch e2e tests and benchmarks to `--iree-flow-enable-data-tiling` on `llvm-cpu`" (#11435) by Jerry Wu · 2 years, 4 months ago
  36. c781d6a Re-enabling cuda's split-k test (#11431) by Murali Vijayaraghavan · 2 years, 4 months ago
  37. 044017f Switch e2e tests and benchmarks to `--iree-flow-enable-data-tiling` on `llvm-cpu` (#11411) by bjacob · 2 years, 4 months ago
  38. fdc340e ukernel tweaks (#11280) by bjacob · 2 years, 4 months ago
  39. 8da14f8 [spirv] Fix C matrix promotion with bufferization allocations (#11418) by Lei Zhang · 2 years, 4 months ago
  40. 7011e9f Move IREE LLVM backends to use opaque pointers. (#11404) by MaheshRavishankar · 2 years, 4 months ago
  41. 46efefa Add @hanhanW to FLOW CODEOWNERS (#11427) by Han-Chung Wang · 2 years, 4 months ago
  42. 9ec9bd9 NFC: refresh external dialects under integrations/tensorflow (#11426) by Lei Zhang · 2 years, 4 months ago
  43. c7a935f [spirv] Enable use wave32 mode for AMD architectures (#11425) by Lei Zhang · 2 years, 4 months ago
  44. 8eff5dd [vulkan] Add support for VK_EXT_subgroup_size_control (#11406) by Lei Zhang · 2 years, 4 months ago
  45. dc15a27 Add pass to convert convolutions to Winograd form (#11395) by harsh-nod · 2 years, 4 months ago
  46. b5281b7 NFC: remove the unnecessary extra setTranslationInfo helper (#11414) by Lei Zhang · 2 years, 4 months ago
  47. 5de1a38 [Flow] Move Convert1x1FilterConvToMatmul to come after DetachElementwiseFromNamedOps (#11410) by Quinn Dawkins · 2 years, 4 months ago
  48. b07d60c Plumb through support for controlling subgroup size in CodeGen (#11388) by Lei Zhang · 2 years, 4 months ago
  49. 895cf38 [spirv] Add support for Winograd output op (#11409) by harsh-nod · 2 years, 4 months ago
  50. 7649620 Set maximum number of threads in the thread block for CUDA (#11387) by Guray Ozen · 2 years, 4 months ago
  51. 08aea95 Enable fusion for unpack + elementwise ops. (#11403) by Han-Chung Wang · 2 years, 4 months ago
  52. a06805f Improve generated cmake files verification and update lint.sh (#11381) by Jerry Wu · 2 years, 4 months ago
  53. 8400fdb Turn off ccache in buildkite samples build. (#11407) by Scott Todd · 2 years, 4 months ago
  54. 74b5c94 Build linux benchmark tools in CI (#11397) by Jerry Wu · 2 years, 4 months ago
  55. a32ed6a Add ConvPerf workflow (#11334) by mariecwhite · 2 years, 4 months ago
  56. 16dbf3b [spirv] Change illegal configuration test to use user control (#11398) by Lei Zhang · 2 years, 4 months ago
  57. 42a4dc7 [spirv] Add support for Winograd input op (#11375) by harsh-nod · 2 years, 4 months ago
  58. e82cbe6 NFC: Separate setting translation info and dispatch configuration (#11400) by Lei Zhang · 2 years, 4 months ago
  59. 884e361 NFC: Add better error message. (#11401) by MaheshRavishankar · 2 years, 4 months ago
  60. a303b95 Do not try to fuse with ops that are cloned into dispatch regions anyway. (#11399) by MaheshRavishankar · 2 years, 4 months ago
  61. 5d8b054 [WebGPU] Push constants Storage i32 -> Uniform vector<4xi32>. (#11392) by Scott Todd · 2 years, 4 months ago
  62. 4ccfe79 [mlir][gpu] Pack and unpack to enable f16 and int8 warp reduce. (#11349) by Stanley Winata · 2 years, 4 months ago
  63. d110d1b Adopt the new memref lowering process (#11261) by qcolombet · 2 years, 4 months ago
  64. 0169788 Support distribution to more than 3 dimension at the workgroup level (#11385) by Thomas · 2 years, 4 months ago
  65. 9484142 Switch e2e model tests to use e2e test artifacts (#11380) by Jerry Wu · 2 years, 4 months ago
  66. e986cdb Integrate at llvm/llvm-project@61aed52c and bump dependencies (#11393) by Thomas · 2 years, 4 months ago
  67. 8753312 Migrate the rest of the alternate configurations builds from Buildkite (#11391) by Geoffrey Martin-Noble · 2 years, 4 months ago
  68. 8b19b3c Adopt the new memref lowering process by Quentin Colombet · 2 years, 4 months ago
  69. 7b75cf7 Add test script for native training. by Daniel Ellis · 2 years, 4 months ago
  70. 503ce22 test maxntid by Guray Ozen · 2 years, 4 months ago
  71. e5a213a Cherry-pick some SPIR-V related MLIR commits (#11384) by Lei Zhang · 2 years, 4 months ago
  72. 69b8e23 Add ResNet50 to new benchmark suites (#11172) by Jerry Wu · 2 years, 4 months ago
  73. 0246972 Bump actions to versions using environment files (#11383) by Geoffrey Martin-Noble · 2 years, 4 months ago
  74. 9a22a37 [SPIRV] Add bank conflict reduction to cooperative matrix pipeline (#11386) by Quinn Dawkins · 2 years, 4 months ago
  75. 214847a Set maximum number of threads in the thread block for CUDA target by Guray Ozen · 2 years, 4 months ago
  76. 275280c Fixes how the mmperf repo is cloned in the mmperf workflow (#11365) by mariecwhite · 2 years, 4 months ago
  77. 3bc3548 [spirv] Use a placeholder pointer type when creating resources (#11382) by Lei Zhang · 2 years, 4 months ago
  78. da5b7f7 Verify the cmake files are properly generated (#11378) by Jerry Wu · 2 years, 4 months ago
  79. f7a12af Use new environment files for GitHub actions step outputs (#11376) by Geoffrey Martin-Noble · 2 years, 4 months ago
  80. 129ae96 Enable fusion for elementwise Linalg op + pack op (#11374) by Han-Chung Wang · 2 years, 4 months ago
  81. 2f4225d [Flow] Add support for linalg.conv2d_nchw_fchw in img2col (#11369) by Quinn Dawkins · 2 years, 4 months ago
  82. 3236f51 Port the AArch64 tile size selection from the old `matmul-to-mmt4d` pass. (#11366) by bjacob · 2 years, 4 months ago
  83. 25f6bf2 Add winograd output op to LinalgExt (#11361) by harsh-nod · 2 years, 4 months ago
  84. 2046e25 Revert "Enable fusion for elementwise Linalg op + pack op" (#11372) by Han-Chung Wang · 2 years, 4 months ago
  85. 5260015 Enable fusion for elementwise Linalg op + pack op (#11284) by Han-Chung Wang · 2 years, 4 months ago
  86. 296d545 Enable GPU pipeline schedule with shared memory stores in stage 0 (#11125) by Quinn Dawkins · 2 years, 4 months ago
  87. 693f45a Remove scikit-learn requirement from native training example (#11368) by Daniel Ellis · 2 years, 4 months ago
  88. b576330 Integrate at llvm/llvm-project@bf15f1e4 and bump dependencies (#11341) by Thomas · 2 years, 4 months ago
  89. 91b3086 Move CUDA llvm optimization to the new pass manager (#11348) by Thomas · 2 years, 4 months ago
  90. 634ca1a Add a test that compiles softmax under aggressive fusion. (#11362) by MaheshRavishankar · 2 years, 4 months ago
  91. 16ab7a6 Encode the matmul type triple in `TensorEncoding` (#11355) by bjacob · 2 years, 4 months ago
  92. 304bda0 Turn off ccache in buildkite configurations build (#11363) by Geoffrey Martin-Noble · 2 years, 4 months ago
  93. 695985b Update GitHub runner to 2.299.1 (#11364) by Geoffrey Martin-Noble · 2 years, 4 months ago
  94. 46e47e3 Use ccache in CI (#11311) by Geoffrey Martin-Noble · 2 years, 4 months ago
  95. 30356ad Cleanup unused variables from a few build system files. (#11358) by Scott Todd · 2 years, 4 months ago
  96. e36ce11 Support e2e run configs in linux benchmark tool (#11074) by Jerry Wu · 2 years, 4 months ago
  97. 971cd4b Cleanup build scripting (#11329) by Geoffrey Martin-Noble · 2 years, 4 months ago
  98. 72e0d3c [WebGPU] Add compilation tests for xla_ops/ and tosa_ops/ (again). (#11327) by Scott Todd · 2 years, 4 months ago
  99. b79ca72 Fixing race in task wait poller timeout handling. (#11352) by Ben Vanik · 2 years, 4 months ago
  100. 5597106 Disable build_test_all_windows job while it is failing. (#11351) by Scott Todd · 2 years, 4 months ago