1. c91759d Retire unused RISCV compile flag (#15484) by CindyLiu · 1 year, 5 months ago
  2. ed32801 Resolve `host` CPU to `generic` outside of x86 (#15481) by bjacob · 1 year, 5 months ago
  3. 3e2ae27 [SPIRV] Fix vector masking for transfer scalarization patterns (#15480) by Quinn Dawkins · 1 year, 5 months ago
  4. a2733b0 Add iree-dump-parameters python console script entrypoint. (#15490) by Stella Laurenzo · 1 year, 5 months ago
  5. d4f5368 LLVM integrate integrate-llvm-20231107 (#15470) by Stella Laurenzo · 1 year, 5 months ago
  6. d5ab0f0 Revert "[DataTiling] Add supports for materializing elementwise ops. (#15446)" by Stella Laurenzo · 1 year, 5 months ago
  7. bd60372 [DataTiling] Add supports for materializing elementwise ops. (#15446) by Max191 · 1 year, 5 months ago
  8. d115d41 Revert "Drop `-Wno-unused-function` on Clang, find out about actually unused functions, silence false positives." (#15483) by bjacob · 1 year, 5 months ago
  9. a4b8ccb Update github branch names etc (NFC) (#15482) by Jacques Pienaar · 1 year, 5 months ago
  10. 3a3c1a4 Fix `fp16` feature on arm64: the proper feature name is `fullfp16`, not `fp16`. (#15479) by bjacob · 1 year, 5 months ago
  11. 87ed5fc [GlobalOptimization] Lift `linalg.generic` ops to `linalg.batch_matmul/linalg.batch_vecmat/linalg.batch_matvec` (#15339) by Max191 · 1 year, 5 months ago
  12. 6db32c6 [GlobalOptimization] Support SetEncoding on batch matmul cases with p… (#15371) by Max191 · 1 year, 5 months ago
  13. 0ca4f62 Drop `-Wno-unused-function` on Clang, find out about actually unused functions, silence false positives. (#15471) by bjacob · 1 year, 5 months ago
  14. e604f9d Data-tiling: never use a generic fallback tile size (#15469) by bjacob · 1 year, 5 months ago
  15. d557095 Benchmark on c2-standard-60 in background (#15467) by Jerry Wu · 1 year, 5 months ago
  16. 88f41e5 [Reducer] Add initial check for interestingness (#15427) by Kunwar Grover · 1 year, 5 months ago
  17. d97270d Disable Metal tests when IREE_HAL_DRIVER_METAL is OFF. (#15476) by MaheshRavishankar · 1 year, 5 months ago
  18. 7ab3509 Record narrow static M/N sizes in `EncodingAttr` and rationalize MaterializeEncoding for narrow shapes. (#15431) by bjacob · 1 year, 5 months ago
  19. d509096 Treat `iree-llvmcpu-target-cpu` as an error outside of x86, and continue past errors (#15477) by bjacob · 1 year, 5 months ago
  20. 65fe91f Python implementation of the low level io parameters API. (#15457) by Stella Laurenzo · 1 year, 5 months ago
  21. 5b0fca4 Handle gguf zero-width types. (#15473) by Stella Laurenzo · 1 year, 5 months ago
  22. 2fda954 Add `chlo` dialect to auto input conversion pipeline (#15474) by Rob Suderman · 1 year, 5 months ago
  23. 82dc975 Start LLVM integrate integrate-llvm-20231103 (#15403) by Quinn Dawkins · 1 year, 5 months ago
  24. a0f3628 [CPU][SVE] Fix peeling for scalable-tiled loops (#15460) by Diego Caballero · 1 year, 5 months ago
  25. f4f3451 [GlobalOpt] Do not set encoding if they have preset compilation info. (#15455) by Han-Chung Wang · 1 year, 5 months ago
  26. d8a1643 Re-enable Pixel 6 tests and benchmarks (#15459) by Jerry Wu · 1 year, 5 months ago
  27. b6d7b83 [CMake] Add dep for ukernel internal headers (#15462) by Thomas Preud'homme · 1 year, 5 months ago
  28. 0f42d3f [GlobalOpt] Unset encodings for non-CPU backends. (#15453) by Han-Chung Wang · 1 year, 5 months ago
  29. e3f2ab3 [CPU] Improve tile sizes selection for tensor.pack ops. (#15397) by Han-Chung Wang · 1 year, 5 months ago
  30. 41a23ad [CPU] Materialize encodings into NOP if the target is not supported yet. (#15450) by Han-Chung Wang · 1 year, 5 months ago
  31. 327af04 Support output verification in Android benchmark tool (#15344) by Jerry Wu · 1 year, 5 months ago
  32. 38bfdba [GlobalOptimization] Add pattern to reassociate dequantization + matmul `linalg.gen… (#15278) by Max191 · 1 year, 5 months ago
  33. 1f61c88 Support fetching and streaming artifacts in benchmark tools (#15432) by Jerry Wu · 1 year, 5 months ago
  34. e14dff4 Add generated iree.runtime pyi file to enable IDE auto-complete. (#15454) by Stella Laurenzo · 1 year, 5 months ago
  35. a4b1a78 Remove redundant reshape checks in dot general preprocessing (#15319) by Rob Suderman · 1 year, 5 months ago
  36. d99527b Make default distribution logic divide work evenly. (#15414) by Han-Chung Wang · 1 year, 5 months ago
  37. af5d47d Fixing io_parameters.load VM lowering bug. (#15445) by Ben Vanik · 1 year, 5 months ago
  38. 4a77618 Generalization for ElementsAttr coverage (#15433) by saienduri · 1 year, 5 months ago
  39. f7de6ed Fix dashboard link on iree.perf.dev (#15443) by Jerry Wu · 1 year, 5 months ago
  40. d04da61 Always pass all local workgroup memory to each dispatch. (#15439) by Ben Vanik · 1 year, 5 months ago
  41. 4c71903 Fix `complex.bitcast` inputs to not generate bit manipulations (#15345) by Rob Suderman · 1 year, 5 months ago
  42. e94f3cb Add icons and tags to most website/docs/developers/ pages. (#15413) by Scott Todd · 1 year, 5 months ago
  43. 3e00192 Mention CPU .o files on tips page. (#15436) by Scott Todd · 1 year, 5 months ago
  44. f2e3260 Create oneshot stream command buffer in pending_queue_actions by Lei Zhang · 1 year, 5 months ago
  45. 618c835 [cuda] Port over CUDA stream-based command buffer impl by Lei Zhang · 1 year, 5 months ago
  46. e3671a5 NFC: Rename cuda to cuda2 by Lei Zhang · 1 year, 5 months ago
  47. a02ff0e NFC: Copy over existing stream command buffer impl by Lei Zhang · 1 year, 5 months ago
  48. ddb0d7d Let `EncodingAttr` use `struct(params)` assembly format (#15434) by bjacob · 1 year, 5 months ago
  49. 59122fd [metal] Fix unused variable when building for iOS (#15430) by Lei Zhang · 1 year, 5 months ago
  50. 717c7a0 Temporarily disable Pixel 6 tests and benchmarks (#15426) by Jerry Wu · 1 year, 5 months ago
  51. a306a28 Adding support for splat entries in parameter indices. (#15420) by Ben Vanik · 1 year, 5 months ago
  52. d46a708 Dump CPU .o files with `--iree-hal-dump-executable-intermediates-to=`. (#15412) by Ben Vanik · 1 year, 5 months ago
  53. eb3ec9c [spirv] Fix bufferization allocation in subgroup reduction pipeline (#15425) by Lei Zhang · 1 year, 5 months ago
  54. b28d1f0 [gpu] NFC: Tidy up the VectorReduceToGPU pass (#15424) by Lei Zhang · 1 year, 5 months ago
  55. 8ca883f Ignore Clangd configuration files for now (#15419) by Lei Zhang · 1 year, 5 months ago
  56. 4e1210c Make the schedule release GH job take an explicit override commit. (#15418) by Stella Laurenzo · 1 year, 5 months ago
  57. 640df51 Fix some broken links to benchmark_suites.md. (#15410) by Scott Todd · 1 year, 5 months ago
  58. 089e63b Add IO/Parameters dialect doc to website index page. (#15415) by Scott Todd · 1 year, 5 months ago
  59. cbe5799 Try to make enforce_glob less dumb and fix breakage at head. (#15409) by Stella Laurenzo · 1 year, 5 months ago
  60. 8ef62de Try to make enforce_glob less dumb and fix breakage at head. (#15409) by Stella Laurenzo · 1 year, 5 months ago
  61. 11ced0c Adding parameters as a concept to stream/hal/tooling. (#15104) by Ben Vanik · 1 year, 5 months ago
  62. 988f7c5 optimized s16s16s32 mmt4d tile functions on x86 (#15365) by bjacob · 1 year, 5 months ago
  63. a2a2e8d Build arm64 docker with ccache 4.7.4 (#15307) by Fredrik Knutsson · 1 year, 5 months ago
  64. 48fd11b Disable Pixel 4 e2e regression benchmarks (#15400) by Jerry Wu · 1 year, 5 months ago
  65. 904dfab Add Arm64 variants of build_all and test_all CI jobs (#15237) by Fredrik Knutsson · 1 year, 5 months ago
  66. 7a99948 CPU features flags improvements (#15387) by bjacob · 1 year, 5 months ago
  67. 573f5e9 Merge docs/developers into docs/website/. (#15396) by Scott Todd · 1 year, 5 months ago
  68. d354465 [NFC] Move generic tile sizes selection tests to vmvx_materialize_encoding.mlir (#15394) by Han-Chung Wang · 1 year, 5 months ago
  69. d32d8ce [CPU] Minor clean-up and fixes for mmt4d code generation (#15380) by Diego Caballero · 1 year, 5 months ago
  70. 8e68e98 Add full int8 ViT benchmark by mariecwhite · 1 year, 5 months ago
  71. 6bbdb72 [cuda] Mark event related APIs as unimplemented (#15382) by Lei Zhang · 1 year, 5 months ago
  72. 8d7dc80 Stop trying to use builtin `_Float16` (#15388) by bjacob · 1 year, 5 months ago
  73. 6b93f11 bump torch-mlir (#15389) by Daniel Garvey · 1 year, 5 months ago
  74. 668c020 Cast tensor.empty type to TypeConverter's type during materialization. (#15375) by Han-Chung Wang · 1 year, 5 months ago
  75. 77a8c55 [NFC] Move CPU materialize_encoding tests to Common/CPU/test (#15376) by Han-Chung Wang · 1 year, 5 months ago
  76. d1d63c3 Add riscv vector extension in cpu feature using hwcap (#15306) by Yun Hsiang · 1 year, 5 months ago
  77. 85f4006 [shlo] Misc fixes exposed by jax test. (#15379) by Jacques Pienaar · 1 year, 5 months ago
  78. fd9cd2f Fix some minspec/optional feature bitrot. (#15378) by Stella Laurenzo · 1 year, 5 months ago
  79. 332ac35 Drop AMDGPU in-tree build of device libraries. (#15374) by Stella Laurenzo · 1 year, 5 months ago
  80. fcdddcb Bump ARM64 runner image (#15366) by Jerry Wu · 1 year, 5 months ago
  81. 7c58c58 Use `c2-standard-16` VM to run x86_64 e2e benchmark tests (#15361) by Jerry Wu · 1 year, 5 months ago
  82. 4a20b91 Drop vulkan-spirv test cases from modules/check/test/*. (#15356) by Scott Todd · 1 year, 5 months ago
  83. 3d1d8c8 ukernels: sub-8-bit support, types `s16 * u4 -> s32` and `s16 * s16 -> s32` (#15343) by bjacob · 1 year, 5 months ago
  84. c0525ad Update the usage of the transform dialect interpreter (#15340) by Nicolas Vasilache · 1 year, 5 months ago
  85. 3112576 Adding gcloud CLI support to arm64 runner (#15308) by Fredrik Knutsson · 1 year, 5 months ago
  86. 4d06d20 [pjrt] Towards more mechanical stub generation. (#15363) by Jacques Pienaar · 1 year, 5 months ago
  87. 18b3dd7 LLVM integrate integrate-llvm-20231030 (#15351) by Stella Laurenzo · 1 year, 5 months ago
  88. bd72855 use getTypeBitWidth() to get the element type's bit width (#15360) by Okwan Kwon · 1 year, 5 months ago
  89. 03d655a Fix size calculation in the tensor.empty materialization pattern. (#15359) by Han-Chung Wang · 1 year, 5 months ago
  90. 9f7d6d4 Add falcon benchmarks by mariecwhite · 1 year, 5 months ago
  91. af171c5 Exclude executable files in root .gitignore. (#15266) by Scott Todd · 1 year, 5 months ago
  92. 256fe4f Add "torch" as an `InputType` in `iree/compiler/tools/core.py`. (#15358) by Scott Todd · 1 year, 5 months ago
  93. 5223596 [cuda] Support building node DAG in graph command buffer (#14857) by Eugene Zhulenev · 1 year, 5 months ago
  94. 05928c5 Enable the 'clang' project when building the ROCm target. (#15346) by Scott Todd · 1 year, 5 months ago
  95. 5c9556c Add a pass to materialize encodings into nop. (#15312) by Han-Chung Wang · 1 year, 5 months ago
  96. 092a74d [pjrt] Add primitive jax2tf tests (#15341) by Jacques Pienaar · 1 year, 5 months ago
  97. 2706526 [ROCM] add device path and use it to setup device (#15234) by nirvedhmeshram · 1 year, 5 months ago
  98. 7b92a6d [cuda] Avoid sorting when composing kernel arguments (#15325) by Lei Zhang · 1 year, 5 months ago
  99. 546e372 Add fallback for undo-ing encodings. (#15302) by Han-Chung Wang · 1 year, 5 months ago
  100. ada35b3 Disable folding casting ops into contraction ops by default. (#15342) by Han-Chung Wang · 1 year, 5 months ago