  1. fce839f Adding IREE parameter archive format and tooling support. (#15670) by Ben Vanik · 1 year, 4 months ago
  2. 2bb8019 Adding iree_io_stream_t and memory stream implementation. (#15668) by Ben Vanik · 1 year, 4 months ago
  3. dc6f0cd Adding multiple_modules sample (and fixing bugs). (#15653) by Ben Vanik · 1 year, 5 months ago
  4. f83ca74 Add newline in parameter help print (#15647) by Quinn Dawkins · 1 year, 5 months ago
  5. 5b2cb64 Fix intermittent failure - functions with `_try_` in their name may fail spuriously. (#15636) by bjacob · 1 year, 5 months ago
  6. 916dae9 Removing io_parameters.read/write in favor of gather/scatter. (#15607) by Ben Vanik · 1 year, 5 months ago
  7. 1012586 Fix MSVC build by disabling AVX-512-BF16 in non-latest MSVC versions. (#15589) by bjacob · 1 year, 5 months ago
  8. 0908ff8 ukernels: add `bf16 * bf16 -> bf16` optimized tile functions for x86 and arm64. (#15543) by bjacob · 1 year, 5 months ago
  9. 16e4346 ukernel test improvements (#15542) by bjacob · 1 year, 5 months ago
  10. 9393f94 Avoid stack allocation for VM->HAL iree_hal_fence_join calls. (#15569) by Ben Vanik · 1 year, 5 months ago
  11. dc506b8 Fix redundant IREE_UK_STATIC_ASSERT macro definition (#15567) by bjacob · 1 year, 5 months ago
  12. 4546b95 Simplify ukernel headers now that C++ is out of the picture (#15564) by bjacob · 1 year, 5 months ago
  13. 199ecee Simplify ukernel headers now that out-of-line asm is out of the picture (#15563) by bjacob · 1 year, 5 months ago
  14. 41aeb02 Add dep for ukernel internal headers (#15562) by Thomas Preud'homme · 1 year, 5 months ago
  15. 2eb6862 Unify compiler and runtime HAL driver plugin systems. (#15551) by Stella Laurenzo · 1 year, 5 months ago
  16. 01cef21 Revert "[CMake] Add dep for ukernel internal headers (#15462)" (#15552) by MaheshRavishankar · 1 year, 5 months ago
  17. 1e41cbf Fix data race (TSan report) on worker thread startup on macOS in `iree_thread_request_affinity` (#15534) by Ben Vanik · 1 year, 5 months ago
  18. 9fc8f82 Fix assertion in VNNI ukernel (#15529) by bjacob · 1 year, 5 months ago
  19. f458a88 Optimized vecmat ukernel tile functions for `i16 x u4 -> i32` on AVX-512-VNNI (#15525) by bjacob · 1 year, 5 months ago
  20. 8633629 Fix data race (TSan report) on worker thread startup on macOS in `iree_thread_request_affinity` (#15499) by bjacob · 1 year, 5 months ago
  21. f66f28f Roll-up of minimal fixes needed to get external-parameter llama models working (#15523) by Stella Laurenzo · 1 year, 5 months ago
  22. 7be3992 [Bindings][Python] Allow copies into a larger target buffer than the source (#15511) by Quinn Dawkins · 1 year, 5 months ago
  23. 094af3d Add note about registering vm list types. (#15279) by Scott Todd · 1 year, 5 months ago
  24. 74506f5 Supporting quantized block types in GGUF (kinda). (#15510) by Ben Vanik · 1 year, 5 months ago
  25. 1582054 Setting worker local memory size based on L2 cache size by default. (#15444) by Ben Vanik · 1 year, 5 months ago
  26. d991dc9 Drop unused functions, but do not drop `-Wno-unused-function` (#15502) by bjacob · 1 year, 5 months ago
  27. 9b4606b Fix iree:span construction error using C++20 (#15489) by Julian Walker · 1 year, 5 months ago
  28. a2733b0 Add iree-dump-parameters python console script entrypoint. (#15490) by Stella Laurenzo · 1 year, 5 months ago
  29. d115d41 Revert "Drop `-Wno-unused-function` on Clang, find out about actually unused functions, silence false positives." (#15483) by bjacob · 1 year, 5 months ago
  30. a4b8ccb Update github branch names etc (NFC) (#15482) by Jacques Pienaar · 1 year, 5 months ago
  31. 3a3c1a4 Fix `fp16` feature on arm64: the proper feature name is `fullfp16`, not `fp16`. (#15479) by bjacob · 1 year, 5 months ago
  32. 0ca4f62 Drop `-Wno-unused-function` on Clang, find out about actually unused functions, silence false positives. (#15471) by bjacob · 1 year, 5 months ago
  33. 65fe91f Python implementation of the low level io parameters API. (#15457) by Stella Laurenzo · 1 year, 5 months ago
  34. 5b0fca4 Handle gguf zero-width types. (#15473) by Stella Laurenzo · 1 year, 5 months ago
  35. b6d7b83 [CMake] Add dep for ukernel internal headers (#15462) by Thomas Preud'homme · 1 year, 5 months ago
  36. e14dff4 Add generated iree.runtime pyi file to enable IDE auto-complete. (#15454) by Stella Laurenzo · 1 year, 5 months ago
  37. d04da61 Always pass all local workgroup memory to each dispatch. (#15439) by Ben Vanik · 1 year, 5 months ago
  38. 618c835 [cuda] Port over CUDA stream-based command buffer impl by Lei Zhang · 1 year, 5 months ago
  39. 59122fd [metal] Fix unused variable when building for iOS (#15430) by Lei Zhang · 1 year, 5 months ago
  40. a306a28 Adding support for splat entries in parameter indices. (#15420) by Ben Vanik · 1 year, 5 months ago
  41. 11ced0c Adding parameters as a concept to stream/hal/tooling. (#15104) by Ben Vanik · 1 year, 5 months ago
  42. 988f7c5 optimized s16s16s32 mmt4d tile functions on x86 (#15365) by bjacob · 1 year, 5 months ago
  43. 573f5e9 Merge docs/developers into docs/website/. (#15396) by Scott Todd · 1 year, 5 months ago
  44. 8d7dc80 Stop trying to use builtin `_Float16` (#15388) by bjacob · 1 year, 5 months ago
  45. d1d63c3 Add riscv vector extension in cpu feature using hwcap (#15306) by Yun Hsiang · 1 year, 5 months ago
  46. 4a20b91 Drop vulkan-spirv test cases from modules/check/test/*. (#15356) by Scott Todd · 1 year, 5 months ago
  47. 3d1d8c8 ukernels: sub-8-bit support, types `s16 * u4 -> s32` and `s16 * s16 -> s32` (#15343) by bjacob · 1 year, 5 months ago
  48. c0525ad Update the usage of the transform dialect interpreter (#15340) by Nicolas Vasilache · 1 year, 5 months ago
  49. 41e7289 ukernels: stop abusing signless as signed (#15338) by bjacob · 1 year, 5 months ago
  50. 44c9346 Patch some Windows C code that breaks under gcc/clang. (#15326) by Scott Todd · 1 year, 5 months ago
  51. 97a1394 Adding `vm.br_table` op. (#15286) by Ben Vanik · 1 year, 5 months ago
  52. 1567c80 [vm] Fix vm ref trace function definition (#15247) by Lei Zhang · 1 year, 5 months ago
  53. 205c88d [metal] Retain device until command buffer completion (#15288) by Lei Zhang · 1 year, 5 months ago
  54. 094d9ba [Python] Add Fence VMRef Binding to enable async-exec on py. (#15263) by Stanley Winata · 1 year, 5 months ago
  55. e79fb2b [metal] Fix staging buffer alignment calculation (#15272) by Lei Zhang · 1 year, 5 months ago
  56. 0bdf338 [metal] Fix imported vm buffer ref counting in allocator (#15271) by Lei Zhang · 1 year, 5 months ago
  57. aa5602d Improvements to e2e matmul tests (take 2) (#15259) by bjacob · 1 year, 5 months ago
  58. eb9b8b6 Revert "Improvements to e2e matmul tests" (#15252) by bjacob · 1 year, 6 months ago
  59. 71c22da Improvements to e2e matmul tests (#15243) by bjacob · 1 year, 6 months ago
  60. 5a20dce bf16: select appropriate tile sizes on x86 and Arm, and enable in x86 bitcode build (#15244) by bjacob · 1 year, 6 months ago
  61. 63381a8 Switching external resources to be device-local only. (#14016) by Ben Vanik · 1 year, 6 months ago
  62. b13037b Fix build after #15151 (#15236) by Andrzej Warzyński · 1 year, 6 months ago
  63. add9417 [Bindings] Implement alloc + copy to local host when map is unavailable. (#14997) by Stanley Winata · 1 year, 6 months ago
  64. 82611a9 Making execution region results queue-ordered allocas. (#15149) by Ben Vanik · 1 year, 6 months ago
  65. 6b5b989 Remove unnecessary pragma (#15224) by bjacob · 1 year, 6 months ago
  66. f84545e Implementing stack trace capture on iree_status_t for Win/Mac. (#15151) by Ben Vanik · 1 year, 6 months ago
  67. 1fa8b48 Replace more uses of `cc_library` with `iree_runtime_cc_library`. (#15204) by Scott Todd · 1 year, 6 months ago
  68. 3eaface Add an API to disable nanobind's leak checker. (#15189) by Stella Laurenzo · 1 year, 6 months ago
  69. 2a42fd3 `avx512bf16` ukernel: work around yet another Clang-16 crash (#15164) by bjacob · 1 year, 6 months ago
  70. ef25b0c [metal] Fix creating device via registry path (#15142) by Lei Zhang · 1 year, 6 months ago
  71. 4983668 `mmt4d` ukernel for the `bf16*bf16->f32` case using AVX-512-BF16 (#15089) by bjacob · 1 year, 6 months ago
  72. 77a8741 Fixing TRACY_NO_EXIT on MacOS and supporting MacOS tracy builds. (#15143) by Ben Vanik · 1 year, 6 months ago
  73. 94e7e23 Revert "Fixing TRACY_NO_EXIT on MacOS. (#15139)" (#15140) by mariecwhite · 1 year, 6 months ago
  74. 198af34 Fixing TRACY_NO_EXIT on MacOS. (#15139) by Ben Vanik · 1 year, 6 months ago
  75. 3add457 Adding iree_io_file_handle_t placeholder. (#15101) by Ben Vanik · 1 year, 6 months ago
  76. ad64ecc [experimental][ROCM] Add shared memory support on ROCM RT and Target. (#15097) by Stanley Winata · 1 year, 6 months ago
  77. b3e5a43 Add python binding for device profile flushing (#15095) by Quinn Dawkins · 1 year, 6 months ago
  78. 1b5eec9 [vulkan] Enable coop matrix features when available (#15056) by Jakub Kuderski · 1 year, 6 months ago
  79. 8ac3a1a Optimized narrow mmt4d tile functions (for small M0) (#14995) by bjacob · 1 year, 7 months ago
  80. cc7d6e9 Fixing cpuinfo Mac build break from #14969. by Ben Vanik · 1 year, 7 months ago
  81. fb9e1b6 Adding `--iree-vulkan-experimental-indirect-bindings=true` flag. (#14977) by Ben Vanik · 1 year, 7 months ago
  82. 0f4dd73 Adding `--task_topology_cpu_ids=` flag. (#14969) by Ben Vanik · 1 year, 7 months ago
  83. d24d024 Changing IREE_CPU_DATA* enums to free floating constants. (#14991) by Ben Vanik · 1 year, 7 months ago
  84. bc45174 Fix benchmark.py handling of multiple entry functions. (#14884) by Scott Todd · 1 year, 7 months ago
  85. af14b98 Tentatively work around MSVC compiler bug in CPUID (#14968) by bjacob · 1 year, 7 months ago
  86. 7313867 Handling useless initial task barriers in command buffers. (#14950) by Ben Vanik · 1 year, 7 months ago
  87. ec74fe0 Workaround for Windows COM headers #defining `interface`. (#14964) by Ben Vanik · 1 year, 7 months ago
  88. e3578d5 Removing VMA. (#14960) by Ben Vanik · 1 year, 7 months ago
  89. 801356f Un-unroll ukernel C+intrinsics code. (#14908) by bjacob · 1 year, 7 months ago
  90. 7a6ca42 [vm] Add function name to function signature error message (#14745) by Eugene Zhulenev · 1 year, 7 months ago
  91. 1150e6a Workaround Vulkan driver bug triggered by native allocator code. (#14790) by Scott Todd · 1 year, 7 months ago
  92. 9202104 Remove redundant parameters from ukernel tile funcs (#14895) by bjacob · 1 year, 7 months ago
  93. 9ed3dab Update documentations and setups to Python 3.9 (#14860) by Han-Chung Wang · 1 year, 7 months ago
  94. bd79a14 Adding support for parsing/printing bfloat16 values in tools. (#14869) by Ben Vanik · 1 year, 7 months ago
  95. 7356108 Enable runtime/bindings/python/test/benchmark_test. (#14850) by Scott Todd · 1 year, 7 months ago
  96. b890e22 Add descriptions for root compiler and runtime Python modules. (#14856) by Scott Todd · 1 year, 7 months ago
  97. 409876d ukernels: remove the lone inline-asm ukernel code path (#14853) by bjacob · 1 year, 7 months ago
  98. aabb73a [vulkan] Request 8-/16-bit integer/floating-point features (#14848) by Lei Zhang · 1 year, 7 months ago
  99. 2f9a42c Ukernels: Fix inline assembly constraint for clang-9 (#14786) by Fredrik Knutsson · 1 year, 7 months ago
  100. 82be925 Adding iree_hal_device_profiling_flush. (#14829) by Ben Vanik · 1 year, 7 months ago