1. b716704 Update git-clang-format ref and clang-format version. (#16792) by Scott Todd · 11 months ago
  2. d18c5d8 [vulkan] Print VkResult as int32 value in errors (#17362) by Lei Zhang · 11 months ago
  3. d8d9b8e [hip] Use PRIhsz for iree_host_size_t values (#17360) by Lei Zhang · 11 months ago
  4. 0b8b13c Converting some runtime benchmarks to use our C API. (#17336) by Ben Vanik · 11 months ago
  5. 9406b9c [runtime] Fix buffer diagnostics compiler errors (#17325) by Benjamin Maxwell · 11 months ago
  6. 3ca0a49 Moving OutlineConstantsPass to flow and adding parameter support. (#17303) by Ben Vanik · 11 months ago
  7. 098b0c4 Add test_amd_mi250 CI job including ROCm matmul tests. (#17293) by erman-gurses · 11 months ago
  8. 71a9945 Adding `--iree-opt-import-parameters=` and cleaning up export. (#16828) by Ben Vanik · 11 months ago
  9. b8ef25c Fixing threading test reported by TSAN. (#17260) by Ben Vanik · 11 months ago
  10. e2bdf9c [runtime][vulkan][cts] Disable flaky test WaitForFiniteTime on Android (#17246) by Boian Petkantchin · 11 months ago
  11. e4b5a93 [runtime][hal][cts] Add test to wait on all semaphores on multiple places simultaneously (#17240) by Boian Petkantchin · 11 months ago
  12. e088c0b [python] Adds DLPack import and export support for BufferView. (#17131) by Stella Laurenzo · 11 months ago
  13. 5fa2480 [runtime][cts] add test where a batch is waiting on a smaller value than signaled (#17141) by Boian Petkantchin · 11 months ago
  14. 729ebc6 [runtime][metal] exclude properly the failing semaphore test (#17151) by Boian Petkantchin · 11 months ago
  15. 30acc53 [runtime][cts] add test where a device batch signals another and the host (#17138) by Boian Petkantchin · 11 months ago
  16. 290d812 [runtime][cts] add semaphore test where a batch waits on another and a host signal (#17130) by Boian Petkantchin · 11 months ago
  17. 568bb31 [runtime][cts] Add test waiting on a semaphore for finite time and fix Vulkan driver (#17126) by Boian Petkantchin · 11 months ago
  18. 655b71a Executable library call hooks system, and a sample Linux/CPU event implementation (#15803) by Benoit Jacob · 11 months ago
  19. 36b3ce1 [runtime][cts] Add test to wait multiple times for the same semaphore value (#17125) by Boian Petkantchin · 11 months ago
  20. 44ccc22 [runtime][hip][cuda] Fix bug where zombified actions may not get cleaned up (#17107) by Boian Petkantchin · 11 months ago
  21. f4bee0c [runtime] Make the runtime more TSan-friendly (#17051) by Boian Petkantchin · 11 months ago
  22. 0f6bc24 [runtime][hip][cuda] Fix mutex locking when waiting on a semaphore (#17118) by Boian Petkantchin · 11 months ago
  23. 78005ef [runtime][cts] add test where 2 batches wait on different semaphore values (#17091) by Boian Petkantchin · 11 months ago
  24. 074cbf3 [runtime] Refactor semaphore submission CTS (#17108) by Boian Petkantchin · 11 months ago
  25. 3f51a55 Replace openxla/iree with iree-org/iree across the project. (#17110) by Scott Todd · 11 months ago
  26. a2476ce [metal] Disable failing semaphore submission test until fixing (#17100) by Lei Zhang · 12 months ago
  27. 3677fbc [runtime] Add semaphore test where 2 batches wait on a former batch amongst 2 (#17080) by Boian Petkantchin · 12 months ago
  28. cd282de [runtime][hip][cuda] Fix waiting on a semaphore on the host (#17073) by Boian Petkantchin · 12 months ago
  29. 459fab6 [runtime][hip][cuda] Fix waiting on wait semaphores before executing actions (#17025) by Boian Petkantchin · 12 months ago
  30. 1c49d6a [runtime][hip][cuda] Add tracing in graph execution mode (#16894) by Boian Petkantchin · 12 months ago
  31. 40f2533 [HIP] Add inline execution mode (#16951) by Nithin Meganathan · 12 months ago
  32. fbd31b0 [Vulkan] Fix coop matrix property initialization (#17023) by Jakub Kuderski · 12 months ago
  33. 2780fd5 Fix typo in benchmark tracing warning message. by Ben Vanik · 12 months ago
  34. 5d2af54 [python] Convert python io tests to unit tests. (#16984) by Stella Laurenzo · 12 months ago
  35. 5ad0fe2 [python] Add missing public alias of symbol. (#16980) by Stella Laurenzo · 12 months ago
  36. 27670b6 Bump nanobind version in more requirement files. (#16976) by Scott Todd · 12 months ago
  37. 11d2259 Fix arm and windows builder issues at head. by Stella Laurenzo · 12 months ago
  38. 4f40080 [python] Flesh out more of the python parameters API. (#16957) by Stella Laurenzo · 12 months ago
  39. be9f097 [hip] Introduce options to control load of libamdhip64.so. (#16766) by Stella Laurenzo · 12 months ago
  40. c46daa6 Fix for iree-run-module with --input=not-a-path.bin (#16919) by James Newling · 1 year ago
  41. daee1dd Ukernels: replace boilerplate by tables. (#16879) by Benoit Jacob · 1 year ago
  42. 2cdf145 fix build errors when tracing mode = 1 (#16884) by Okwan Kwon · 1 year ago
  43. 5f2743b [hip] Move into hal/drivers and build by default (#16706) by Lei Zhang · 1 year ago
  44. 767a611 [tracing] fix build errors when tracing mode = 1 (#16853) by Okwan Kwon · 1 year ago
  45. e7eef08 Adds a nullptr check around optional implementation in dynamic modules. (#16845) by Stella Laurenzo · 1 year ago
  46. 77cc34b Bump numpy dep up to the 2.0 prerelease. (#16800) by Scott Todd · 1 year ago
  47. 2eae9b3 [CPU] Remove 8x8x16 i8mm microkernel by mariecwhite · 1 year ago
  48. 92fe572 [CPU] Add s8s4s32 i8mm ukernel (#16678) by mariecwhite · 1 year ago
  49. b61a918 Enable LTO optimization by default for runtime releases. (#16811) by Stella Laurenzo · 1 year ago
  50. 2395046 [rocm] Excises (almost) dependence on /opt/rocm from the compiler. (#16803) by Stella Laurenzo · 1 year ago
  51. d05b4a1 Splitting stack trace support out of status.c. (#16791) by Ben Vanik · 1 year ago
  52. 15d9039 Embedding executable source contents in binaries for tracing. (#16757) by Ben Vanik · 1 year, 1 month ago
  53. e12ab47 Replace `unaryOperator` by EmitC LogicalNotOp (#16730) by Marius Brehler · 1 year, 1 month ago
  54. e9f7ecd [cuda][hip] Drop name_literal reference like Vulkan side (#16742) by Lei Zhang · 1 year, 1 month ago
  55. 27b837c Replace `binaryOperator` by EmitC ops (#16728) by Marius Brehler · 1 year, 1 month ago
  56. 7da8af6 Cleanup a few docs/references to old paths in HAL/Target. (#16716) by Scott Todd · 1 year, 1 month ago
  57. b5af996 Prefer broadcasting RHS over LHS in AVX-512 multiply-accumulate instructions (#16709) by Benoit Jacob · 1 year, 1 month ago
  58. f34e534 Replace k with m by mariecwhite · 1 year, 1 month ago
  59. 4d3c93f Add missing macros to dotprod ukernel by mariecwhite · 1 year, 1 month ago
  60. 8adae37 [cuda][hip] Add support for semaphore multi wait (#16638) by Lei Zhang · 1 year, 1 month ago
  61. 9d6d99f faster narrow mmt4d ukernels on x86 (#16655) by Benoit Jacob · 1 year, 1 month ago
  62. 4f1f055 mmt4d ukernel: use fewer magic macros to generate tile-functions M0-variants (#16645) by Benoit Jacob · 1 year, 1 month ago
  63. b994b72 Reenable accidentally disabled architecture-specific parts of `mmt4d_test` (#16654) by Benoit Jacob · 1 year, 1 month ago
  64. f433fd2 Using iree.abi.name consistently for arg/result names. (#16635) by Ben Vanik · 1 year, 1 month ago
  65. fe5e69a [cuda][hip] Shorten deferred queue worker name (#16642) by Lei Zhang · 1 year, 1 month ago
  66. 9dfc612 [cuda][hip] Fix worker thread and device host callback synchronization (#16621) by Boian Petkantchin · 1 year, 1 month ago
  67. f66d7f2 Fix enablement of mmt4d ukernel test cases based on ISA code paths built (#16637) by Benoit Jacob · 1 year, 1 month ago
  68. 5180ede mmt4d ukernel: simplification in generic tile funcs: stop using a stack array (#16633) by Benoit Jacob · 1 year, 1 month ago
  69. 8959b90 Make ukernels fallback opt-in and add a `mmt4d_info` ukernel to query the mmt4d implementation. (#16631) by Benoit Jacob · 1 year, 1 month ago
  70. 6ff9a3d Refactor how llvm-cpu check tests interface with ASan/TSan. (#16452) by Scott Todd · 1 year, 1 month ago
  71. e6397cb Change ukernels calling convention to default (#16541) by Benoit Jacob · 1 year, 1 month ago
  72. e991798 Unroll fixed-trip-count loops within mmt4d ukernel tile functions. (#16626) by Benoit Jacob · 1 year, 1 month ago
  73. 88b1d4d Replace std::iterator with our custom iterator typedefs (#16423) (#16583) by Peyman Barazandeh · 1 year, 1 month ago
  74. 9dc8ae4 [cuda][hip] Fix launch host func and worker thread state update (#16568) by Lei Zhang · 1 year, 1 month ago
  75. 862a031 Adding --task_abort_on_failure flag/API. (#16565) by Ben Vanik · 1 year, 1 month ago
  76. 23f2828 Adding iree-benchmark-executable tool. (#16550) by Ben Vanik · 1 year, 1 month ago
  77. c15b610 [EmitC] Remove the forked emitter and generate all the code in the conversion pass (#16357) by Simon Camphausen · 1 year, 1 month ago
  78. d500494 Add s8s4s32 dotprod microkernel (#16473) by mariecwhite · 1 year, 1 month ago
  79. c3b3d96 Adding hal.device.id queries to HAL devices. (#16495) by Ben Vanik · 1 year, 1 month ago
  80. 6d293af Retrying try-lock in synchronization_test to avoid arm64 flakes. (#16436) by Ben Vanik · 1 year, 1 month ago
  81. 4463f8d [python] Enable building of 3.12 wheels on Linux. (#16424) by Stella Laurenzo · 1 year, 1 month ago
  82. 1f3e907 ukernels: update README.md (#16358) by Benoit Jacob · 1 year, 1 month ago
  83. d1e1d05 [python] Add a couple more async APIs. (#16419) by Stella Laurenzo · 1 year, 1 month ago
  84. 00aa173 [hip] Add missing source locations and fix parsing (#16418) by Lei Zhang · 1 year, 1 month ago
  85. d32609e Add s8s4s32 ukernel for ARM (#16259) by mariecwhite · 1 year, 1 month ago
  86. c02b89e [cuda][hip] Guard against NULL cleanup callbacks (#16403) by Lei Zhang · 1 year, 2 months ago
  87. 7c2ec73 Fix a bug in the fastpath of iree_hal_task_semaphore_multi_wait which was doing a spurious wait. (#16404) by Stella Laurenzo · 1 year, 2 months ago
  88. 60ac333 [python] Add a HalDeviceLoop class for routing runtime events to futures. (#16385) by Stella Laurenzo · 1 year, 2 months ago
  89. c70bf22 [HAL] Remove pool assert during allocator creation (#16388) by Nithin Meganathan · 1 year, 2 months ago
  90. 14927d1 Replacing the ancient vm_util with function_io/function_util. (#16351) by Ben Vanik · 1 year, 2 months ago
  91. 9aabcb3 Add conversions for FP8 types (F8E5M2 and F8E4M3) (#16374) by Benoit Jacob · 1 year, 2 months ago
  92. 30901f5 Replacing the ancient vm_util with function_io/function_util. by Ben Vanik · 1 year, 2 months ago
  93. 49f8a61 Adding iree_io_vec_stream_t. by Ben Vanik · 1 year, 2 months ago
  94. 29a7462 Adding iree_io_stdio_stream_t. by Ben Vanik · 1 year, 2 months ago
  95. 0a2483a Splitting iree_io_memory_stream_t from iree/io/stream.h. by Ben Vanik · 1 year, 2 months ago
  96. 9234f42 Add a number of runtime python bindings and refine the HalFence.wait() behavior. (#16371) by Stella Laurenzo · 1 year, 2 months ago
  97. 87bf971 Fixing implicit casting that caused 4GB fill/copy limits in local-task. (#16364) by Ben Vanik · 1 year, 2 months ago
  98. 10fd98b Fixes to enable clang-cl compilation of compiler/runtime. (#16299) by Ben Vanik · 1 year, 2 months ago
  99. 065e04a Adding support for outputting binary files from tooling. (#16291) by Ben Vanik · 1 year, 2 months ago
  100. 406626b [Vulkan][SPIRV] Introduce `address` vulkan device property (#16282) by Jakub Kuderski · 1 year, 2 months ago