- b716704 Update git-clang-format ref and clang-format version. (#16792) by Scott Todd · 11 months ago
- d18c5d8 [vulkan] Print VkResult as int32 value in errors (#17362) by Lei Zhang · 11 months ago
- d8d9b8e [hip] Use PRIhsz for iree_host_size_t values (#17360) by Lei Zhang · 11 months ago
- 0b8b13c Converting some runtime benchmarks to use our C API. (#17336) by Ben Vanik · 11 months ago
- 9406b9c [runtime] Fix buffer diagnostics compiler errors (#17325) by Benjamin Maxwell · 11 months ago
- 3ca0a49 Moving OutlineConstantsPass to flow and adding parameter support. (#17303) by Ben Vanik · 11 months ago
- 098b0c4 Add test_amd_mi250 CI job including ROCm matmul tests. (#17293) by erman-gurses · 11 months ago
- 71a9945 Adding `--iree-opt-import-parameters=` and cleaning up export. (#16828) by Ben Vanik · 11 months ago
- b8ef25c Fixing threading test reported by TSAN. (#17260) by Ben Vanik · 11 months ago
- e2bdf9c [runtime][vulkan][cts] Disable flaky test WaitForFiniteTime on Android (#17246) by Boian Petkantchin · 11 months ago
- e4b5a93 [runtime][hal][cts] Add test to wait on all semaphores on multiple places simultaneously (#17240) by Boian Petkantchin · 11 months ago
- e088c0b [python] Adds DLPack import and export support for BufferView. (#17131) by Stella Laurenzo · 11 months ago
- 5fa2480 [runtime][cts] add test where a batch is waiting on a smaller value than signaled (#17141) by Boian Petkantchin · 11 months ago
- 729ebc6 [runtime][metal] exclude properly the failing semaphore test (#17151) by Boian Petkantchin · 11 months ago
- 30acc53 [runtime][cts] add test where a device batch signals another and the host (#17138) by Boian Petkantchin · 11 months ago
- 290d812 [runtime][cts] add semaphore test where a batch waits on another and a host signal (#17130) by Boian Petkantchin · 11 months ago
- 568bb31 [runtime][cts] Add test waiting on a semaphore for finite time and fix Vulkan driver (#17126) by Boian Petkantchin · 11 months ago
- 655b71a Executable library call hooks system, and a sample Linux/CPU event implementation (#15803) by Benoit Jacob · 11 months ago
- 36b3ce1 [runtime][cts] Add test to wait multiple times for the same semaphore value (#17125) by Boian Petkantchin · 11 months ago
- 44ccc22 [runtime][hip][cuda] Fix bug where zombified actions may not get cleaned up (#17107) by Boian Petkantchin · 11 months ago
- f4bee0c [runtime] Make the runtime more TSan-friendly (#17051) by Boian Petkantchin · 11 months ago
- 0f6bc24 [runtime][hip][cuda] Fix mutex locking when waiting on a semaphore (#17118) by Boian Petkantchin · 11 months ago
- 78005ef [runtime][cts] add test where 2 batches wait on different semaphore values (#17091) by Boian Petkantchin · 11 months ago
- 074cbf3 [runtime] Refactor semaphore submission CTS (#17108) by Boian Petkantchin · 11 months ago
- 3f51a55 Replace openxla/iree with iree-org/iree across the project. (#17110) by Scott Todd · 11 months ago
- a2476ce [metal] Disable failing semaphore submission test until fixing (#17100) by Lei Zhang · 12 months ago
- 3677fbc [runtime] Add semaphore test where 2 batches wait on a former batch amongst 2 (#17080) by Boian Petkantchin · 12 months ago
- cd282de [runtime][hip][cuda] Fix waiting on a semaphore on the host (#17073) by Boian Petkantchin · 12 months ago
- 459fab6 [runtime][hip][cuda] Fix waiting on wait semaphores before executing actions (#17025) by Boian Petkantchin · 12 months ago
- 1c49d6a [runtime][hip][cuda] Add tracing in graph execution mode (#16894) by Boian Petkantchin · 12 months ago
- 40f2533 [HIP] Add inline execution mode (#16951) by Nithin Meganathan · 12 months ago
- fbd31b0 [Vulkan] Fix coop matrix property initialization (#17023) by Jakub Kuderski · 12 months ago
- 2780fd5 Fix typo in benchmark tracing warning message. by Ben Vanik · 12 months ago
- 5d2af54 [python] Convert python io tests to unit tests. (#16984) by Stella Laurenzo · 12 months ago
- 5ad0fe2 [python] Add missing public alias of symbol. (#16980) by Stella Laurenzo · 12 months ago
- 27670b6 Bump nanobind version in more requirement files. (#16976) by Scott Todd · 12 months ago
- 11d2259 Fix arm and windows builder issues at head. by Stella Laurenzo · 12 months ago
- 4f40080 [python] Flesh out more of the python parameters API. (#16957) by Stella Laurenzo · 12 months ago
- be9f097 [hip] Introduce options to control load of libamdhip64.so. (#16766) by Stella Laurenzo · 12 months ago
- c46daa6 Fix for iree-run-module with --input=not-a-path.bin (#16919) by James Newling · 1 year ago
- daee1dd Ukernels: replace boilerplate by tables. (#16879) by Benoit Jacob · 1 year ago
- 2cdf145 fix build errors when tracing mode = 1 (#16884) by Okwan Kwon · 1 year ago
- 5f2743b [hip] Move into hal/drivers and build by default (#16706) by Lei Zhang · 1 year ago
- 767a611 [tracing] fix build errors when tracing mode = 1 (#16853) by Okwan Kwon · 1 year ago
- e7eef08 Adds a nullptr check around optional implementation in dynamic modules. (#16845) by Stella Laurenzo · 1 year ago
- 77cc34b Bump numpy dep up to the 2.0 prerelease. (#16800) by Scott Todd · 1 year ago
- 2eae9b3 [CPU] Remove 8x8x16 i8mm microkernel by mariecwhite · 1 year ago
- 92fe572 [CPU] Add s8s4s32 i8mm ukernel (#16678) by mariecwhite · 1 year ago
- b61a918 Enable LTO optimization by default for runtime releases. (#16811) by Stella Laurenzo · 1 year ago
- 2395046 [rocm] Excises (almost) dependence on /opt/rocm from the compiler. (#16803) by Stella Laurenzo · 1 year ago
- d05b4a1 Splitting stack trace support out of status.c. (#16791) by Ben Vanik · 1 year ago
- 15d9039 Embedding executable source contents in binaries for tracing. (#16757) by Ben Vanik · 1 year, 1 month ago
- e12ab47 Replace `unaryOperator` by EmitC LogicalNotOp (#16730) by Marius Brehler · 1 year, 1 month ago
- e9f7ecd [cuda][hip] Drop name_literal reference like Vulkan side (#16742) by Lei Zhang · 1 year, 1 month ago
- 27b837c Replace `binaryOperator` by EmitC ops (#16728) by Marius Brehler · 1 year, 1 month ago
- 7da8af6 Cleanup a few docs/references to old paths in HAL/Target. (#16716) by Scott Todd · 1 year, 1 month ago
- b5af996 Prefer broadcasting RHS over LHS in AVX-512 multiply-accumulate instructions (#16709) by Benoit Jacob · 1 year, 1 month ago
- f34e534 Replace k with m by mariecwhite · 1 year, 1 month ago
- 4d3c93f Add missing macros to dotprod ukernel by mariecwhite · 1 year, 1 month ago
- 8adae37 [cuda][hip] Add support for semaphore multi wait (#16638) by Lei Zhang · 1 year, 1 month ago
- 9d6d99f faster narrow mmt4d ukernels on x86 (#16655) by Benoit Jacob · 1 year, 1 month ago
- 4f1f055 mmt4d ukernel: use fewer magic macros to generate tile-functions M0-variants (#16645) by Benoit Jacob · 1 year, 1 month ago
- b994b72 Reenable accidentally disabled architecture-specific parts of `mmt4d_test` (#16654) by Benoit Jacob · 1 year, 1 month ago
- f433fd2 Using iree.abi.name consistently for arg/result names. (#16635) by Ben Vanik · 1 year, 1 month ago
- fe5e69a [cuda][hip] Shorten deferred queue worker name (#16642) by Lei Zhang · 1 year, 1 month ago
- 9dfc612 [cuda][hip] Fix worker thread and device host callback synchronization (#16621) by Boian Petkantchin · 1 year, 1 month ago
- f66d7f2 Fix enablement of mmt4d ukernel test cases based on ISA code paths built (#16637) by Benoit Jacob · 1 year, 1 month ago
- 5180ede mmt4d ukernel: simplification in generic tile funcs: stop using a stack array (#16633) by Benoit Jacob · 1 year, 1 month ago
- 8959b90 Make ukernels fallback opt-in and add a `mmt4d_info` ukernel to query the mmt4d implementation. (#16631) by Benoit Jacob · 1 year, 1 month ago
- 6ff9a3d Refactor how llvm-cpu check tests interface with ASan/TSan. (#16452) by Scott Todd · 1 year, 1 month ago
- e6397cb Change ukernels calling convention to default (#16541) by Benoit Jacob · 1 year, 1 month ago
- e991798 Unroll fixed-trip-count loops within mmt4d ukernel tile functions. (#16626) by Benoit Jacob · 1 year, 1 month ago
- 88b1d4d Replace std::iterator with our custom iterator typedefs (#16423) (#16583) by Peyman Barazandeh · 1 year, 1 month ago
- 9dc8ae4 [cuda][hip] Fix launch host func and worker thread state update (#16568) by Lei Zhang · 1 year, 1 month ago
- 862a031 Adding --task_abort_on_failure flag/API. (#16565) by Ben Vanik · 1 year, 1 month ago
- 23f2828 Adding iree-benchmark-executable tool. (#16550) by Ben Vanik · 1 year, 1 month ago
- c15b610 [EmitC] Remove the forked emitter and generate all the code in the conversion pass (#16357) by Simon Camphausen · 1 year, 1 month ago
- d500494 Add s8s4s32 dotprod microkernel (#16473) by mariecwhite · 1 year, 1 month ago
- c3b3d96 Adding hal.device.id queries to HAL devices. (#16495) by Ben Vanik · 1 year, 1 month ago
- 6d293af Retrying try-lock in synchronization_test to avoid arm64 flakes. (#16436) by Ben Vanik · 1 year, 1 month ago
- 4463f8d [python] Enable building of 3.12 wheels on Linux. (#16424) by Stella Laurenzo · 1 year, 1 month ago
- 1f3e907 ukernels: update README.md (#16358) by Benoit Jacob · 1 year, 1 month ago
- d1e1d05 [python] Add a couple more async APIs. (#16419) by Stella Laurenzo · 1 year, 1 month ago
- 00aa173 [hip] Add missing source locations and fix parsing (#16418) by Lei Zhang · 1 year, 1 month ago
- d32609e Add s8s4s32 ukernel for ARM (#16259) by mariecwhite · 1 year, 1 month ago
- c02b89e [cuda][hip] Guard against NULL cleanup callbacks (#16403) by Lei Zhang · 1 year, 2 months ago
- 7c2ec73 Fix a bug in the fastpath of iree_hal_task_semaphore_multi_wait which was doing a spurious wait. (#16404) by Stella Laurenzo · 1 year, 2 months ago
- 60ac333 [python] Add a HalDeviceLoop class for routing runtime events to futures. (#16385) by Stella Laurenzo · 1 year, 2 months ago
- c70bf22 [HAL] Remove pool assert during allocator creation (#16388) by Nithin Meganathan · 1 year, 2 months ago
- 14927d1 Replacing the ancient vm_util with function_io/function_util. (#16351) by Ben Vanik · 1 year, 2 months ago
- 9aabcb3 Add conversions for FP8 types (F8E5M2 and F8E4M3) (#16374) by Benoit Jacob · 1 year, 2 months ago
- 30901f5 Replacing the ancient vm_util with function_io/function_util. by Ben Vanik · 1 year, 2 months ago
- 49f8a61 Adding iree_io_vec_stream_t. by Ben Vanik · 1 year, 2 months ago
- 29a7462 Adding iree_io_stdio_stream_t. by Ben Vanik · 1 year, 2 months ago
- 0a2483a Splitting iree_io_memory_stream_t from iree/io/stream.h. by Ben Vanik · 1 year, 2 months ago
- 9234f42 Add a number of runtime python bindings and refine the HalFence.wait() behavior. (#16371) by Stella Laurenzo · 1 year, 2 months ago
- 87bf971 Fixing implicit casting that caused 4GB fill/copy limits in local-task. (#16364) by Ben Vanik · 1 year, 2 months ago
- 10fd98b Fixes to enable clang-cl compilation of compiler/runtime. (#16299) by Ben Vanik · 1 year, 2 months ago
- 065e04a Adding support for outputting binary files from tooling. (#16291) by Ben Vanik · 1 year, 2 months ago
- 406626b [Vulkan][SPIRV] Introduce `address` vulkan device property (#16282) by Jakub Kuderski · 1 year, 2 months ago