- 7eb0da1 Generalizing iree_tooling_variant_list_fprint & co. by Ben Vanik · 2 years, 1 month ago
- 78fc0e3 Finishing iree_vm_list_set_variant/iree_vm_list_push_variant impls. by Ben Vanik · 2 years, 1 month ago
- fd76fb4 NFC: cleaning up trace_replay.c layout. by Ben Vanik · 2 years, 1 month ago
- ff9bc6a Bumping bytecode version. by Ben Vanik · 2 years, 1 month ago
- 3e6130e Adding alignment to vm.buffer.alloc and vm.buffer.clone. by Ben Vanik · 2 years, 1 month ago
- 459f5ff Adding i32/i64/f32 min/max ops. by Ben Vanik · 2 years, 1 month ago
- 08ae97f Refreshing VM performance by separating verification and tweaking buffer access. (#12426) by Ben Vanik · 2 years, 1 month ago
- 784db04 Adding bytecode verification. by Ben Vanik · 2 years, 1 month ago
- 91e73d3 Adding requirement bits to bytecode modules and bumping version. by Ben Vanik · 2 years, 1 month ago
- 8883247 Optimizing vm.buffer access with inlined buffer data access. by Ben Vanik · 2 years, 1 month ago
- aaa47cd Switching vm.buffer.fill to index with elements. by Ben Vanik · 2 years, 1 month ago
- bc00bc7 Moving iree_e2e_test_value_t out of tooling/, no clue why it was there. by Ben Vanik · 2 years, 1 month ago
- fa62f49 Only build iree_bitcode_library targets when llvm-cpu is enabled. (#12506) by Scott Todd · 2 years, 1 month ago
- 675699f Implement CPU feature detection on x86 (CPUID) (#12440) by bjacob · 2 years, 1 month ago
- f8efefe Rework `cpu.c` to make room for x86 path (#12395) by bjacob · 2 years, 1 month ago
- d6f8f15 Log out dlopen errors. (#12381) by Ben Vanik · 2 years, 1 month ago
- c42d002 [hal][cts] Add some non-offset-0 buffer fill tests (#12376) by Lei Zhang · 2 years, 1 month ago
- cf49d69 Moving bytecode files under iree/vm/bytecode/. (#12371) by Ben Vanik · 2 years, 1 month ago
- c319c2d Adding initial dispatch instrumention support. (#12357) by Ben Vanik · 2 years, 1 month ago
- 52de6a3 Centralizing HAL allocator wrapper creation and exposing to python. (#12356) by Ben Vanik · 2 years, 1 month ago
- 4e08eed Moving caching allocator configuration util out of tooling/. (#12338) by Ben Vanik · 2 years, 1 month ago
- 50588fb Adding HAL debug allocator (`--device_allocator=debug`). (#12337) by Ben Vanik · 2 years, 1 month ago
- 17e1111 Adding sym_visibility to vm.import and fixing parsing. (#12325) by Ben Vanik · 2 years, 1 month ago
- 987a864 Make tests compatible with both `iree::span` and C++20 `std::span` (#12319) by Geoffrey Martin-Noble · 2 years, 1 month ago
- c9ef78a Data-tiling encodings: rename RHS_TRANSPOSE to RHS (#12298) by bjacob · 2 years, 1 month ago
- c7b2912 Update references from `iree-org` to `openxla`. (#12304) by Scott Todd · 2 years, 1 month ago
- 7f6c57c Generating iree/builtins/device using our own clang. by Ben Vanik · 2 years, 2 months ago
- db2aefc Switching device builtins script to use our clang. by Ben Vanik · 2 years, 2 months ago
- 96d959e Add an option to disable use of bundled LLVM. (#12256) by Stella Laurenzo · 2 years, 1 month ago
- cd5347c Support mhlo collective ops (#11988) by Okwan Kwon · 2 years, 2 months ago
- 1044a85 Return a meaningful message on empty `--input=` flags. (#12192) by Ben Vanik · 2 years, 2 months ago
- a0a8bb8 [EmitC] Add support for Buffer Ops in VM Dialect (#11776) by Simon Camphausen · 2 years, 2 months ago
- b339919 Supporting CUDA importable buffers in compatibility check. (#12093) by Ben Vanik · 2 years, 2 months ago
- af5b6fe Get the right `pack`/`unpack` ukernel ARM64 tile functions (#12141) by bjacob · 2 years, 2 months ago
- 7c72bce Fix `IREE_FLAGS_ENABLE_FLAG_FILE` file IO support (#12098) by CindyLiu · 2 years, 2 months ago
- 99fb839 Adding iree_vm_list_swap_storage and iree_vm_list_copy. (#12088) by Ben Vanik · 2 years, 2 months ago
- 745e7f9 Fixing aliasing of iree_hal_memory_type_t bit. by Ben Vanik · 2 years, 2 months ago
- 33d2d1b Allowing import of CUDA buffers and improving error messages. by Ben Vanik · 2 years, 2 months ago
- eadd749 unpack ukernel (ARM64 code, test, benchmark) + pack improvements (#12055) by bjacob · 2 years, 2 months ago
- 21d0808 Add command_buffer_push_constants_test. (#11578) by Scott Todd · 2 years, 2 months ago
- 9fed940 Switch args/results to heap allocated storage for VM invocations. (#12045) by Ben Vanik · 2 years, 2 months ago
- 1f7702e Adding caching allocator configuration flag handling. by Ben Vanik · 2 years, 2 months ago
- a217ef3 Adding iree_hal_caching_allocator_t. by Ben Vanik · 2 years, 3 months ago
- c33b7dd Properly erroring out when spilling VM result stack storage. (#12030) by Ben Vanik · 2 years, 2 months ago
- e309cb3 Add `--output=` flag to iree-run-module/iree-run-mlir. (#12016) by Ben Vanik · 2 years, 2 months ago
- f65c5cb Renaming tool flags to --module/function/input. (#12010) by Ben Vanik · 2 years, 2 months ago
- 8bedd4b microkernels: simplify architecture code paths (#12015) by bjacob · 2 years, 2 months ago
- e026545 Extending `iree_hal_allocator_t` with memory heaps and compat queries. (#12002) by Ben Vanik · 2 years, 2 months ago
- 59f3b61 Check the nccl version compatibility (#12000) by Okwan Kwon · 2 years, 2 months ago
- df0db84 fix bad stride in pack_benchmark (#12009) by bjacob · 2 years, 2 months ago
- b2e4d2a Fixing runtime flag string lists to not grow exponentially. (#12007) by Ben Vanik · 2 years, 2 months ago
- 7ab025c Making iree_hal_allocator_query_buffer_compatibility return params. by Ben Vanik · 2 years, 2 months ago
- 77f6d26 Renaming `iree_hal_allocator_query_[+buffer_]compatibility`. by Ben Vanik · 2 years, 2 months ago
- 1a323b5 Adding iree_hal_allocator_memory_heap_t + query. by Ben Vanik · 2 years, 4 months ago
- 4cd7c8d Disabling CUDA replay stream validation. by Ben Vanik · 2 years, 2 months ago
- 29ad1ad Adding parsers for common HAL bitfields. by Ben Vanik · 2 years, 2 months ago
- 47d0b7c Adding iree_bitfield_parse utility. by Ben Vanik · 2 years, 2 months ago
- b14aaf4 pack microkernel padding improvements (#11987) by bjacob · 2 years, 2 months ago
- e513433 Add iree_hal_fence_extend helper. (#11989) by Stella Laurenzo · 2 years, 2 months ago
- 2e91d14 Fixing disabled tracing/flags functions ala #11962. (#11965) by Ben Vanik · 2 years, 2 months ago
- c170ca3 Removing unused iree/base/internal/main.h. (#11964) by Ben Vanik · 2 years, 2 months ago
- ed658db Fix CUDA tracing build in Debug build mode. (#11962) by Scott Todd · 2 years, 2 months ago
- 042e2c9 Create spans from non-temporaries. (#11958) by Jacques Pienaar · 2 years, 2 months ago
- 9242287 Attaching proper alignment to HAL binding subspans. (#11952) by Ben Vanik · 2 years, 2 months ago
- 80b3577 Swapping hal.interface.binding.subspan offset and alignment. by Ben Vanik · 2 years, 2 months ago
- f41856b Fixing PDB file naming to match DLL file naming. by Ben Vanik · 2 years, 2 months ago
- 0331769 Supporting symbol-relative relocations in the ELF loader. by Ben Vanik · 2 years, 2 months ago
- ca9bc42 Adding --task_worker_stack_size= flag and defaulting to 128KB. (#11867) by Ben Vanik · 2 years, 2 months ago
- 66fef01 Fixing MSVC compilation error with IREE_TRACE macro in a struct. by Ben Vanik · 2 years, 2 months ago
- 6a53867 Adding CUDA device-side tracing support (`--cuda_tracing`). (#11943) by Ben Vanik · 2 years, 2 months ago
- 9fa5de7 Enable CUDA (compiler and runtime) by default in CMake builds. (#11832) by Scott Todd · 2 years, 2 months ago
- a6de54c Refactor microkernel headers (#11920) by bjacob · 2 years, 2 months ago
- 48fa0da Supporting host memory registration in CUDA via external buffers. (#11899) by Ben Vanik · 2 years, 2 months ago
- 1006842 Make ukernels assert their validation, remove `iree_uk_status_t`. (#11900) by bjacob · 2 years, 2 months ago
- 3c78102 Revert "Supporting host memory registration in CUDA via external buff… (#11895) by Thomas · 2 years, 3 months ago
- e5f143f Use nested initializer list for iree_string_pair_t (#11871) by Lei Zhang · 2 years, 3 months ago
- 5932920 Discard root tasks before leaf tasks. (#11853) by bjacob · 2 years, 3 months ago
- 3c0aaab Supporting host memory registration in CUDA via external buffers. (#11848) by Ben Vanik · 2 years, 3 months ago
- c31c8a0 Nccl integration (#11585) by Okwan Kwon · 2 years, 3 months ago
- 2126488 Adding iree_hal_device_replace_allocator API. (#11809) by Ben Vanik · 2 years, 3 months ago
- 0d33d96 Relax the assumption of only one padding tile in `iree_uk_pack` (#11642) by bjacob · 2 years, 3 months ago
- 7958fc9 Adding IREE_FLAG_LIST utility for repeated string flags. (#11806) by Ben Vanik · 2 years, 3 months ago
- 43f516e Roll forward stateful loop_emscripten changes. (#11801) by Scott Todd · 2 years, 3 months ago
- 11c4049 Revert "Make loop_emscripten stateful and allow for multiple instances." (#11799) by Scott Todd · 2 years, 3 months ago
- 84d9957 Make loop_emscripten stateful and allow for multiple instances. (#11507) by Scott Todd · 2 years, 3 months ago
- 5b9a761 Add a `query_tile_sizes` ukernel. (#11640) by bjacob · 2 years, 3 months ago
- aa03d23 [NFC] Cleanup EmitC Conversion (#11793) by Simon Camphausen · 2 years, 3 months ago
- 3bb67d3 Reworking task API to provide iree_task_executors_create_from_flags. (#11614) by Ben Vanik · 2 years, 3 months ago
- c8ee571 Enable some now-passing HAL CTS tests. (#11537) by Scott Todd · 2 years, 3 months ago
- a9d23f6 Overhaul CMake and Bazel CUDA support (#11651) by Stella Laurenzo · 2 years, 3 months ago
- 4cf4b5a Support runtime tracing with Tracy in bazel builds. (#11650) by Stella Laurenzo · 2 years, 3 months ago
- 68c4ec2 Adding tracy memory pool visualization for Vulkan VMA buffers. (#11622) by Ben Vanik · 2 years, 4 months ago
- 830820e Adding tracing annotations for VMA buffer allocation sizes. (#11621) by Ben Vanik · 2 years, 4 months ago
- 742cf6a Rename `ukernel/exported_flag_bits` to `exported_bits` (#11609) by bjacob · 2 years, 4 months ago
- 41c5c07 Add vmvx.query_tile_sizes op (#11600) by bjacob · 2 years, 4 months ago
- 964ce61 Fix implementations of allocators' query_compatibility (#11599) by Ben Vanik · 2 years, 4 months ago
- a9aaebe Revert "Integrate NCCL" (#11584) by Ben Vanik · 2 years, 4 months ago
- 399adb3 Integrate NCCL (#11481) by Okwan Kwon · 2 years, 4 months ago
- a973914 Update some very old TODOs. (#11540) by Scott Todd · 2 years, 4 months ago
- ad23611 Adding plumbing and samples of custom CUDA/SPIR-V/CPU dispatch code. (#11287) by Ben Vanik · 2 years, 4 months ago