1. 7eb0da1 Generalizing iree_tooling_variant_list_fprint & co. by Ben Vanik · 2 years, 1 month ago
  2. 78fc0e3 Finishing iree_vm_list_set_variant/iree_vm_list_push_variant impls. by Ben Vanik · 2 years, 1 month ago
  3. fd76fb4 NFC: cleaning up trace_replay.c layout. by Ben Vanik · 2 years, 1 month ago
  4. ff9bc6a Bumping bytecode version. by Ben Vanik · 2 years, 1 month ago
  5. 3e6130e Adding alignment to vm.buffer.alloc and vm.buffer.clone. by Ben Vanik · 2 years, 1 month ago
  6. 459f5ff Adding i32/i64/f32 min/max ops. by Ben Vanik · 2 years, 1 month ago
  7. 08ae97f Refreshing VM performance by separating verification and tweaking buffer access. (#12426) by Ben Vanik · 2 years, 1 month ago
  8. 784db04 Adding bytecode verification. by Ben Vanik · 2 years, 1 month ago
  9. 91e73d3 Adding requirement bits to bytecode modules and bumping version. by Ben Vanik · 2 years, 1 month ago
  10. 8883247 Optimizing vm.buffer access with inlined buffer data access. by Ben Vanik · 2 years, 1 month ago
  11. aaa47cd Switching vm.buffer.fill to index with elements. by Ben Vanik · 2 years, 1 month ago
  12. bc00bc7 Moving iree_e2e_test_value_t out of tooling/, no clue why it was there. by Ben Vanik · 2 years, 1 month ago
  13. fa62f49 Only build iree_bitcode_library targets when llvm-cpu is enabled. (#12506) by Scott Todd · 2 years, 1 month ago
  14. 675699f Implement CPU feature detection on x86 (CPUID) (#12440) by bjacob · 2 years, 1 month ago
  15. f8efefe Rework `cpu.c` to make room for x86 path (#12395) by bjacob · 2 years, 1 month ago
  16. d6f8f15 Log out dlopen errors. (#12381) by Ben Vanik · 2 years, 1 month ago
  17. c42d002 [hal][cts] Add some non-offset-0 buffer fill tests (#12376) by Lei Zhang · 2 years, 1 month ago
  18. cf49d69 Moving bytecode files under iree/vm/bytecode/. (#12371) by Ben Vanik · 2 years, 1 month ago
  19. c319c2d Adding initial dispatch instrumention support. (#12357) by Ben Vanik · 2 years, 1 month ago
  20. 52de6a3 Centralizing HAL allocator wrapper creation and exposing to python. (#12356) by Ben Vanik · 2 years, 1 month ago
  21. 4e08eed Moving caching allocator configuration util out of tooling/. (#12338) by Ben Vanik · 2 years, 1 month ago
  22. 50588fb Adding HAL debug allocator (`--device_allocator=debug`). (#12337) by Ben Vanik · 2 years, 1 month ago
  23. 17e1111 Adding sym_visibility to vm.import and fixing parsing. (#12325) by Ben Vanik · 2 years, 1 month ago
  24. 987a864 Make tests compatible with both `iree::span` and C++20 `std::span` (#12319) by Geoffrey Martin-Noble · 2 years, 1 month ago
  25. c9ef78a Data-tiling encodings: rename RHS_TRANSPOSE to RHS (#12298) by bjacob · 2 years, 1 month ago
  26. c7b2912 Update references from `iree-org` to `openxla`. (#12304) by Scott Todd · 2 years, 1 month ago
  27. 7f6c57c Generating iree/builtins/device using our own clang. by Ben Vanik · 2 years, 2 months ago
  28. db2aefc Switching device builtins script to use our clang. by Ben Vanik · 2 years, 2 months ago
  29. 96d959e Add an option to disable use of bundled LLVM. (#12256) by Stella Laurenzo · 2 years, 1 month ago
  30. cd5347c Support mhlo collective ops (#11988) by Okwan Kwon · 2 years, 2 months ago
  31. 1044a85 Return a meaningful message on empty `--input=` flags. (#12192) by Ben Vanik · 2 years, 2 months ago
  32. a0a8bb8 [EmitC] Add support for Buffer Ops in VM Dialect (#11776) by Simon Camphausen · 2 years, 2 months ago
  33. b339919 Supporting CUDA importable buffers in compatibility check. (#12093) by Ben Vanik · 2 years, 2 months ago
  34. af5b6fe Get the right `pack`/`unpack` ukernel ARM64 tile functions (#12141) by bjacob · 2 years, 2 months ago
  35. 7c72bce Fix `IREE_FLAGS_ENABLE_FLAG_FILE` file IO support (#12098) by CindyLiu · 2 years, 2 months ago
  36. 99fb839 Adding iree_vm_list_swap_storage and iree_vm_list_copy. (#12088) by Ben Vanik · 2 years, 2 months ago
  37. 745e7f9 Fixing aliasing of iree_hal_memory_type_t bit. by Ben Vanik · 2 years, 2 months ago
  38. 33d2d1b Allowing import of CUDA buffers and improving error messages. by Ben Vanik · 2 years, 2 months ago
  39. eadd749 unpack ukernel (ARM64 code, test, benchmark) + pack improvements (#12055) by bjacob · 2 years, 2 months ago
  40. 21d0808 Add command_buffer_push_constants_test. (#11578) by Scott Todd · 2 years, 2 months ago
  41. 9fed940 Switch args/results to heap allocated storage for VM invocations. (#12045) by Ben Vanik · 2 years, 2 months ago
  42. 1f7702e Adding caching allocator configuration flag handling. by Ben Vanik · 2 years, 2 months ago
  43. a217ef3 Adding iree_hal_caching_allocator_t. by Ben Vanik · 2 years, 3 months ago
  44. c33b7dd Properly erroring out when spilling VM result stack storage. (#12030) by Ben Vanik · 2 years, 2 months ago
  45. e309cb3 Add `--output=` flag to iree-run-module/iree-run-mlir. (#12016) by Ben Vanik · 2 years, 2 months ago
  46. f65c5cb Renaming tool flags to --module/function/input. (#12010) by Ben Vanik · 2 years, 2 months ago
  47. 8bedd4b microkernels: simplify architecture code paths (#12015) by bjacob · 2 years, 2 months ago
  48. e026545 Extending `iree_hal_allocator_t` with memory heaps and compat queries. (#12002) by Ben Vanik · 2 years, 2 months ago
  49. 59f3b61 Check the nccl version compatibility (#12000) by Okwan Kwon · 2 years, 2 months ago
  50. df0db84 fix bad stride in pack_benchmark (#12009) by bjacob · 2 years, 2 months ago
  51. b2e4d2a Fixing runtime flag string lists to not grow exponentially. (#12007) by Ben Vanik · 2 years, 2 months ago
  52. 7ab025c Making iree_hal_allocator_query_buffer_compatibility return params. by Ben Vanik · 2 years, 2 months ago
  53. 77f6d26 Renaming `iree_hal_allocator_query_[+buffer_]compatibility`. by Ben Vanik · 2 years, 2 months ago
  54. 1a323b5 Adding iree_hal_allocator_memory_heap_t + query. by Ben Vanik · 2 years, 4 months ago
  55. 4cd7c8d Disabling CUDA replay stream validation. by Ben Vanik · 2 years, 2 months ago
  56. 29ad1ad Adding parsers for common HAL bitfields. by Ben Vanik · 2 years, 2 months ago
  57. 47d0b7c Adding iree_bitfield_parse utility. by Ben Vanik · 2 years, 2 months ago
  58. b14aaf4 pack microkernel padding improvements (#11987) by bjacob · 2 years, 2 months ago
  59. e513433 Add iree_hal_fence_extend helper. (#11989) by Stella Laurenzo · 2 years, 2 months ago
  60. 2e91d14 Fixing disabled tracing/flags functions ala #11962. (#11965) by Ben Vanik · 2 years, 2 months ago
  61. c170ca3 Removing unused iree/base/internal/main.h. (#11964) by Ben Vanik · 2 years, 2 months ago
  62. ed658db Fix CUDA tracing build in Debug build mode. (#11962) by Scott Todd · 2 years, 2 months ago
  63. 042e2c9 Create spans from non-temporaries. (#11958) by Jacques Pienaar · 2 years, 2 months ago
  64. 9242287 Attaching proper alignment to HAL binding subspans. (#11952) by Ben Vanik · 2 years, 2 months ago
  65. 80b3577 Swapping hal.interface.binding.subspan offset and alignment. by Ben Vanik · 2 years, 2 months ago
  66. f41856b Fixing PDB file naming to match DLL file naming. by Ben Vanik · 2 years, 2 months ago
  67. 0331769 Supporting symbol-relative relocations in the ELF loader. by Ben Vanik · 2 years, 2 months ago
  68. ca9bc42 Adding --task_worker_stack_size= flag and defaulting to 128KB. (#11867) by Ben Vanik · 2 years, 2 months ago
  69. 66fef01 Fixing MSVC compilation error with IREE_TRACE macro in a struct. by Ben Vanik · 2 years, 2 months ago
  70. 6a53867 Adding CUDA device-side tracing support (`--cuda_tracing`). (#11943) by Ben Vanik · 2 years, 2 months ago
  71. 9fa5de7 Enable CUDA (compiler and runtime) by default in CMake builds. (#11832) by Scott Todd · 2 years, 2 months ago
  72. a6de54c Refactor microkernel headers (#11920) by bjacob · 2 years, 2 months ago
  73. 48fa0da Supporting host memory registration in CUDA via external buffers. (#11899) by Ben Vanik · 2 years, 2 months ago
  74. 1006842 Make ukernels assert their validation, remove `iree_uk_status_t`. (#11900) by bjacob · 2 years, 2 months ago
  75. 3c78102 Revert "Supporting host memory registration in CUDA via external buff… (#11895) by Thomas · 2 years, 3 months ago
  76. e5f143f Use nested initializer list for iree_string_pair_t (#11871) by Lei Zhang · 2 years, 3 months ago
  77. 5932920 Discard root tasks before leaf tasks. (#11853) by bjacob · 2 years, 3 months ago
  78. 3c0aaab Supporting host memory registration in CUDA via external buffers. (#11848) by Ben Vanik · 2 years, 3 months ago
  79. c31c8a0 Nccl integration (#11585) by Okwan Kwon · 2 years, 3 months ago
  80. 2126488 Adding iree_hal_device_replace_allocator API. (#11809) by Ben Vanik · 2 years, 3 months ago
  81. 0d33d96 Relax the assumption of only one padding tile in `iree_uk_pack` (#11642) by bjacob · 2 years, 3 months ago
  82. 7958fc9 Adding IREE_FLAG_LIST utility for repeated string flags. (#11806) by Ben Vanik · 2 years, 3 months ago
  83. 43f516e Roll forward stateful loop_emscripten changes. (#11801) by Scott Todd · 2 years, 3 months ago
  84. 11c4049 Revert "Make loop_emscripten stateful and allow for multiple instances." (#11799) by Scott Todd · 2 years, 3 months ago
  85. 84d9957 Make loop_emscripten stateful and allow for multiple instances. (#11507) by Scott Todd · 2 years, 3 months ago
  86. 5b9a761 Add a `query_tile_sizes` ukernel. (#11640) by bjacob · 2 years, 3 months ago
  87. aa03d23 [NFC] Cleanup EmitC Conversion (#11793) by Simon Camphausen · 2 years, 3 months ago
  88. 3bb67d3 Reworking task API to provide iree_task_executors_create_from_flags. (#11614) by Ben Vanik · 2 years, 3 months ago
  89. c8ee571 Enable some now-passing HAL CTS tests. (#11537) by Scott Todd · 2 years, 3 months ago
  90. a9d23f6 Overhaul CMake and Bazel CUDA support (#11651) by Stella Laurenzo · 2 years, 3 months ago
  91. 4cf4b5a Support runtime tracing with Tracy in bazel builds. (#11650) by Stella Laurenzo · 2 years, 3 months ago
  92. 68c4ec2 Adding tracy memory pool visualization for Vulkan VMA buffers. (#11622) by Ben Vanik · 2 years, 4 months ago
  93. 830820e Adding tracing annotations for VMA buffer allocation sizes. (#11621) by Ben Vanik · 2 years, 4 months ago
  94. 742cf6a Rename `ukernel/exported_flag_bits` to `exported_bits` (#11609) by bjacob · 2 years, 4 months ago
  95. 41c5c07 Add vmvx.query_tile_sizes op (#11600) by bjacob · 2 years, 4 months ago
  96. 964ce61 Fix implementations of allocators' query_compatibility (#11599) by Ben Vanik · 2 years, 4 months ago
  97. a9aaebe Revert "Integrate NCCL" (#11584) by Ben Vanik · 2 years, 4 months ago
  98. 399adb3 Integrate NCCL (#11481) by Okwan Kwon · 2 years, 4 months ago
  99. a973914 Update some very old TODOs. (#11540) by Scott Todd · 2 years, 4 months ago
  100. ad23611 Adding plumbing and samples of custom CUDA/SPIR-V/CPU dispatch code. (#11287) by Ben Vanik · 2 years, 4 months ago