1. a488d38 Add region to `linalg_ext.attention` (#18728) by Rob Suderman · 5 months ago
  2. 164a60e [ROCM] Disable mixed precision fma instructions that cause numeric issues (#18753) by Nirvedh Meshram · 6 months ago
  3. 0b17c72 Add testing for punet model variations. (#18639) by saienduri · 6 months ago
  4. 546d862 Fix experimental/web/ samples after recent changes. (#18567) by Scott Todd · 6 months ago
  5. e2a2b2b Removing descriptor set layouts from HAL IR and simplifying bindings. by Ben Vanik · 7 months ago
  6. 7dc8c26 Renaming dispatch2 -> dispatch and create2 -> create. by Ben Vanik · 7 months ago
  7. f43f59b Renaming [spirv|wgsl]_executable_def to [vulkan|webgpu]. by Ben Vanik · 7 months ago
  8. 4349df7 Renaming rocm executable -> hip executable. by Ben Vanik · 7 months ago
  9. 5c06d4b Factoring out common debug info from GPU executable flatbuffers. by Ben Vanik · 7 months ago
  10. 7a7bfe1 [Flow] Move first part of Flow transforms to new pipeline (#18290) by Ian Wood · 7 months ago
  11. cc44a85 Rework special model testing to avoid shared cache interference. (#18344) by saienduri · 7 months ago
  12. c44d29b [compiler] Make cuda/hip/vulkan target cl options consistent (#17710) by Lei Zhang · 7 months ago
  13. 4c8913b Remove device "gpu number" specifications from model benchmarks. (#18315) by Scott Todd · 7 months ago
  14. 8dd1db3 Bubble expand shapes through `AttentionOp`s (#18074) by Ian Wood · 7 months ago
  15. 7c8fedc Remove PyYAML dependency from Python bindings. (#18262) by Scott Todd · 7 months ago
  16. 9b05f17 Delete all in-tree benchmark infrastructure code. (#18144) by Scott Todd · 7 months ago
  17. 9c951ca [Flow] Generalize horizontal contraction fusion to cover more cases. (#17880) by MaheshRavishankar · 7 months ago
  18. 8dc6820 Adding simplified HAL dispatch methods. (#18189) by Ben Vanik · 8 months ago
  19. 7cf0e26 Migrate uses of build_host_tools.sh and delete it. (#18129) by Scott Todd · 8 months ago
  20. a28f76f Adding flag placeholders to semaphores/events. (#18122) by Ben Vanik · 8 months ago
  21. 242d69e Drop regression_suite install from setup_venv.py. (#18072) by Scott Todd · 8 months ago
  22. 6c45bef [runtime][HIP] Retire ROCm HAL backend (#17029) by Nithin Meganathan · 8 months ago
  23. e900692 Updating various tests to the latest changes. by Ben Vanik · 8 months ago
  24. 3ea1357 Adding `iree_hal_dispatch_flags_t` to dispatch operations. by Ben Vanik · 8 months ago
  25. d8bf4ac Add `iree-c-embed-data` and `iree-flatcc-cli` to release packages. (#18001) by Scott Todd · 8 months ago
  26. 0bc1518 Log more context when sdxl benchmark commands fail. (#17907) by Scott Todd · 8 months ago
  27. 44808e1 Add in-tree special_models test suite using reworked iree-tooling. (#17883) by saienduri · 9 months ago
  28. f8f2996 Retaining binding tables and plumbing indirect cmds in local-task. (#17838) by Ben Vanik · 9 months ago
  29. 9ffe473 Making HAL command buffers take buffers as indirect args. (#17730) by Ben Vanik · 9 months ago
  30. 13e6b7e Removing nested command buffers and adding indirect execution. (#17724) by Ben Vanik · 9 months ago
  31. 62efaee Format files across the project using pre-commit. (#17534) by Scott Todd · 10 months ago
  32. d47f86e [hip][rocm] Switch to use old hipDeviceProp_t for queries (#17522) by Lei Zhang · 10 months ago
  33. 4132d2e [runtime][hal][hip] Implement collectives via RCCL (#17270) by Boian Petkantchin · 10 months ago
  34. c1fdd75 Introduce new logo assets. (#17424) by Scott Todd · 10 months ago
  35. b716704 Update git-clang-format ref and clang-format version. (#16792) by Scott Todd · 10 months ago
  36. c2114b8 Remove experimental/dispatch_profiler. (#17287) by Scott Todd · 11 months ago
  37. f7098e3 Moving regression suite to azure (#17140) by saienduri · 11 months ago
  38. 655b71a Executable library call hooks system, and a sample Linux/CPU event implementation (#15803) by Benoit Jacob · 11 months ago
  39. 3f51a55 Replace openxla/iree with iree-org/iree across the project. (#17110) by Scott Todd · 11 months ago
  40. e44cf32 Move external test suite configs out of experimental. (#16907) by Scott Todd · 12 months ago
  41. ff820d6 Re-land "start testing real weight models ..." (#16918) by Scott Todd · 12 months ago
  42. 7bda2ec Bump torch-mlir to HEAD (e2343cf4ce9a13e8fa09d6c5ade6524fa7cf2b02). (#16911) by Stella Laurenzo · 12 months ago
  43. cd1068b Revert "Start testing real weight models from external test suite." (#16910) by Scott Todd · 12 months ago
  44. 8ab68b6 Start testing real weight models from external test suite. (#16801) by Scott Todd · 12 months ago
  45. 61a1f2e Mark regression tests as passing that now pass. (#16900) by Stella Laurenzo · 12 months ago
  46. 5f2743b [hip] Move into hal/drivers and build by default (#16706) by Lei Zhang · 1 year ago
  47. ed59bed Enabling external HIP HSACO support ala CUDA external PTX support. (#16830) by Ben Vanik · 1 year ago
  48. a2ed5d1 Trace allocate/deallocate in rocm_allocator. (#16822) by Scott Todd · 1 year ago
  49. b61a918 Enable LTO optimization by default for runtime releases. (#16811) by Stella Laurenzo · 1 year ago
  50. ee32fc7 [rocm] Fix crash when executable source information is missing (#16805) by Lei Zhang · 1 year ago
  51. 2395046 [rocm] Excises (almost) dependence on /opt/rocm from the compiler. (#16803) by Stella Laurenzo · 1 year ago
  52. 15d9039 Embedding executable source contents in binaries for tracing. (#16757) by Ben Vanik · 1 year ago
  53. e9f7ecd [cuda][hip] Drop name_literal reference like Vulkan side (#16742) by Lei Zhang · 1 year ago
  54. a3603c6 [ROCM] Fix build with runtime tracing enabled (#16737) by Quinn Dawkins · 1 year ago
  55. 18d73c7 [rocm] Fix IREE_ROCM_TRACE_ZONE symbol (#16736) by Lei Zhang · 1 year ago
  56. c8081fd Adding legacy ROCM tracing zones. (#16735) by Ben Vanik · 1 year ago
  57. 331801c bump torch to 80c7bc3f7ae12413836a2f610a6491794b4dbb08 (#16717) by Daniel Garvey · 1 year ago
  58. a283044 [hip] Make graph command buffer as default for initialization (#16707) by Lei Zhang · 1 year, 1 month ago
  59. 0545746 [hip] Mark device local + host visible as low performance (#16701) by Lei Zhang · 1 year, 1 month ago
  60. c87eafe Update external test suite version pin and XFAIL sets. (#16675) by Scott Todd · 1 year, 1 month ago
  61. 3bdb45b Use correct 'webgpu-spirv' flag name in samples. (#16681) by Scott Todd · 1 year, 1 month ago
  62. 77758bd [rocm] Port optional symbols support for hipGetDeviceProperties (#16661) by Lei Zhang · 1 year, 1 month ago
  63. 6209806 [ROCm] Set option to preload kernel arguments (#16659) by Jakub Kuderski · 1 year, 1 month ago
  64. 57ac339 [hip] Enable stream command buffer dispatch tracing (#16641) by Lei Zhang · 1 year, 1 month ago
  65. 8adae37 [cuda][hip] Add support for semaphore multi wait (#16638) by Lei Zhang · 1 year, 1 month ago
  66. 2ba3d5c [hip] Drop unnecessary __HIP_PLATFORM_HCC__ definition (#16644) by Lei Zhang · 1 year, 1 month ago
  67. eda28bf [hip][rocm] Fix hipGetDeviceProperties usage after ROCm 6.0 (#16643) by Lei Zhang · 1 year, 1 month ago
  68. fe5e69a [cuda][hip] Shorten deferred queue worker name (#16642) by Lei Zhang · 1 year, 1 month ago
  69. 9dfc612 [cuda][hip] Fix worker thread and device host callback synchronization (#16621) by Boian Petkantchin · 1 year, 1 month ago
  70. e2d73ec [rocm] Backport and adjust some HIP allocator and buffer changes (#16627) by Lei Zhang · 1 year, 1 month ago
  71. 96a09d9 Delete experimental/cpu_ukernel (#16540) by Benoit Jacob · 1 year, 1 month ago
  72. 890b070 Forking off device methods from TargetBackend->TargetDevice. (#16591) by Ben Vanik · 1 year, 1 month ago
  73. 24bf0ac [hip] Optionally enable graph command buffer and tests (#16604) by Lei Zhang · 1 year, 1 month ago
  74. 42f1675 Run external test suite tests in pkgci. (#16589) by Scott Todd · 1 year, 1 month ago
  75. eeda5ca Renaming WebGPU to WebGPU-SPIRV (ala Metal-SPIRV). (#16586) by Ben Vanik · 1 year, 1 month ago
  76. 9dc8ae4 [cuda][hip] Fix launch host func and worker thread state update (#16568) by Lei Zhang · 1 year, 1 month ago
  77. 37d60f1 [hip] Enable stablehlo/tosa op e2e tests (#16466) by Lei Zhang · 1 year, 1 month ago
  78. 884d2dc [HIP] Add device cast to fix build error (#16505) by Nithin Meganathan · 1 year, 1 month ago
  79. c3b3d96 Adding hal.device.id queries to HAL devices. (#16495) by Ben Vanik · 1 year, 1 month ago
  80. ab4df60 [hip] Fix CMakeLists duplication and improve variable name (#16465) by Lei Zhang · 1 year, 1 month ago
  81. 1a010d8 [rocm][hip] Make shared object init error understandable (#16459) by Lei Zhang · 1 year, 1 month ago
  82. 5b3a0ab [rocm][hip] Name the function for excessive shared memory in error (#16460) by Lei Zhang · 1 year, 1 month ago
  83. 16b1f2d [HIP] List all current error codes from HIP (#16430) by Nithin Meganathan · 1 year, 1 month ago
  84. 8fc4cd4 [HIP] Replace hipLaunchKernel API with module call (#16448) by Nithin Meganathan · 1 year, 1 month ago
  85. 0be6423 Quality of life improvements for experimental/regression_suite. (#16415) by Scott Todd · 1 year, 1 month ago
  86. 00aa173 [hip] Add missing source locations and fix parsing (#16418) by Lei Zhang · 1 year, 1 month ago
  87. c1d608f Add host cpu tests for regression_suite/.../ukernel. (#16413) by Scott Todd · 1 year, 1 month ago
  88. c02b89e [cuda][hip] Guard against NULL cleanup callbacks (#16403) by Lei Zhang · 1 year, 1 month ago
  89. c8b6dc1 [hip] Initialize the executable resource after allocation (#16397) by Lei Zhang · 1 year, 1 month ago
  90. d36e8f3 [hip] Mark graph update/copy buffer as unimplmented (#16395) by Lei Zhang · 1 year, 1 month ago
  91. c70bf22 [HAL] Remove pool assert during allocator creation (#16388) by Nithin Meganathan · 1 year, 1 month ago
  92. 7fdb581 [HIP] Enable HAL CTS (#16380) by Nithin Meganathan · 1 year, 1 month ago
  93. ebcd016 [HIP] Enable semaphores in HIP device (#16349) by Nithin Meganathan · 1 year, 1 month ago
  94. 4289190 [HIP] Implement event-backed semaphore and deferred queue (#16305) by Nithin Meganathan · 1 year, 2 months ago
  95. fe5ff44 [HIP] Add stream command buffer (#16290) by Nithin Meganathan · 1 year, 2 months ago
  96. d71c147 Refresh website branding. (#16151) by Scott Todd · 1 year, 2 months ago
  97. c859e29 Fix web and Colab sample CI builds. (#16155) by Scott Todd · 1 year, 2 months ago
  98. 51c30ab e2e microkernel pipeline + argmax ukernel on ROCM backend. (#15943) by Stanley Winata · 1 year, 2 months ago
  99. ddccda0 [HIP] Add macro for HIP build deps update (#16123) by Nithin Meganathan · 1 year, 2 months ago
  100. 171e31c [cuda] Move to hal/drivers and wire up BUILD files (#14620) by Lei Zhang · 1 year, 3 months ago