- 65680c6 [VectorDistribution] Add patterns for distributing transfer_read/transfer_write (#16115) by Kunwar Grover · 1 year, 3 months ago
- d71c147 Refresh website branding. (#16151) by Scott Todd · 1 year, 3 months ago
- ee9b206 [ROCM][Ukernel] Fix index types. (#16154) by Stanley Winata · 1 year, 3 months ago
- c859e29 Fix web and Colab sample CI builds. (#16155) by Scott Todd · 1 year, 3 months ago
- 0f481d8 Remove run_shark_tank.yml and supporting code. (#15825) by Scott Todd · 1 year, 3 months ago
- dc0eaf7 Fix: `--iree-llvmcpu-loop-*` command-line options were not having any effect. (#16153) by Benoit Jacob · 1 year, 3 months ago
- 58cd112 Enforce CUDA >= 12 and fix its CMake search procedure (#16142) by Boian Petkantchin · 1 year, 3 months ago
- 3aab7b7 Add e2e tests for packing on i8 types. (#16148) by Han-Chung Wang · 1 year, 3 months ago
- 7d736b5 Ukernels: simplify the architecture-specific bitcode build. (#16126) by Benoit Jacob · 1 year, 3 months ago
- 6ab1ed8 Preserve reflection attrs on functions when wrapping for the native ABI. (#16129) by Ben Vanik · 1 year, 3 months ago
- 91803de Allow specifying multiple --device= flags in tooling. (#16132) by Ben Vanik · 1 year, 3 months ago
- 4b1b8e2 Fixing task worker utilization tracing plot. (#16131) by Ben Vanik · 1 year, 3 months ago
- e178f45 [stablehlo] Implement product of householder reflectors (#15555) by Rob Suderman · 1 year, 3 months ago
- 390eeb0 Fixing copypasta in codegen/flow tests that included ABI ops. (#16140) by Ben Vanik · 1 year, 3 months ago
- 1cb0f99 [VectorExt] Fix LayoutIterator iteration for step != 0 (#16133) by Kunwar Grover · 1 year, 3 months ago
- e3db254 Simplify how mmt4d ukernels deal with the K=0 case. (#16137) by Benoit Jacob · 1 year, 3 months ago
- 51c30ab e2e microkernel pipeline + argmax ukernel on ROCM backend. (#15943) by Stanley Winata · 1 year, 3 months ago
- 6847e37 [CPU] Remove special distribution tile sizes from setting matmul config. (#15968) by Han-Chung Wang · 1 year, 3 months ago
- dd17612 Add `IREE_ENABLE_WERROR_FLAG` CMake option. (#16121) by Rechie Kho · 1 year, 3 months ago
- ddccda0 [HIP] Add macro for HIP build deps update (#16123) by Nithin Meganathan · 1 year, 3 months ago
- 510c77e [VectorDistribution] Preserve fastmath flags on elementwise ops (#16118) by Jakub Kuderski · 1 year, 3 months ago
- 063c46e [CPU] Propagate "scalability" when using peeling for vectorisation (#16058) by Andrzej Warzyński · 1 year, 3 months ago
- 5ac75d8 [VectorDistribution] Fix bugs in vector distribution (#16116) by Kunwar Grover · 1 year, 3 months ago
- e8b2400 [CPU] Add clamping behavior back because of mid-air collision on #16041 (#16113) by Han-Chung Wang · 1 year, 3 months ago
- f47b76f [CPU] Remove legacy logics from matmul peeling expert. (#16041) by Han-Chung Wang · 1 year, 3 months ago
- 863d302 QOL fixes for portable and cross-compiling builds. (#16111) by Stella Laurenzo · 1 year, 3 months ago
- 914f306 [CodeGen] Switching to upstream eliminateCommonSubExpressions method. (#16105) by Han-Chung Wang · 1 year, 3 months ago
- c27ed41 [GlobalOpt][CPU] Move to using indexing maps for data tiling encodings instead of named op enums (#15984) by Max191 · 1 year, 3 months ago
- 17e9529 [spirv][vulkan] Refine device query to be more descriptive (#16101) by Lei Zhang · 1 year, 3 months ago
- 1b1e769 [LinalgExt] Delete dead codes. (#16104) by Han-Chung Wang · 1 year, 3 months ago
- 7ed7b96 [CPU] Break LLVMCPUVectorLowering pass to several small passes. (#16094) by Han-Chung Wang · 1 year, 3 months ago
- e32a502 Bump LLVM to llvm/llvm-project@f5145f4dc819 (#16073) by Han-Chung Wang · 1 year, 3 months ago
- 869e505 Disable const-eval for parameters unit test (#16089) by Max191 · 1 year, 3 months ago
- 3031ae6 Unifying helpers for size/shape-aware dim lookup. (#16095) by Ben Vanik · 1 year, 3 months ago
- 73fe86e Disable CUDA2 by default. (#16102) by Ben Vanik · 1 year, 3 months ago
- e2e126c [EmitC] Remove const casts in conversion (#15679) by Simon Camphausen · 1 year, 3 months ago
- 92f3a7f [CPU] Refine the logic to control vectorization pre-processing (#16078) by Andrzej Warzyński · 1 year, 3 months ago
- 0a69776 Rename and refactor HoistRedundantVectorTransfers (#16079) by Andrzej Warzyński · 1 year, 3 months ago
- 42e0a4b [spirv] Fix executable linking test to match real queries (#16100) by Lei Zhang · 1 year, 3 months ago
- c3518b2 [CPU] Unify LLVMCPU cmd flags and variable names in KernelDispatch.cpp (#16091) by Han-Chung Wang · 1 year, 3 months ago
- 02c5215 Revert "[Codegen] Re-Enable transform dialect configuration strategy sample (#15787)" (#16097) by Quinn Dawkins · 1 year, 3 months ago
- 562098f [VectorDistribution] Add infrastructure to support vector distribution based on layout (#16009) by Kunwar Grover · 1 year, 3 months ago
- 3b534c4 [Codegen] Re-Enable transform dialect configuration strategy sample (#15787) by Quinn Dawkins · 1 year, 3 months ago
- 8fb2680 Disable loop unrolling in LLVM IR optimization passes (#16092) by Benoit Jacob · 1 year, 3 months ago
- 4786ebc Remove SwiftShader Docker images and software Vulkan testing. (#15837) by Scott Todd · 1 year, 3 months ago
- dc81beb [CPU] Do not fuse ukernel ops into tiling loops. (#16054) by Han-Chung Wang · 1 year, 3 months ago
- baa911e Revert "Move Android benchmarks from Pixel 6 to Pixel 8" (#16090) by Jerry Wu · 1 year, 3 months ago
- 171e31c [cuda] Move to hal/drivers and wire up BUILD files (#14620) by Lei Zhang · 1 year, 3 months ago
- 74d1f01 [cuda] Break cyclic retain between device and device event pool (#16088) by Lei Zhang · 1 year, 3 months ago
- 381a16c [cuda] Fix deadlock when advancing deferred queue in driver thread (#15673) by Lei Zhang · 1 year, 3 months ago
- a7a7ad6 Add vm.buffer.hash and util.buffer.hash ops (#16003) by Quinn Dawkins · 1 year, 3 months ago
- 4b2aaaf Move Android benchmarks from Pixel 6 to Pixel 8 (#15796) by Jerry Wu · 1 year, 3 months ago
- 21d0153 Adding a robots.txt to iree.dev. (#16085) by Ben Vanik · 1 year, 3 months ago
- 198f271 [CPU] Fix multiconfig bug with tensor.pack op (#16082) by Max191 · 1 year, 3 months ago
- 81a47a7 Switch JAX pjrt-plugin link. (#15923) by Scott Todd · 1 year, 3 months ago
- c8ecc1c Reland "[spirv][vulkan] Enable device query generation and execution" (#16075) by Lei Zhang · 1 year, 3 months ago
- 2605fa1 Cherry-pick llvm/llvm-project@f5145f4dc819 to fix out-of-bounds access (#16074) by Han-Chung Wang · 1 year, 3 months ago
- 282ab77 Revert "[spirv][vulkan] Enable device query generation and execution" (#16077) by Han-Chung Wang · 1 year, 3 months ago
- 182a8f3 [HIP] Adds graph command buffer & descriptor set and pipeline layout (#15910) by Nithin Meganathan · 1 year, 3 months ago
- 852684a [spirv][vulkan] Enable device query generation and execution (#15977) by Lei Zhang · 1 year, 3 months ago
- b55ba25 Fixing/silencing some warnings that have crept in over time. (#16072) by Ben Vanik · 1 year, 3 months ago
- 776789e [GlobalOpt] Add a pass to simplify tensor pack/unpack ops. (#15993) by Han-Chung Wang · 1 year, 3 months ago
- c1edb82 [CodeGen] Implement MemoryEffectsOpInterface for ukernel ops. (#16053) by Han-Chung Wang · 1 year, 3 months ago
- 6aa310c [CPU] Move checking stack allocation cmd flag to Passes.cpp (#16062) by Han-Chung Wang · 1 year, 3 months ago
- bd2c92d [Stream] Update more op folders to verify matching types (#16070) by Quinn Dawkins · 1 year, 3 months ago
- d21a99c Check for source location resolution function in dynamic modules (#16065) by Quinn Dawkins · 1 year, 3 months ago
- 46b06d0 [minor code simplification] Implement algorithm without stack (#15999) by James Newling · 1 year, 3 months ago
- b16cee3 Bump LLVM to llvm/llvm-project@054b5fc0fd41 (#16055) by Han-Chung Wang · 1 year, 3 months ago
- ccd576c [LLVMGPU] Add AMDGPUToArith conversion patterns to ROCDL lowering (#16067) by Quinn Dawkins · 1 year, 3 months ago
- f0c8380 [LinalgExt] Retire RewriteForallToScfForOp transform op. (#16064) by Han-Chung Wang · 1 year, 4 months ago
- 6647a5b [CPU] Skip tiling if the compute op is not a TilingInterface op. (#16052) by Han-Chung Wang · 1 year, 4 months ago
- 124d562 Bump StableHLO to f8dcebfa1ec166806974f6ae0dfb902d36b47238 (#16049) by Jacques Pienaar · 1 year, 4 months ago
- d6dad12 ukernel: unroll the s16u4 VNNI ukernel, and drop the unused N0=16 variant (#16047) by Benoit Jacob · 1 year, 4 months ago
- ef344ac Bump LLVM to llvm/llvm-project@6b65d79 and deps (2023-12-29) (#16012) by Kunwar Grover · 1 year, 4 months ago
- b3200c8 [CPU] Enable mmt4d distribution for large reduction size cases. (#16037) by Han-Chung Wang · 1 year, 4 months ago
- 8869777 Add crosscompile utility binaries back to iree-dist tarball (#16034) by CindyLiu · 1 year, 4 months ago
- db83cc4 [LinalgExt] Delete fuse_producer transform op. (#16044) by Han-Chung Wang · 1 year, 4 months ago
- 0f0e0e7 [CodeGen] Carry over lowering_config when decomposing batch_mmt4d ops. (#16043) by Han-Chung Wang · 1 year, 4 months ago
- 6ac9b7e [LinalgExt] Expose attention tile size parameter (#16030) by harsh-nod · 1 year, 4 months ago
- 6711155 Add folding arithmetic extensions (#15953) by erman-gurses · 1 year, 4 months ago
- c4739bc [LinalgExt] Delete LinalgExt tiling patterns and passes. (#15921) by Han-Chung Wang · 1 year, 4 months ago
- ded4145 [LinalgExt] Switch tiling LinalgExt tests to use transform dialect. (#15904) by Han-Chung Wang · 1 year, 4 months ago
- 957af54 [LinalgExt] Switch distribution tests to use transform dialect. (#15922) by Han-Chung Wang · 1 year, 4 months ago
- e4aa589 [CodeGen] Add a flag to allow potentially removing unnecessary code to improve performance. (#15862) by Lubomir Litchev · 1 year, 4 months ago
- ce282c8 [stablehlo] Add missing nullptr check for unregistered dialects (#16032) by Jakub Kuderski · 1 year, 4 months ago
- f7b108f Bump website copyright to 2024. (#16028) by Scott Todd · 1 year, 4 months ago
- 41da229 [CPU][NFC] Retire LLVMCPUTensorPad pass. (#16027) by Han-Chung Wang · 1 year, 4 months ago
- 80efa38 [GlobalOpt] Add f32->bf16 demotion cases for transposed matmuls (#16022) by Max191 · 1 year, 4 months ago
- b92ceb4 Remove SYSTEM scope from transitive includes. (#16018) by Stella Laurenzo · 1 year, 4 months ago
- 8a87bf1 Disable -Waddress warnings on GCC. by Stella Laurenzo · 1 year, 4 months ago
- 3e0583f Some CMake package install ergonomics. (#16015) by Stella Laurenzo · 1 year, 4 months ago
- 895645b [VectorLayoutAnalysis] Add transfer functions for vector.contract (#15996) by Kunwar Grover · 1 year, 4 months ago
- f9cdcfd [python] Expose python bindings for scf in iree.compiler.dialects (#16013) by Kunwar Grover · 1 year, 4 months ago
- e7384a1 [VectorLayoutAnalysis] Add debug printing (#16007) by Kunwar Grover · 1 year, 4 months ago
- c35d8e9 Standardizes CMake setup of C directory trees behind a macro. (#16011) by Stella Laurenzo · 1 year, 4 months ago
- 1ae94a5 [ROCM] Expose amdgpu-waves-per-eu opt hint (#16010) by harsh-nod · 1 year, 4 months ago
- 15c306f Build functioning dev packages for IREECompiler and IREERuntime. (#16008) by Stella Laurenzo · 1 year, 4 months ago
- b0e8f3c [VectorLayoutAnalysis] Fix bug in scf.for transfer functions (#15989) by Kunwar Grover · 1 year, 4 months ago
- 4592b8f [torch] Bump torch-mlir to d560698e3d610ecdc56667c713e2338c47bf4f44. (#16006) by Stella Laurenzo · 1 year, 4 months ago
- ccbe33f [VectorExt] Add layout iterator classes (#16004) by harsh-nod · 1 year, 4 months ago