- 6647a5b [CPU] Skip tiling if the compute op is not a TilingInterface op. (#16052) by Han-Chung Wang · 1 year, 4 months ago
- 124d562 Bump StableHLO to f8dcebfa1ec166806974f6ae0dfb902d36b47238 (#16049) by Jacques Pienaar · 1 year, 4 months ago
- d6dad12 ukernel: unroll the s16u4 VNNI ukernel, and drop the unused N0=16 variant (#16047) by Benoit Jacob · 1 year, 4 months ago
- ef344ac Bump LLVM to llvm/llvm-project@6b65d79 and deps (2023-12-29) (#16012) by Kunwar Grover · 1 year, 4 months ago
- b3200c8 [CPU] Enable mmt4d distribution for large reduction size cases. (#16037) by Han-Chung Wang · 1 year, 4 months ago
- 8869777 Add crosscompile utility binaries back to iree-dist tarball (#16034) by CindyLiu · 1 year, 4 months ago
- db83cc4 [LinalgExt] Delete fuse_producer transform op. (#16044) by Han-Chung Wang · 1 year, 4 months ago
- 0f0e0e7 [CodeGen] Carry over lowering_config when decomposing batch_mmt4d ops. (#16043) by Han-Chung Wang · 1 year, 4 months ago
- 6ac9b7e [LinalgExt] Expose attention tile size parameter (#16030) by harsh-nod · 1 year, 4 months ago
- 6711155 Add folding arithmetic extensions (#15953) by erman-gurses · 1 year, 4 months ago
- c4739bc [LinalgExt] Delete LinalgExt tiling patterns and passes. (#15921) by Han-Chung Wang · 1 year, 4 months ago
- ded4145 [LinalgExt] Switch tiling LinalgExt tests to use transform dialect. (#15904) by Han-Chung Wang · 1 year, 4 months ago
- 957af54 [LinalgExt] Switch distribution tests to use transform dialect. (#15922) by Han-Chung Wang · 1 year, 4 months ago
- e4aa589 [CodeGen] Add aflag to allow potentially to remove unnecessary code to improve performance. (#15862) by Lubomir Litchev · 1 year, 4 months ago
- ce282c8 [stablehlo] Add missing nullptr check for unregistered dialects (#16032) by Jakub Kuderski · 1 year, 4 months ago
- f7b108f Bump website copyright to 2024. (#16028) by Scott Todd · 1 year, 4 months ago
- 41da229 [CPU][NFC] Retire LLVMCPUTensorPad pass. (#16027) by Han-Chung Wang · 1 year, 4 months ago
- 80efa38 [GlobalOpt] Add f32->bf16 demotion cases for transposed matmuls (#16022) by Max191 · 1 year, 4 months ago
- b92ceb4 Remove SYSTEM scope from transitive includes. (#16018) by Stella Laurenzo · 1 year, 4 months ago
- 8a87bf1 Disable -Waddress warnings on GCC. by Stella Laurenzo · 1 year, 4 months ago
- 3e0583f Some CMake package install ergonomics. (#16015) by Stella Laurenzo · 1 year, 4 months ago
- 895645b [VectorLayoutAnalysis] Add transfer functions for vector.contract (#15996) by Kunwar Grover · 1 year, 4 months ago
- f9cdcfd [python] Expose python bindings for scf in iree.compiler.dialects (#16013) by Kunwar Grover · 1 year, 4 months ago
- e7384a1 [VectorLayoutAnalysis] Add debug printing (#16007) by Kunwar Grover · 1 year, 4 months ago
- c35d8e9 Standardizes CMake setup of C directory trees behind a macro. (#16011) by Stella Laurenzo · 1 year, 4 months ago
- 1ae94a5 [ROCM] Expose amdgpu-waves-per-eu opt hint (#16010) by harsh-nod · 1 year, 4 months ago
- 15c306f Build functioning dev packages for IREECompiler and IREERuntime. (#16008) by Stella Laurenzo · 1 year, 4 months ago
- b0e8f3c [VectorLayoutAnalysis] Fix bug in scf.for transfer functions (#15989) by Kunwar Grover · 1 year, 4 months ago
- 4592b8f [torch] Bump torch-mlir to d560698e3d610ecdc56667c713e2338c47bf4f44. (#16006) by Stella Laurenzo · 1 year, 4 months ago
- ccbe33f [VectorExt] Add layout iterator classes (#16004) by harsh-nod · 1 year, 4 months ago
- 9cde4e3 [VectorExt] Add folders for to_simt and to_simd (#15997) by Kunwar Grover · 1 year, 4 months ago
- 5bac47b [NFC][Codegen] Move Codegen/Dialect to Codegen/Dialect/Codegen/IR (#15944) by Kunwar Grover · 1 year, 4 months ago
- a3af0b0 Fix producer-consumer graph in `ConstExprAnalysis` (#15913) by Jerry Wu · 1 year, 4 months ago
- 0af34bd [GlobalOpt] Add quantized matmul reassociation support for f16 types (#15964) by Max191 · 1 year, 4 months ago
- 8d9638e [Flow] Loosen restrictions for dequantization fusion (#15663) by Max191 · 1 year, 4 months ago
- 5e33995 [onnx] Enable --iree-input-type=onnx (#15995) by Stella Laurenzo · 1 year, 4 months ago
- 7b7ffeb [onnx] Add ONNX importer and iree-import-onnx tool to compiler package. (#15920) by Stella Laurenzo · 1 year, 4 months ago
- 193bc27 Improve feedback for missing flags in iree-dump-parameters (#15985) by Ilija Kalinić · 1 year, 4 months ago
- 09c9f5b [spirv] Provide same entry point set in variants when linking (#15935) by Lei Zhang · 1 year, 4 months ago
- 5b8e870 [Flow] Relax dynamic shape restrictions on slice -> Flow patterns (#15861) by Quinn Dawkins · 1 year, 4 months ago
- 9ba681c [GlobalOpt][NFC] Follow LLVM comment style and apply few cleanups. (#15990) by Han-Chung Wang · 1 year, 4 months ago
- ded5262 [GlobalOpt] Remove specialized check-prefix from lit tests. (#15991) by Han-Chung Wang · 1 year, 4 months ago
- 6f969db [VectorExt] Add to_simt and to_simd operations (#15987) by Kunwar Grover · 1 year, 4 months ago
- 03f7079 Move VMVX benchmarks to x86_64 runner (#15969) by Jerry Wu · 1 year, 4 months ago
- 109d174 Add a f32-to-bf16 demotion pass for matmuls, and new compiler flag `--iree-global-opt-enable-demote-contraction-inputs-to-bf16` (#15946) by bjacob · 1 year, 4 months ago
- 86511e9 drop redundant `populateExpandBFloat16Patterns` call in `ConvertToLLVM` (#15986) by bjacob · 1 year, 4 months ago
- 73f5947 Align the Flow::DispatchTensorStoreOp verifier with tensor.insert_slice (#15959) by Quinn Dawkins · 1 year, 4 months ago
- 829b398 [VM] Add a vm.rodata.table.inline op (#15971) by Quinn Dawkins · 1 year, 4 months ago
- 4e9199b Integrate llvm/llvm-project@99045b60b575 and bump deps (2023-12-19) (#15972) by Han-Chung Wang · 1 year, 4 months ago
- 92df2b4 Make iree.compiler.api.Output.map_memory() retain its backing reference. (#15975) by Stella Laurenzo · 1 year, 4 months ago
- da8570c [GlobalOpt] Turn transpose propagation off by default. (#15979) by Han-Chung Wang · 1 year, 4 months ago
- 166dcc5 [GlobalOptimization] Add pass for transpose propagation (#15858) by Quinn Dawkins · 1 year, 4 months ago
- 39576d0 [spirv] Trim provided executable target before linking (#15888) by Lei Zhang · 1 year, 4 months ago
- d29890c Add Falcon7b int4 to CI benchmarks by mariecwhite · 1 year, 4 months ago
- a0919c5 [Codegen] Bail TileAndDistributeToWorkgroups if the workgroup count is lowered (#15885) by Quinn Dawkins · 1 year, 4 months ago
- e2ba0a2 Updating HAL passes to use tablegen. (#15952) by Ben Vanik · 1 year, 4 months ago
- 0bd654f [compiler] NFC: Move op equivalence into a file in Utils/ (#15966) by Lei Zhang · 1 year, 4 months ago
- a5f0b29 [CPU] Unify distribution tile size selection heuristic. (#15928) by Han-Chung Wang · 1 year, 4 months ago
- 873ba77 [CPU][NFC] Update variable names to be camelCase. (#15967) by Han-Chung Wang · 1 year, 4 months ago
- 0ecbe75 [SPIRV] Switch tiling on reductions to use common helpers. (#15940) by Han-Chung Wang · 1 year, 4 months ago
- a0bf55c [Codegen] Switch to Util helper for getting bit widths (#15957) by Quinn Dawkins · 1 year, 4 months ago
- 6c1b1ab [GPU] Simplify gpu warp reduction lowering (#15963) by Jakub Kuderski · 1 year, 4 months ago
- e2dcbe1 Add support for serializing complex resource attributes (#15960) by Quinn Dawkins · 1 year, 4 months ago
- 18ed8d5 Bump torch-mlir to 9c655d0bfb166785dc17e51b0afa1f937c227cef (#15956) by Quinn Dawkins · 1 year, 4 months ago
- 0c0eb27 [spirv] Cleanup and reorder functions in SPIR-V passes (#15958) by Lei Zhang · 1 year, 4 months ago
- a5c48ed Update to wolfpld/tracy@af73dba7 (2023-12-5) (#15807) by Lei Zhang · 1 year, 4 months ago
- 6d43d9c Integrate llvm/llvm-project@395f9ce30e88 (2023-12-16) (#15954) by Lei Zhang · 1 year, 4 months ago
- f97aa4d [HIP] Adds support for native executable and cache (#15937) by Nithin Meganathan · 1 year, 4 months ago
- fac5c59 [Flow] Enable fusion of `tensor.unpack` with non-intersecting reductions (#15945) by Max191 · 1 year, 4 months ago
- 7606729 [spirv] NFC: sort pass pipelines update comment headers (#15950) by Lei Zhang · 1 year, 4 months ago
- 6f2c98f [LLVMGPU] Add multi-row vector reduction configuration for ROCm (#15941) by Jakub Kuderski · 1 year, 4 months ago
- 416e4b4 [Transform] Add transform.iree.gpu_distribute_shared_memory_copy (#15942) by Kunwar Grover · 1 year, 4 months ago
- 88c9992 [CodeGen] Switch tiling only cases to use scf::tileUsingSCFForOp (#15939) by Han-Chung Wang · 1 year, 4 months ago
- a910e74 [Codegen] Add VectorLayoutAnalysis (#15726) by Kunwar Grover · 1 year, 4 months ago
- 6dbc227 [CPU] Retire CPUDoubleTilingPadExpert pipeline. (#15931) by Han-Chung Wang · 1 year, 4 months ago
- 5dd6f34 [CodeGen][NFC] Rename tileLinalgOpsWithFilter to distributeLinalgOpsWithFilter. (#15938) by Han-Chung Wang · 1 year, 4 months ago
- 0842feb [bf16] Rework vector+bf16 support to avoid invalid conversion (#15911) by Rob Suderman · 1 year, 4 months ago
- 5519e24 Removing old HAL device matching attributes. (#15933) by Ben Vanik · 1 year, 4 months ago
- 8c99394 [GlobalOpt][DT] Retire ExpandVectors pass. (#15932) by Han-Chung Wang · 1 year, 4 months ago
- 46d9347 Tweaks to e2e matmul tests (#15930) by bjacob · 1 year, 4 months ago
- bc0b7d4 Fixing flow->stream affinity mapping. (#15929) by Ben Vanik · 1 year, 4 months ago
- a4a6b4b [CPU] Add support for converting math.powf from fp16 to fp32. (#15927) by Han-Chung Wang · 1 year, 4 months ago
- 98fa0b3 [LLVMGPU] Re-enable multidim distribution on gridDimension. (#15924) by Stanley Winata · 1 year, 4 months ago
- b5f1a83 Fix experimental/web[gpu] builds after HAL changes. (#15925) by Scott Todd · 1 year, 4 months ago
- 9d0a179 Mark tensorflow_resnet.ipynb as xfail. (#15926) by Scott Todd · 1 year, 4 months ago
- 9d2ec18 Add tests for iree-run-module inputs. (#15918) by Scott Todd · 1 year, 4 months ago
- 9a59315 [CPU] Add missing passes for ukernel fallback (#15917) by Jerry Wu · 1 year, 4 months ago
- 9726ead [spirv] Respect entry point ordinal when serializing executables (#15905) by Lei Zhang · 1 year, 4 months ago
- f81f361 Removing transfer_range from the HAL device vtable. (#15919) by Ben Vanik · 1 year, 4 months ago
- 80e70ca Replacing hal.ex.shared_device with hal.devices.* ops. (#15916) by Ben Vanik · 1 year, 4 months ago
- 3654fd6 Quote path to Python executable for lit tests on Windows. (#15915) by Scott Todd · 1 year, 4 months ago
- 5889a12 Update Falcon benchmarks by mariecwhite · 1 year, 4 months ago
- 605aca9 [CPU][ArmSME] Add (initial) tiling and lowering pipeline for ArmSME (#15794) by Benjamin Maxwell · 1 year, 4 months ago
- 6ebc2b8 Annotate dispatches with pure data-movement linalg operations better (#15745) by Kunwar Grover · 1 year, 4 months ago
- 7bdaf4a Drop Vulkan driver tests from `tools/test/`. (#15912) by Scott Todd · 1 year, 4 months ago
- 45febf7 Delete `tools/android/run_module_app`. (#15909) by Scott Todd · 1 year, 4 months ago
- 33a331b [CPU] Enable codegen fallback when ukernels are not available. (#15883) by Han-Chung Wang · 1 year, 4 months ago
- 82ae1ee [Codegen] Add VectorExt to CommonDialectRegistration (#15729) by Kunwar Grover · 1 year, 4 months ago
- 6a60b64 [docs] Add tips for profiling python programs with Tracy (#15908) by Jakub Kuderski · 1 year, 4 months ago
- 08e2021 [Cleanup] Delete tiling tensor.pad test from LinalgExt. (#15902) by Han-Chung Wang · 1 year, 4 months ago