- c3518b2 [CPU] Unifly LLVMCPU cmd flags and variable names in KernelDisatpch.cpp (#16091) by Han-Chung Wang · 1 year, 3 months ago
- 02c5215 Revert "[Codegen] Re-Enable transform dialect configuration strategy sample (#15787)" (#16097) by Quinn Dawkins · 1 year, 3 months ago
- 562098f [VectorDistribution] Add infrastructure to support vector distribution based on layout (#16009) by Kunwar Grover · 1 year, 3 months ago
- 3b534c4 [Codegen] Re-Enable transform dialect configuration strategy sample (#15787) by Quinn Dawkins · 1 year, 3 months ago
- 8fb2680 Disable loop unrolling in LLVM IR optimization passes (#16092) by Benoit Jacob · 1 year, 3 months ago
- 4786ebc Remove SwiftShader Docker images and software Vulkan testing. (#15837) by Scott Todd · 1 year, 3 months ago
- dc81beb [CPU] Do not fuse ukernel ops into tiling loops. (#16054) by Han-Chung Wang · 1 year, 3 months ago
- baa911e Revert "Move Android benchmarks from Pixel 6 to Pixel 8" (#16090) by Jerry Wu · 1 year, 3 months ago
- 171e31c [cuda] Move to hal/drivers and wire up BUILD files (#14620) by Lei Zhang · 1 year, 3 months ago
- 74d1f01 [cuda] Break cyclic retain between device and device event pool (#16088) by Lei Zhang · 1 year, 3 months ago
- 381a16c [cuda] Fix deadlock when advancing deferred queue in driver thread (#15673) by Lei Zhang · 1 year, 3 months ago
- a7a7ad6 Add vm.buffer.hash and util.buffer.hash ops (#16003) by Quinn Dawkins · 1 year, 3 months ago
- 4b2aaaf Move Android benchmarks from Pixel 6 to Pixel 8 (#15796) by Jerry Wu · 1 year, 3 months ago
- 21d0153 Adding a robots.txt to iree.dev. (#16085) by Ben Vanik · 1 year, 3 months ago
- 198f271 [CPU] Fix multiconfig bug with tensor.pack op (#16082) by Max191 · 1 year, 3 months ago
- 81a47a7 Switch JAX pjrt-plugin link. (#15923) by Scott Todd · 1 year, 3 months ago
- c8ecc1c Reland "[spirv][vulkan] Enable device query generation and execution" (#16075) by Lei Zhang · 1 year, 3 months ago
- 2605fa1 Cherry-pick llvm/llvm-project@f5145f4dc819 to fix out-of-bounds access (#16074) by Han-Chung Wang · 1 year, 3 months ago
- 282ab77 Revert "[spirv][vulkan] Enable device query generation and execution" (#16077) by Han-Chung Wang · 1 year, 3 months ago
- 182a8f3 [HIP] Adds graph command buffer & descriptor set and pipeline layout (#15910) by Nithin Meganathan · 1 year, 3 months ago
- 852684a [spirv][vulkan] Enable device query generation and execution (#15977) by Lei Zhang · 1 year, 3 months ago
- b55ba25 Fixing/silencing some warnings that have crept in over time. (#16072) by Ben Vanik · 1 year, 3 months ago
- 776789e [GlobalOpt] Add a pass to simplify tensor pack/unpack ops. (#15993) by Han-Chung Wang · 1 year, 3 months ago
- c1edb82 [CodeGen] Implement MemoryEffectsOpInterface for ukernel ops. (#16053) by Han-Chung Wang · 1 year, 3 months ago
- 6aa310c [CPU] Move checking stack allocation cmd flag to Passes.cpp (#16062) by Han-Chung Wang · 1 year, 3 months ago
- bd2c92d [Stream] Update more op folders to verify matching types (#16070) by Quinn Dawkins · 1 year, 3 months ago
- d21a99c Check for source location resolution function in dynamic modules (#16065) by Quinn Dawkins · 1 year, 3 months ago
- 46b06d0 [minor code simplification] Implement algorithm without stack (#15999) by James Newling · 1 year, 3 months ago
- b16cee3 Bump LLVM to llvm/llvm-project@054b5fc0fd41 (#16055) by Han-Chung Wang · 1 year, 3 months ago
- ccd576c [LLVMGPU] Add AMDGPUToArith conversion patterns to ROCDL lowering (#16067) by Quinn Dawkins · 1 year, 3 months ago
- f0c8380 [LinalgExt] Retire RewriteForallToScfForOp transform op. (#16064) by Han-Chung Wang · 1 year, 3 months ago
- 6647a5b [CPU] Skip tiling if the compute op is not a TilingInterface op. (#16052) by Han-Chung Wang · 1 year, 3 months ago
- 124d562 Bump StableHLO to f8dcebfa1ec166806974f6ae0dfb902d36b47238 (#16049) by Jacques Pienaar · 1 year, 3 months ago
- d6dad12 ukernel: unroll the s16u4 VNNI ukernel, and drop the unused N0=16 variant (#16047) by Benoit Jacob · 1 year, 3 months ago
- ef344ac Bump LLVM to llvm/llvm-project@6b65d79 and deps (2023-12-29) (#16012) by Kunwar Grover · 1 year, 3 months ago
- b3200c8 [CPU] Enable mmt4d distribution for large reduction size cases. (#16037) by Han-Chung Wang · 1 year, 3 months ago
- 8869777 Add crosscompile utility binaries back to iree-dist tarball (#16034) by CindyLiu · 1 year, 3 months ago
- db83cc4 [LinalgExt] Delete fuse_producer transform op. (#16044) by Han-Chung Wang · 1 year, 3 months ago
- 0f0e0e7 [CodeGen] Carry over lowering_config when decomposing batch_mmt4d ops. (#16043) by Han-Chung Wang · 1 year, 3 months ago
- 6ac9b7e [LinalgExt] Expose attention tile size parameter (#16030) by harsh-nod · 1 year, 3 months ago
- 6711155 Add folding arithmetic extensions (#15953) by erman-gurses · 1 year, 3 months ago
- c4739bc [LinalgExt] Delete LinalgExt tiling patterns and passes. (#15921) by Han-Chung Wang · 1 year, 3 months ago
- ded4145 [LinalgExt] Switch tiling LinalgExt tests to use transform dialect. (#15904) by Han-Chung Wang · 1 year, 3 months ago
- 957af54 [LinalgExt] Switch distribution tests to use transform dialect. (#15922) by Han-Chung Wang · 1 year, 3 months ago
- e4aa589 [CodeGen] Add aflag to allow potentially to remove unnecessary code to improve performance. (#15862) by Lubomir Litchev · 1 year, 3 months ago
- ce282c8 [stablehlo] Add missing nullptr check for unregistered dialects (#16032) by Jakub Kuderski · 1 year, 3 months ago
- f7b108f Bump website copyright to 2024. (#16028) by Scott Todd · 1 year, 3 months ago
- 41da229 [CPU][NFC] Retire LLVMCPUTensorPad pass. (#16027) by Han-Chung Wang · 1 year, 3 months ago
- 80efa38 [GlobalOpt] Add f32->bf16 demotion cases for transposed matmuls (#16022) by Max191 · 1 year, 3 months ago
- b92ceb4 Remove SYSTEM scope from transitive includes. (#16018) by Stella Laurenzo · 1 year, 4 months ago
- 8a87bf1 Disable -Waddress warnings on GCC. by Stella Laurenzo · 1 year, 4 months ago
- 3e0583f Some CMake package install ergonomics. (#16015) by Stella Laurenzo · 1 year, 4 months ago
- 895645b [VectorLayoutAnalysis] Add transfer functions for vector.contract (#15996) by Kunwar Grover · 1 year, 4 months ago
- f9cdcfd [python] Expose python bindings for scf in iree.compiler.dialects (#16013) by Kunwar Grover · 1 year, 4 months ago
- e7384a1 [VectorLayoutAnalysis] Add debug printing (#16007) by Kunwar Grover · 1 year, 4 months ago
- c35d8e9 Standardizes CMake setup of C directory trees behind a macro. (#16011) by Stella Laurenzo · 1 year, 4 months ago
- 1ae94a5 [ROCM] Expose amdgpu-waves-per-eu opt hint (#16010) by harsh-nod · 1 year, 4 months ago
- 15c306f Build functioning dev packages for IREECompiler and IREERuntime. (#16008) by Stella Laurenzo · 1 year, 4 months ago
- b0e8f3c [VectorLayoutAnalysis] Fix bug in scf.for transfer functions (#15989) by Kunwar Grover · 1 year, 4 months ago
- 4592b8f [torch] Bump torch-mlir to d560698e3d610ecdc56667c713e2338c47bf4f44. (#16006) by Stella Laurenzo · 1 year, 4 months ago
- ccbe33f [VectorExt] Add layout iterator classes (#16004) by harsh-nod · 1 year, 4 months ago
- 9cde4e3 [VectorExt] Add folders for to_simt and to_simd (#15997) by Kunwar Grover · 1 year, 4 months ago
- 5bac47b [NFC][Codegen] Move Codegen/Dialect to Codegen/Dialect/Codegen/IR (#15944) by Kunwar Grover · 1 year, 4 months ago
- a3af0b0 Fix producer-consumer graph in `ConstExprAnalysis` (#15913) by Jerry Wu · 1 year, 4 months ago
- 0af34bd [GlobalOpt] Add quantized matmul reassociation support for f16 types (#15964) by Max191 · 1 year, 4 months ago
- 8d9638e [Flow] Loosen restrictions for dequantization fusion (#15663) by Max191 · 1 year, 4 months ago
- 5e33995 [onnx] Enable --iree-input-type=onnx (#15995) by Stella Laurenzo · 1 year, 4 months ago
- 7b7ffeb [onnx] Add ONNX importer and iree-import-onnx tool to compiler package. (#15920) by Stella Laurenzo · 1 year, 4 months ago
- 193bc27 Improve feedback for missing flags in iree-dump-parameters (#15985) by Ilija Kalinić · 1 year, 4 months ago
- 09c9f5b [spirv] Provide same entry point set in variants when linking (#15935) by Lei Zhang · 1 year, 4 months ago
- 5b8e870 [Flow] Relax dynamic shape restrictions on slice -> Flow patterns (#15861) by Quinn Dawkins · 1 year, 4 months ago
- 9ba681c [GlobalOpt][NFC] Follow LLVM comment style and apply few cleanups. (#15990) by Han-Chung Wang · 1 year, 4 months ago
- ded5262 [GlobalOpt] Remove specialized check-prefix from lit tests. (#15991) by Han-Chung Wang · 1 year, 4 months ago
- 6f969db [VectorExt] Add to_simt and to_simd operations (#15987) by Kunwar Grover · 1 year, 4 months ago
- 03f7079 Move VMVX benchmarks to x86_64 runner (#15969) by Jerry Wu · 1 year, 4 months ago
- 109d174 Add a f32-to-bf16 demotion pass for matmuls, and new compiler flag `--iree-global-opt-enable-demote-contraction-inputs-to-bf16` (#15946) by bjacob · 1 year, 4 months ago
- 86511e9 drop redundant `populateExpandBFloat16Patterns` call in `ConvertToLLVM` (#15986) by bjacob · 1 year, 4 months ago
- 73f5947 Align the Flow::DispatchTensorStoreOp verifier with tensor.insert_slice (#15959) by Quinn Dawkins · 1 year, 4 months ago
- 829b398 [VM] Add a vm.rodata.table.inline op (#15971) by Quinn Dawkins · 1 year, 4 months ago
- 4e9199b Integrate llvm/llvm-project@99045b60b575 and bump deps (2023-12-19) (#15972) by Han-Chung Wang · 1 year, 4 months ago
- 92df2b4 Make iree.compiler.api.Output.map_memory() retain its backing reference. (#15975) by Stella Laurenzo · 1 year, 4 months ago
- da8570c [GlobalOpt] Turn transpose propagation off by default. (#15979) by Han-Chung Wang · 1 year, 4 months ago
- 166dcc5 [GlobalOptimization] Add pass for transpose propagation (#15858) by Quinn Dawkins · 1 year, 4 months ago
- 39576d0 [spirv] Trim provided executable target before linking (#15888) by Lei Zhang · 1 year, 4 months ago
- d29890c Add Falcon7b int4 to CI benchmarks by mariecwhite · 1 year, 4 months ago
- a0919c5 [Codegen] Bail TileAndDistributeToWorkgroups if the workgroup count is lowered (#15885) by Quinn Dawkins · 1 year, 4 months ago
- e2ba0a2 Updating HAL passes to use tablegen. (#15952) by Ben Vanik · 1 year, 4 months ago
- 0bd654f [compiler] NFC: Move op equivalence into a file in Utils/ (#15966) by Lei Zhang · 1 year, 4 months ago
- a5f0b29 [CPU] Unify distribution tile size selection heuristic. (#15928) by Han-Chung Wang · 1 year, 4 months ago
- 873ba77 [CPU][NFC] Update variable names to be camelCase. (#15967) by Han-Chung Wang · 1 year, 4 months ago
- 0ecbe75 [SPIRV] Switch tiling on reductions to use common helpers. (#15940) by Han-Chung Wang · 1 year, 4 months ago
- a0bf55c [Codegen] Switch to Util helper for getting bit widths (#15957) by Quinn Dawkins · 1 year, 4 months ago
- 6c1b1ab [GPU] Simplify gpu warp reduction lowering (#15963) by Jakub Kuderski · 1 year, 4 months ago
- e2dcbe1 Add support for serializing complex resource attributes (#15960) by Quinn Dawkins · 1 year, 4 months ago
- 18ed8d5 Bump torch-mlir to 9c655d0bfb166785dc17e51b0afa1f937c227cef (#15956) by Quinn Dawkins · 1 year, 4 months ago
- 0c0eb27 [spirv] Cleanup and reorder functions in SPIR-V passes (#15958) by Lei Zhang · 1 year, 4 months ago
- a5c48ed Update to wolfpld/tracy@af73dba7 (2023-12-5) (#15807) by Lei Zhang · 1 year, 4 months ago
- 6d43d9c Integrate llvm/llvm-project@395f9ce30e88 (2023-12-16) (#15954) by Lei Zhang · 1 year, 4 months ago
- f97aa4d [HIP] Adds support for native executable and cache (#15937) by Nithin Meganathan · 1 year, 4 months ago
- fac5c59 [Flow] Enable fusion of `tensor.unpack` with non-intersecting reductions (#15945) by Max191 · 1 year, 4 months ago