- f1bbc57 [LLVMGPU] Add loop invariant code motion before software pipelining (#12540) by Thomas · 2 years, 1 month ago
- 1148f72 Remove memory side effect from hal.interface.binding.subspan (#12494) by Quinn Dawkins · 2 years, 1 month ago
- 9a135c2 [CUDA] Remove workaround running passes in old pass manager (#12516) by Thomas · 2 years, 1 month ago
- d5a9b1b Adding compilation reentrancy tests and new HAL pipeline phases. (#12503) by Ben Vanik · 2 years, 1 month ago
- 70e6e52 Add additional options to the ApplyPatternsOp (#12519) by Nicolas Vasilache · 2 years, 1 month ago
- 8007d74 Adding min/max VM ops and VM buffer allocation alignment. (#12522) by Ben Vanik · 2 years, 1 month ago
- e6425d6 Starting support for HAL dispatch specialization. (#12483) by Ben Vanik · 2 years, 1 month ago
- ff9bc6a Bumping bytecode version. by Ben Vanik · 2 years, 1 month ago
- 3e6130e Adding alignment to vm.buffer.alloc and vm.buffer.clone. by Ben Vanik · 2 years, 1 month ago
- 459f5ff Adding i32/i64/f32 min/max ops. by Ben Vanik · 2 years, 1 month ago
- 84b4802 Fully retire CanonicalizedSequenceOp (#12467) by Nicolas Vasilache · 2 years, 1 month ago
- 769ffda Evolve transform dialect usage towards non-blanket-canonicalized sequences (#12465) by Nicolas Vasilache · 2 years, 1 month ago
- d015bbc Integrate llvm-project at 33cf40122279342b50f92a3a53f5c185390b6018 (#12475) by Jakub Kuderski · 2 years, 1 month ago
- 08ae97f Refreshing VM performance by separating verification and tweaking buffer access. (#12426) by Ben Vanik · 2 years, 1 month ago
- 91e73d3 Adding requirement bits to bytecode modules and bumping version. by Ben Vanik · 2 years, 1 month ago
- 38abd13 Add bazel `iree_link_compiler_shared` flag. (#12508) by Stella Laurenzo · 2 years, 1 month ago
- aaa47cd Switching vm.buffer.fill to index with elements. by Ben Vanik · 2 years, 1 month ago
- fef4534 Build libIREECompiler.so in bazel. (#12507) by Stella Laurenzo · 2 years, 1 month ago
- 98a9105 Remove duplicate sources from Common (are in CommonPasses). (#12505) by Stella Laurenzo · 2 years, 1 month ago
- 79cbc19 Fail vectorize pad pattern when the pad value is not constant. (#12484) by MaheshRavishankar · 2 years, 1 month ago
- f43f8a3 Pad fusion bufferization workaround. (#12425) by MaheshRavishankar · 2 years, 1 month ago
- 787b518 Add `i1` type propagation for named Linalg ops. (#12474) by MaheshRavishankar · 2 years, 1 month ago
- 23ca7bb [LLVMGPU] Enhance TensorPad pass to handle tensor.unpack ops. (#12458) by Han-Chung Wang · 2 years, 1 month ago
- da22c84 Re-Enable reduction masking (#12472) by Murali Vijayaraghavan · 2 years, 1 month ago
- 98061e3 Fix incorrect API usage in RewritePatterns (round 1) (#12466) by Matthias Springer · 2 years, 1 month ago
- 4674a50 Reserving x18 for AArch64 platforms (#12460) by Murali Vijayaraghavan · 2 years, 1 month ago
- 7875b8e Revert the latest integrate to fix issues with `--iree-codegen-llvmgpu-enable-transform-dialect-jit` (#12451) by Jakub Kuderski · 2 years, 1 month ago
- 51bdf0a Work around test failures in `mobilenetv3_fake_weights` (#12445) by Jakub Kuderski · 2 years, 1 month ago
- 9c00793 fix k dimension for matmul ukernel (#12443) by bjacob · 2 years, 1 month ago
- 47872bc Passing constant uniform operands to dispatches and ignores dynamic ones. (#12432) by Ben Vanik · 2 years, 1 month ago
- 902aada Fixing symbol DCE stripping executable sources on round-tripping. (#12429) by Ben Vanik · 2 years, 1 month ago
- 259d709 [NFC] Add proper debug messages to LLVMGPUUtils to be better informed… (#12437) by Nicolas Vasilache · 2 years, 1 month ago
- 908c427 Integrate llvm-project at 523640f7a4c0 (#12419) by Jakub Kuderski · 2 years, 1 month ago
- 564ede9 Retire obsolete transforms and tests (#12436) by Nicolas Vasilache · 2 years, 1 month ago
- 21aeee8 [Common][TransformDialect] Fix the usage of num_threads for TileToFor… (#12438) by qcolombet · 2 years, 1 month ago
- 0312dd3 [GPU] Add a new pipeline optimized for nvidia ampere target (#12427) by Thomas · 2 years, 1 month ago
- e1c15ed [LLVMGPU] Add good tile size for large matmul in tensorcore pipeline (#12430) by Thomas · 2 years, 1 month ago
- 8f23c69 Allow 0-D interface MemRef in FlattenMemRefSubspan (#12414) by Lei Zhang · 2 years, 1 month ago
- 475d64d Adding FMA folding in the VM. (#12382) by Ben Vanik · 2 years, 1 month ago
- bc2a673 Enable hoisting uniform memref.load in reduction pipeline (#12421) by Lei Zhang · 2 years, 1 month ago
- 26fc399 [vulkan][spirv] Plumb through support for KHR Integer Dot Product (#12424) by Jakub Kuderski · 2 years, 1 month ago
- 17eafc9 [LLVMGPU] Enable tensor.pack op e2e execution on cuda. (#12370) by Han-Chung Wang · 2 years, 1 month ago
- 6ae5762 support unspecified channel_handle and use_global_device_ids (#12412) by Okwan Kwon · 2 years, 1 month ago
- 4a65a33 Retire most of LinalgExt::(Un)PackOp usages and transformations. (#12253) by Han-Chung Wang · 2 years, 1 month ago
- 4a8d063 Drop the tensor.pack/unpack -> LinalgExt lowering from transform dialect (#12401) by Han-Chung Wang · 2 years, 1 month ago
- f1c1799 Switching to use tensor.pack/unpack ops for data-tiling (#12247) by Han-Chung Wang · 2 years, 1 month ago
- 6a84459 Fixes tiling sizes for pulled producers in TileAndDistribute (#12399) by Han-Chung Wang · 2 years, 1 month ago
- 96b61ba [metal] Populate proper capability and limits for SPIR-V CodeGen (#12391) by Lei Zhang · 2 years, 1 month ago
- 69d4380 Promote bf16 to f32. (#12344) by Daniel Ellis · 2 years, 1 month ago
- c365243 Revert "Enable reduction masking (#12003)" (#12384) by Stella Laurenzo · 2 years, 1 month ago
- 7465d7e [LLVMGPU] Improve how we distribute small inner shapes (#12368) by Thomas · 2 years, 1 month ago
- b4490f2 [flow] NFC: Move code for cloning producers into RegionOpUtils (#12349) by Lei Zhang · 2 years, 1 month ago
- c35411b Strongly home flag registration statics in C++ files. (#12372) by Stella Laurenzo · 2 years, 1 month ago
- cf49d69 Moving bytecode files under iree/vm/bytecode/. (#12371) by Ben Vanik · 2 years, 1 month ago
- c319c2d Adding initial dispatch instrumentation support. (#12357) by Ben Vanik · 2 years, 1 month ago
- 289b9a1 Add IREE_COMPILER_BUILD_SHARED_LIBS option. (#12310) by Stella Laurenzo · 2 years, 1 month ago
- 7a6b21b Minor fixes found while enabling micro-kernel usage e2e. (#12354) by MaheshRavishankar · 2 years, 1 month ago
- a23b52b Remove ListenerGreedyPatternRewriteDriver (#12358) by Matthias Springer · 2 years, 1 month ago
- f8a4aa2 Integrate llvm-project at 219ba2fb7b0a (#12342) by Han-Chung Wang · 2 years, 1 month ago
- f3ea9ee Blocking stream resource subview walks at timeline ops. (#12353) by Ben Vanik · 2 years, 1 month ago
- cd92019 Enable reduction masking (#12003) by Diego Caballero · 2 years, 1 month ago
- 689e0fa [spirv] Support multiple reductions fused in single dispatch cases (#11975) by Lei Zhang · 2 years, 1 month ago
- eba1769 Add explicit template instantiations that seem to fail with old compi… (#12340) by Nicolas Vasilache · 2 years, 1 month ago
- f20da41 Integrate llvm/llvm-project@c33fd3b47faa (#12320) by Han-Chung Wang · 2 years, 1 month ago
- 309dc5b Separate the compiler stable API into compiler/bindings/c. (#12332) by Stella Laurenzo · 2 years, 1 month ago
- 17e1111 Adding sym_visibility to vm.import and fixing parsing. (#12325) by Ben Vanik · 2 years, 1 month ago
- fee7018 Adding support for the `vm.import.module` attribute to force linking. (#12328) by Ben Vanik · 2 years, 1 month ago
- 023b0de Adding `--iree-hal-preprocess-executables-with=` option. (#12313) by Ben Vanik · 2 years, 1 month ago
- b27c48a Enable conversion of `depthwise_conv_2d_nhwc_hwcm_q` to its fixed-point counterpart (#12314) by Diego Caballero · 2 years, 1 month ago
- 5c78e87 Rework the logic that avoids fusions when that lead to stack allocations. (#12277) by MaheshRavishankar · 2 years, 1 month ago
- 49de0b6 Update `ResolveBufferDescriptors` to handle `memref.extract_strided_metadata` (#12205) by MaheshRavishankar · 2 years, 1 month ago
- c9ef78a Data-tiling encodings: rename RHS_TRANSPOSE to RHS (#12298) by bjacob · 2 years, 1 month ago
- 3b1a0ac Fix distribution to warps (#12306) by Thomas · 2 years, 1 month ago
- c7b2912 Update references from `iree-org` to `openxla`. (#12304) by Scott Todd · 2 years, 1 month ago
- 1f0b62b Fix mhlo.scatter with i64 indices by casting to i32 (#12302) by Rob Suderman · 2 years, 1 month ago
- 38ad88e Adding support for linking bitcode files in LLVM CPU/GPU targets. (#12265) by Ben Vanik · 2 years, 1 month ago
- c889a4c Adding `iree_bitcode_library` using our own clang build. (#12097) by Ben Vanik · 2 years, 1 month ago
- de17d3d Scoping the tensor.empty -> flow pass to top level ops. (#12303) by Ben Vanik · 2 years, 1 month ago
- 7f6c57c Generating iree/builtins/device using our own clang. by Ben Vanik · 2 years, 2 months ago
- 5c2172c Added ComplexToStandardPass to the LLVM compilation pipelines (#12273) by Rob Suderman · 2 years, 1 month ago
- 2277151 Add transform dialect op to allow hoisting bounded allocs in a function (#12284) by Nicolas Vasilache · 2 years, 1 month ago
- a4c74dc Extend MapNestedForallToGpuThreadsOp to support distribution to warpId (#12272) by Thomas · 2 years, 1 month ago
- 96d959e Add an option to disable use of bundled LLVM. (#12256) by Stella Laurenzo · 2 years, 1 month ago
- 1f3295a [rv32] Enable 32-bit mulsi_extended lowering for +zve32f (#12266) by Diego Caballero · 2 years, 1 month ago
- 2f40854 [spirv] Add Mali i8 matmul config (#12271) by Jakub Kuderski · 2 years, 1 month ago
- 03b368c [NFC] Centralizing dispatch region constant inlining. (#12235) by Ben Vanik · 2 years, 1 month ago
- f59bd6e Remove `hal.interface.binding.subspan` ops that are only used `memref.assume_alignment_ops`. (#12269) by MaheshRavishankar · 2 years, 1 month ago
- ad8782e Integrate llvm/llvm-project@eb141867 (#12264) by Lei Zhang · 2 years, 1 month ago
- 5b8757b Adding `--iree-hal-substitute-executable=` flag. (#12240) by Ben Vanik · 2 years, 1 month ago
- e752a83 [spirv] Enable vectorized codegen for i8 matmul (#12262) by Jakub Kuderski · 2 years, 1 month ago
- 14828ec Pass by reference in reproducer causes crash (#12257) by Rob Suderman · 2 years, 1 month ago
- e231b39 Making paths required on `#hal.executable.object`. by Ben Vanik · 2 years, 1 month ago
- 9955ff0 Adding `--iree-hal-substitute-executable-*=` flags. by Ben Vanik · 2 years, 1 month ago
- cd5347c Support mhlo collective ops (#11988) by Okwan Kwon · 2 years, 1 month ago
- 261471c Plumb vector to mma.sync through the transform dialect (#12244) by Nicolas Vasilache · 2 years, 1 month ago
- f560f42 [NFC] Switch to use upstream transform::TrivialPatternRewriter. (#12245) by Han-Chung Wang · 2 years, 1 month ago
- 6d4aadb [NFC] Switch to use upstream mlir::verifyCompatibleShape method. (#12243) by Han-Chung Wang · 2 years, 1 month ago
- 6be3f5a [rv32] Expand `arith.mulsi_extended` before going to LLVM (#12241) by Jakub Kuderski · 2 years, 1 month ago
- 1290401 Integrate llvm/llvm-project@4f15267d (#12226) by Lei Zhang · 2 years, 1 month ago
- a04c262 [spirv] Vectorize integer extend ops in lowering to subgroup_mma (#12202) by Quinn Dawkins · 2 years, 1 month ago