opensecura / 3p / openxla / iree / fa3a144967a574ba59a3367e9d9ac710d16c562f / compiler
2b1a8e7
[LLVMGPU] Make FP8 VMFMA intrinsic discoverable by KernelConfig (#19022)
by Stanley Winata
· 5 months ago
1444755
[LLVMGPU] Add VMFMA for FP8 to align layouts between chained F8 contractions. (#19020)
by Stanley Winata
· 5 months ago
f71dd12
Integrate llvm-project@7c69491 (#19008)
by Quinn Dawkins
· 5 months ago
9e20e68
[LLVMGPU] Enable IGEMM for convolutions by default (#19006)
by Max191
· 5 months ago
a5537bc
[LLVMGPU] Teach KernelConfig to set MMA schedules per op in LoweringConfig (#18984)
by Stanley Winata
· 5 months ago
ec7528c
[Codegen][VectorExt] Fix VectorExt ops for 0-d vectors (#18915)
by Kunwar Grover
· 5 months ago
9c85e30
[iree.build] Implement iree-compile action. (#18993)
by Stella Laurenzo
· 5 months ago
05ec795
Add `InferIntDivisibilityInterface` for `arith.muli`. (#18994)
by MaheshRavishankar
· 5 months ago
2f15eeb
Integrates/llvm 20241101@e577f14 (#18987)
by Bangtian Liu
· 5 months ago
3bb7fd2
Use `IntegerRangeAnalysis` to get bounds of allocation. (#18991)
by MaheshRavishankar
· 5 months ago
046a705
[LLVMGPU] Create GPU pipeline option for IGEMM (#18981)
by Max191
· 5 months ago
db59070
[GPU] Disable unaligned to instrinsic batch matmul codegen with vector distribute (#18935)
by Nirvedh Meshram
· 5 months ago
bb542ee
[LLVMGPU] Add Virtual MFMA layout that maximizes load through adjusted K-width (#18930)
by Stanley Winata
· 5 months ago
20c8347
Fix typo in tile_and_distribute_to_workgroups.mlir test (#18982)
by Max191
· 5 months ago
0077358
[python] Add an iree.build package with API/tooling for program building. (#18630)
by Stella Laurenzo
· 5 months ago
8d3faf8
Revert "Propagate reshapes through generics with reduction… (#18968)
by Ian Wood
· 5 months ago
57fb10f
[NFC] Cleanups to flow op folders. (#18974)
by Ben Vanik
· 5 months ago
348dd47
Adding a HAL module debug sink interface. (#18966)
by Ben Vanik
· 5 months ago
df2b8a4
Integrate llvm-project @f1595ecfdce5387e41826fd72ff930a1a39ae398 (#18897)
by Max191
· 5 months ago
b38de27
Adjust `isFusableUsingTileAndFuse` in `SinkReshapes` (#18921)
by Ian Wood
· 5 months ago
2ec9017
Improving VM conversion performance. (#18957)
by Ben Vanik
· 5 months ago
a744285
[Codegen][LLVMGPU] Set global read layouts at linalg level (#18860)
by Kunwar Grover
· 5 months ago
12cb042
[Flow] Add pattern to canonicalize away full tensor.insert_slice ops (#18941)
by Quinn Dawkins
· 5 months ago
53813e8
[LLVMGPU] Use flat workgroup sizes in vector distribution (#18947)
by Kunwar Grover
· 5 months ago
78481a6
Propagate reshapes through generics with reduction iterators (#18857)
by Ian Wood
· 5 months ago
14f58e0
[ROCM] Turn on SLP vectorization (#18949)
by Nirvedh Meshram
· 5 months ago
554f31f
Adding a flag to force indirect command buffers on in non-reusable cases. (#18945)
by Ben Vanik
· 5 months ago
1f76cb7
GPU data tiling: reimplement getConcreteMFMALayout (#18953)
by Benoit Jacob
· 5 months ago
0bb6d92
Add `ReifyRankedShapedTypeOpInterface` to `hal.interface.binding.subspan` (#18946)
by MaheshRavishankar
· 5 months ago
26ba4fd
Switching VM's EraseUnusedCallOp pattern to a pass. (#18950)
by Ben Vanik
· 5 months ago
15ea0dc
[GlobalOpt] Prevent fusing transposed extend in RaiseSpecialOps (#18901)
by Cullen Rhodes
· 5 months ago
5fc340d
Registering the ROCDL dialect in init_mlir_dialects. (#18944)
by Ben Vanik
· 5 months ago
d1dd3e3
Add integer range inference to hal.buffer_view.dim and rank ops. (#18943)
by Stella Laurenzo
· 5 months ago
49ffdac
Enabling linking in the ROCM/CUDA compiler targets. (#18936)
by Ben Vanik
· 5 months ago
a321be2
Adding 'amdgpu' target device and flatbuffer for HAL executables. (#18933)
by Ben Vanik
· 5 months ago
4376117
[GPU] Do not treat pad as a tilable producer for operand promotion (#18918)
by Kunwar Grover
· 5 months ago
3cf5b65
[LinalgExt] Implement AggregateOpInterface for AttentionOp (#18890)
by Kunwar Grover
· 5 months ago
b31b033
Revert "[DispatchCreation] Run preprocessing before..." (#18934)
by Ian Wood
· 5 months ago
36caa05
[ROCM] Add flag to enable GlobalISel (#18922)
by Quinn Dawkins
· 5 months ago
fa752ae
[DispatchCreation] Run preprocessing before elementwise fusion (#18920)
by Ian Wood
· 5 months ago
3b69679
Enable the MLIR debug actions CL options in the compiler driver. (#18928)
by Stella Laurenzo
· 5 months ago
f4a5f13
Use workgroup_count_from_slice in Stream builtins (#18924)
by Quinn Dawkins
· 5 months ago
9d36cfa
[Codegen] Don't require full slice to decompose boundary pack and unpack ops (#18906)
by Max191
· 5 months ago
e66171a
[LinalgExt] Generalize attribute setting for attention decomposition (#18780)
by Kunwar Grover
· 5 months ago
a041798
[VectorDistribution] Add vector distribution support multi-dim reduction with scalars (#18800)
by Bangtian Liu
· 5 months ago
8806173
Revert "[DispatchCreation] Extend multi-use producer fusion" (#18917)
by Ian Wood
· 5 months ago
9731fed
Pass to block dynamic dimensions of operands of `iree_linalg_ext.attention`. (#18874)
by MaheshRavishankar
· 5 months ago
03c744e
[GPU] Support multiple contraction dims in MmaSchedules (#18720)
by Max191
· 5 months ago
0c2c627
[NFC] Update old naming from flow to dispatch creation (#18904)
by Ian Wood
· 5 months ago
55c5562
[LLVMGPU][NFC] Create LLVMGPU pass for IGEMM (#18871)
by Max191
· 5 months ago
c6b3592
[Dispatch Creation] Bubble up ExtractSliceOp with FillOp when the latter has multiple consumers (#18896)
by Nithin Meganathan
· 5 months ago
1aa5825
[LLVMGPU] Combine parallel and reduction padding in LLVMGPUPadAndVectorDistribute (#18771)
by Kunwar Grover
· 5 months ago
1fc6e5b
Add CDNA3 MFMA BF16 intrinsics. (#18892)
by Benoit Jacob
· 5 months ago
3b751a4
[LLVMCPU] Enable tileDispatchUsingForall as default (#18777)
by Prashant Kumar
· 5 months ago
e96e3c0
[VectorLayout] Fix insertion of new constOp for non dominate issue. (#18894)
by Stanley Winata
· 5 months ago
aef6e1f
[GPU] Bail out in GPUReduceBankConflicts if we have collapse_shape user (#18863)
by Nirvedh Meshram
· 5 months ago
4ad834b
Support F8E5M2FNUZ MFMA on CDNA3 (#18887)
by Benoit Jacob
· 5 months ago
2291b38
Support 8-bit floats in the compiler. (#18886)
by Benoit Jacob
· 5 months ago
abe3f89
Add conversions for 1x1 conv_2d to matmul (#18736)
by Ian Wood
· 5 months ago
c3fae2f
[LLVMGPU] Use forall workgroup distribution in TileAndFuse pipeline (#18565)
by Max191
· 5 months ago
4d20b82
Emit an error when affinity analysis fails. (#18883)
by Ben Vanik
· 5 months ago
9f5610d
Preserving `nosideeffects` on func.func -> util.func import. (#18882)
by Ben Vanik
· 5 months ago
e1469b2
[Codegen] Add pass to decompose pack unpack ops at dispatch boundaries (#18852)
by Max191
· 5 months ago
9c5b57a
Use FetchContent for both pybind11 and nanobind. (#18872)
by Stella Laurenzo
· 5 months ago
00104b5
Allow dynamic dimensions during folding of `tensor.expand_shape/collapse_shape` into `flow.dispatch.tensor.load/store`. (#18873)
by MaheshRavishankar
· 5 months ago
a400cde
[ROCM][NFC] Add option to control SLP vectorization in llvm optimizations (#18865)
by Nirvedh Meshram
· 5 months ago
e3f2d47
Bump torch-mlir to 140cad5 and update TorchOnnxToTorch conversion pipeline (#18867)
by Vivek Khandelwal
· 5 months ago
81c8b25
[Codegen] Allow multiple reduction dimensions in VectorDistribution (#18868)
by Kunwar Grover
· 5 months ago
b922a70
GPU data tiling: query the target's list of MMA intrinsics. Add FP8 test. (#18862)
by Benoit Jacob
· 5 months ago
bb71f7d
[Attention] Only clamp attention for low precision types (#18848)
by Kunwar Grover
· 5 months ago
4cc6671
[CPU] Limit vectorization tile sizes for SVE (#18846)
by Cullen Rhodes
· 5 months ago
d0269f3
Integrate llvm-project @864902e9b4d8bc6d3f0852d5c475e3dc97dd8335 (#18843)
by Max191
· 5 months ago
c08362a
GPU target parameters for data tiling. (#18839)
by Benoit Jacob
· 5 months ago
114a142
[LLVMGPU] Embed mma_intrinsic in to_layout and infer contraction's intrinsic from it. (#18842)
by Stanley Winata
· 5 months ago
66342ab
Reland #18804 (#18840)
by Maksim Levental
· 5 months ago
556c945
[Codegen] Fix bug in IGEMM pass for non conv contractions (#18838)
by Max191
· 5 months ago
df5e5aa
Revert "`iree_gpu` Python bindings (`GPUPipelineOptionsAttr`)" (#18833)
by Scott Todd
· 5 months ago
5546ae0
Run `compiler/src/iree/compiler/API/generate_exports.py`. (#18832)
by Scott Todd
· 5 months ago
2dffc9e
Fix MSVC compilation following #18682 (#18830)
by Benoit Jacob
· 5 months ago
012f8a6
[Codegen] Drop TransformStrategies (#18820)
by Quinn Dawkins
· 5 months ago
4f33005
Skip ROCM/test/opt_pass_plugin on Windows while broken. (#18823)
by Scott Todd
· 5 months ago
fb18c42
`iree_gpu` Python bindings (`GPUPipelineOptionsAttr`) (#18804)
by Maksim Levental
· 5 months ago
4b32a1d
[TileSwizzle] Make the dump and variable name match. (NFC) (#18821)
by Han-Chung Wang
· 5 months ago
0c6a151
Warn when --iree-llvmcpu-target-cpu defaults to "generic". (#18682)
by Benoit Jacob
· 5 months ago
fecccdc
[Flow] Fix FoldSplatReshapeIntoSplat pattern (#18818)
by Ian Wood
· 5 months ago
258cdb8
Opt into free-threaded Python (#18770)
by Marius Brehler
· 5 months ago
05bbcf1
Enable arithmetic optimizations as part of the stream simplification pipeline. (#18806)
by Stella Laurenzo
· 5 months ago
929a7da
Fixes a range inference overflow with util.align. (#18808)
by Stella Laurenzo
· 5 months ago
8da6ba2
Add util.assume.int folder. (#18805)
by Stella Laurenzo
· 5 months ago
1500641
Various tweaks to numeric optimizations found while looking at programs. (#18765)
by Stella Laurenzo
· 5 months ago
e2a352c
Integrates LLVM @ 36d936a2d057ddbd7822614edf01e39a0c21d654 (#18801)
by Vivian
· 5 months ago
a488d38
Add region to `linalg_ext.attention` (#18728)
by Rob Suderman
· 5 months ago
8568efa
[GPU] Adding support for opt pass plugins during AMDGPU executable serialization (#18347)
by Jose Manuel Monsalve Diaz
· 5 months ago
206b60c
[DispatchCreation] Extend multi-use producer fusion (#18551)
by Ian Wood
· 5 months ago
f9fa934
Integrates LLVM @ a758bcdbd92efb64a3482eb95d2769d74e33f5bb (#18783)
by Vivian
· 5 months ago
2945399
Revert `tensor.cast` to flow reshape conversion (#18772)
by Ian Wood
· 5 months ago
9f93073
[CMake] Don't update compile definitions for imported targets for MSCV (#18766)
by Maksim Levental
· 5 months ago
3ccd4f1
[Codegen] Replace LICM with a version that checks trip count (#18679)
by Quinn Dawkins
· 5 months ago
a3d8ad6
[ROCM] Fix feature flags for gfx1100 and improve flag handling (#18781)
by Kunwar Grover
· 5 months ago
afe18d2
[Codegen][GPU] Add tiling cleanup pattern to fuse pad without zero gaurd (#18748)
by Kunwar Grover
· 5 months ago