Log - 29e70abcd352a4d037509e6ecf5e52507b738f4e - 3p/openxla/iree

a6a56a9 Add `LinalgFusionInterface` to support fusion for linalg_ext ops (added `scatter` and `reverse`) (#17428) by Ian Wood · 10 months ago
3d1364e [Codegen][GPU] Add pattern to lower iree_gpu.multi_mma to intrinsics (#17457) by Quinn Dawkins · 10 months ago
ab8f668 Revert "Data tiling: transpose narrow-N into narrow-M" (#17503) by Benoit Jacob · 10 months ago
e33ca89 [LinalgExt] Split TileAndDecomposeAttention (#17468) by Kunwar Grover · 10 months ago
322d688 [Codegen][GPU] Add pattern to drop lead unit dims of multi_mma ops (#17456) by Quinn Dawkins · 10 months ago
117cb43 Test 'console' provider in 'tracing' job. (#16454) by Scott Todd · 10 months ago
16bdaa9 Data tiling: transpose narrow-N into narrow-M (#17446) by lialan · 10 months ago
6c75aa1 [Codegen][GPU] Allow iree_gpu.tensor_barrier to take vectors (#17479) by Quinn Dawkins · 10 months ago
1750e2b Integrate LLVM at 855eef2abd81cb8c7543d4748353d5e378fdd4c2 (#17501) by Benoit Jacob · 10 months ago
051c361 NFC: Make a few loop transformations more accessible (#17489) by Quinn Dawkins · 10 months ago
cad02f9 [Codegen][GPU] Add unrolling pattern for iree_gpu.multi_mma (#17454) by Quinn Dawkins · 10 months ago
46c6bf5 [CPU] Add support for pack ukernel preparation. (#17472) by Han-Chung Wang · 10 months ago
a842527 [Codegen][GPU] Drop dead PassDetail.h file (#17490) by Quinn Dawkins · 10 months ago
63dff03 [Codegen][GPU] Add iree_gpu.tensor_barrier op (#17478) by Quinn Dawkins · 10 months ago
31e1a30 [Codegen][GPU] Add dictionary based lowering config attribute (#17463) by Quinn Dawkins · 10 months ago
008add9 [CodeGen][NFC] Rename DecomposeBatchMmt4DOps to CPUPrepareUkernels. (#17471) by Han-Chung Wang · 10 months ago
30e0238 Bump LLVM to llvm/llvm-project@bd3f5a4bd3d9d7ee8ae801c24c5081073b20abd4 (#17470) by MaheshRavishankar · 10 months ago
9fe159d [LinalgExt] Generalize attention tiling interface implementation (#17408) by Kunwar Grover · 10 months ago
1316c92 [Codegen] NFC: Move the lowering config to an attribute interface (#17439) by Quinn Dawkins · 10 months ago
7813fd3 [CPU] Fix a distribution bug and limiting distribution tile sizes. (#17436) by Han-Chung Wang · 11 months ago
d4aa849 [CPU] Add support for pack/unpack ukernels enablement on llvm-cpu path. (#17427) by Han-Chung Wang · 11 months ago
6c5198d Folding no-op stream.async.update ops away. (#17458) by Ben Vanik · 11 months ago
006af5d [GPU] Support specifying LLVMGPU backend target features (#17451) by Lei Zhang · 11 months ago
a36773a [Codegen][GPU] Add vectorization pattern for iree_gpu.multi_mma (#17453) by Quinn Dawkins · 11 months ago
f6a38ac [GPU] Thread through a common target description (#17217) by Lei Zhang · 11 months ago
62a996b [Codegen] Add lane distribution for scf.forall (#17373) by Quinn Dawkins · 11 months ago
080b1fa [Codegen][GPU] Add a contraction like operation for mma intrinsics (#17374) by Quinn Dawkins · 11 months ago
e0f3c05 [Codegen][GPU] Change iree_gpu.shuffle_tensor to take a region for the read (#17425) by Quinn Dawkins · 11 months ago
a3b74bc [CPU][ArmSME] Update tiling to use all SME accumulators (#16389) by Benjamin Maxwell · 11 months ago
6d95f8c Integrate LLVM at `74a87548` (clean) (#17423) by Ingo Müller · 11 months ago
4f8ee51 Moving demotion/promotion passes to input conversion. (#17422) by Ben Vanik · 11 months ago
dece30e [CPU] Do not decompose pack/unpack ops on x86 backends. (#17366) by Prashant Kumar · 11 months ago
f2fcbbf [iree][global] Add conv2d op to demote to bf16 pass (#17410) by Prashant Kumar · 11 months ago
3b5b70a Integrate LLVM at `1650f1b3` (clean) (#17418) by Ingo Müller · 11 months ago
b4fc0b4 Implementing the f64 VM extension and flipping the flag by default. (#17416) by Ben Vanik · 11 months ago
b716704 Update git-clang-format ref and clang-format version. (#16792) by Scott Todd · 11 months ago
8fcab13 [Flow] Improve annotation name for conv (#17417) by MaheshRavishankar · 11 months ago
356e2b7 [Codegen] Add op for flattening warp and thread ids of forall ops (#17368) by Quinn Dawkins · 11 months ago
90db41a [LLVMGPU] Add Winograd pipeline for LLVMGPU (#17302) by Max191 · 11 months ago
4021109 [Winograd] Add filtering by annotations for Winograd rewrites (#17332) by Max191 · 11 months ago
0260947 [GlobalOpt] Simplify the logic used to pick the groups. (#17405) by MaheshRavishankar · 11 months ago
9a294eb [Winograd] Use output_tile_size for more static output transform tiling (#17200) by Max191 · 11 months ago
748db31 Fuse Generic Ops Generated by `gather` Lowering (#17341) by Ian Wood · 11 months ago
428adf2 [LLVMGPU] Add debug prints for vector distribution config (#17404) by Jakub Kuderski · 11 months ago
2a8d681 [CPU] Remove CPUDoubleTilingPeelingExpert (#17329) by Andrzej Warzyński · 11 months ago
3bac7ec Add math expand patterns pass (#17395) by jinchen · 11 months ago
29a12f3 [Preprocessing] Remove `input=none` option from TransposeMatmulPass (#17364) by Benjamin Maxwell · 11 months ago
8d8d18c [LinalgExt] Simplify Attention unit tests (#17393) by Kunwar Grover · 11 months ago
a8404a8 [LLVMGPU] Preserve config dictionary during MapNestedForallToGpuThreadsOp application (#17381) by Kunwar Grover · 11 months ago
2ed4778 Integrate LLVM at `a1d43c14d` (+1 revert) (#17380) by Benoit Jacob · 11 months ago
06eb43d Use coalesce loops (#17314) by MaheshRavishankar · 11 months ago
4f27e64 Generalize overriding llvm func attr flags in translation info (#17365) by Kunwar Grover · 11 months ago
2a701d5 [LLVMGPU] Add translation_info config knobs to disable passes (#17340) by Jakub Kuderski · 11 months ago
45ca23e [CPU] Take native_vector_size into accounts for attention op tiling. (#17349) by Han-Chung Wang · 11 months ago
3625c60 Revert "Add math expand patterns pass" (#17367) by Scott Todd · 11 months ago
d657082 [LLVMGPU] Switch GPU passes to tablegen definitions. NFC. (#17361) by Jakub Kuderski · 11 months ago
a9ca8e6 Add math expand patterns pass (#17324) by jinchen · 11 months ago
47a5f99 [Codegen][GPU] Move MFMA/WMMA constructors to interface method (#17356) by Quinn Dawkins · 11 months ago
d2dd9e2 Replacing hal.tensor.export storage for hal.tensor.alias. (#17339) by Ben Vanik · 11 months ago
5337bd7 [Codegen] Add pattern for hoisting scf.forall from scf.for (#17312) by Quinn Dawkins · 11 months ago
a3b7e12 Integrate both llvm-project@2083e97e (+1 :leftwards_arrow_with_hook:, +1 :cherries:) and torch-mlir@bce800a3 (#17330) by Benoit Jacob · 11 months ago
c81496c [NFC][LinalgExt] Rename op functions from outdated naming conventions (#17333) by Max191 · 11 months ago
7baef75 [CPU] Add new attribute to control peeling (#17231) by Andrzej Warzyński · 11 months ago
035da66 [NFC] Fixing stray space and unneeded modules in some lit tests. (#17338) by Ben Vanik · 11 months ago
fc3561c [NFC][LinalgExt] Move tiling tests and implementations from IR to Transforms (#17216) by Max191 · 11 months ago
8ae8aaf [Preprocessing] Skip skinny matmuls during PadToIntrinsics. (#17323) by Stanley Winata · 11 months ago
4f2f8cf [LLVMGPU] Fix MMA schedule validation for unaligned shapes (#17317) by Max191 · 11 months ago
2d5c811 [LLVMGPU] Remove non-useful vector_distribution_pipeline_test test (#17318) by Max191 · 11 months ago
afb986e [LLVMGPU] Remove duplicate shared memory bank conflict pass (#17322) by Jakub Kuderski · 11 months ago
07d4fe6 [CPU] Integrate i8mm patterns from upstream (#17007) by Kojo Acquah · 11 months ago
25fd8a3 Fixing VM extui i1->i64 and adding extsi i1->i32/i64. (#17311) by Ben Vanik · 11 months ago
3d23684 [GPU] Introduce PadAndVectorDistribution lowering strategy. (#17234) by Han-Chung Wang · 11 months ago
273abbb [NFC] Cleaning up VM conversion patterns. (#17307) by Ben Vanik · 11 months ago
3277c21 [Codegen] Add pattern for lowering iree_gpu.shuffle_tensor (#17269) by Quinn Dawkins · 11 months ago
b7aa3b7 [Codegen] Add an op for fusing forall ops (#17279) by Quinn Dawkins · 11 months ago
3ca0a49 Moving OutlineConstantsPass to flow and adding parameter support. (#17303) by Ben Vanik · 11 months ago
6e76f28 Removing affinity from stream.resource.size. by Ben Vanik · 11 months ago
1496270 Preserving affinities with hal.tensor.import/export lowerings. by Ben Vanik · 11 months ago
ad79bc7 Attaching AffinityOpInterface to common early-phase ops. by Ben Vanik · 11 months ago
112fad0 Avoid folding globals with different dialect attrs. by Ben Vanik · 11 months ago
d4e1924 [Preprocessing] Remove global transpose matmul option (#17300) by Benjamin Maxwell · 11 months ago
4c9cb3c [Codegen] Add iree_gpu.shuffle_tensor op (#17257) by Quinn Dawkins · 11 months ago
d5e479d [LLVMGPU] Turn on the vector distribution pipeline by default (#17291) by Quinn Dawkins · 11 months ago
792f14d Move `EncodingAttr` and related ops from `LinalgExt` to a new `Encoding` dialect (#17277) by Benoit Jacob · 11 months ago
30aa5e5 [Winograd] Fix element type bug for Conv2DToWinograd with promotion (#17268) by Max191 · 11 months ago
efed94f [Codegen] Add op for copying tensor operands (#17256) by Quinn Dawkins · 11 months ago
71a9945 Adding `--iree-opt-import-parameters=` and cleaning up export. (#16828) by Ben Vanik · 11 months ago
e4ec8f1 [LLVMGPU] Follow-up to fix a bug in LLVMGPUPromoteMatmulToFitMMA pass. (#17264) by Han-Chung Wang · 11 months ago
8547374 [CPU] Limit unrolling factors for generic ops. (#17227) by Han-Chung Wang · 11 months ago
463fed4 Support JitGlobals on inline and outline flow dispatches. (#17259) by Ben Vanik · 11 months ago
e6d8aa7 [LLVMGPU] Introduce a pass that pad matmul to fit mma shapes. (#17225) by Han-Chung Wang · 11 months ago
8f6ecc5 Hide ExecutableVariantOp from TargetBackend pipeline factory methods. (#17255) by Ben Vanik · 11 months ago
e633d07 [GPU] Add option for no gpu.block_dim in GPUDistributeScfFor (#17214) by Max191 · 11 months ago
34449df [NFC] Refactoring MeshToFlow pass out from patterns. (#17245) by Ben Vanik · 11 months ago
e15968f Misc cleanups to flow dialect files. (#17243) by Ben Vanik · 11 months ago
7e2dd20 [Winograd] Generate winograd.filter_transform op in ConvertConv2DToWinograd (#17106) by Max191 · 11 months ago
f4a7df4 [Codegen] Make amdgpu_distribute_vectors return a handle (#17239) by Quinn Dawkins · 11 months ago
6233f4f Fixes to passes for custom dispatch to work with bf16 type (#17242) by Dave Liddell · 11 months ago
f54a861 [VectorDistribution] Add distribution for scalar broadcast (#17248) by Kunwar Grover · 11 months ago
9719aa2 [cpu][codegen] Fix crash in case of complex dtype (#17247) by Prashant Kumar · 11 months ago