Sign in
opensecura
/
3p
/
openxla
/
iree
/
HEAD
818e59f
Bump to llvm/torch-mlir@3cebce2 (#22902)
by zjgarvey
· 5 hours ago
main
45017a7
[Codegen] Add PCF bufferization interfaces (#22805)
by Quinn Dawkins
· 7 hours ago
d8b504f
[ROCM][DT] Add architecture matching to ukernel_info attribute (#22899)
by Zhewen Yu
· 8 hours ago
b361bf7
Bump torch-mlir (#22894)
by Rahul Kayaith
· 10 hours ago
c54764e
[Codegen] Add PCF dialect (#22804)
by Quinn Dawkins
· 11 hours ago
5ed9041
Update shark/SHARK to amd-shark/AMD-SHARK in documentation and URLs in IREE (#22883)
by Bangtian Liu
· 11 hours ago
aba450e
Integrates/llvm 20251212 (#22897)
by Bangtian Liu
· 13 hours ago
1e45c84
[Stream] Handle AsyncCloneOp in UnifyEncodingForGlobals tracing. (#22895)
by Han-Chung Wang
· 20 hours ago
bc6c3fa
Integrates/llvm 20251211 (#22885)
by Bangtian Liu
· 2 days ago
latest-snapshot
a33b55a
[LDS] Improve multiple transfers per lane (#22879)
by Alan Li
· 2 days ago
d0dd889
Integrates/llvm 20251210 (#22876)
by Bangtian Liu
· 2 days ago
248f274
[Codegen] Do not swap extract_slice and collapse_shape for a special case (#22870)
by Vivian Zhang
· 3 days ago
1eb6407
[Encoding] Add resolver swizzle verification (#22867)
by Jorn Tuyls
· 3 days ago
f2156d7
[DT][SVE] adjust tile sizes for mmt4d & disable transposition of narrow-N matmuls (#21701)
by Ege Beysel
· 3 days ago
0f64f41
[bindings] Add `iree_tensor_ext` to python bindings (#22872)
by Rahul Kayaith
· 3 days ago
d2e92fa
[DT][GPU] Redefine intrinsics M/N interleaving (#22812)
by Zhewen Yu
· 3 days ago
4581198
[Pkgci] Update golden dispatch counts (#22869)
by Ian Wood
· 3 days ago
0711973
[Codegen][GPU]Skip prologue pipeline barriers only for non-nested pipelined loops (#22868)
by Zhuoran Yin
· 3 days ago
031445e
Bump the github-actions group across 1 directory with 3 updates (#22858)
by dependabot[bot]
· 4 days ago
dd5f28a
Integrates/llvm 20251209 (#22864)
by Bangtian Liu
· 4 days ago
6ccb336
[libcall] Update musl Makefile to include missing files. (#22859)
by Alan Li
· 4 days ago
fea540f
[Encoding] Add verifier for gpu/cpu/vmvx_encoding_resolver (#22838)
by Jorn Tuyls
· 4 days ago
3585436
[Stream] Add UnifyEncodingForGlobals pass. 1/n (#22767)
by Han-Chung Wang
· 4 days ago
0c1753c
[Codegen] Use workgroup_count_hint for most code paths (#22549)
by Quinn Dawkins
· 4 days ago
621a5e9
Integrates/llvm 20251208 (#22856)
by Bangtian Liu
· 4 days ago
ac28c58
Revert "[Util] Implement InferIntDivisibilityOpInterface for affine o… (#22860)
by Ian Wood
· 4 days ago
21684f2
[LDS] Add AMDGPULowerCoalescedDMAToGatherLDS pass for direct global to LDS loads (#22356)
by Alan Li
· 4 days ago
fd4ff2b
[Dispatch Creation] Create more multi-use dispatches (#22011)
by Ian Wood
· 4 days ago
b44836e
[Torch Models] Fix SDXL golden dispatch counts (#22855)
by Ian Wood
· 4 days ago
69fc446
Add CDNA3 test filtering to CI (#22848)
by Alan Li
· 4 days ago
fa24a3a
[Util] Implement InferIntDivisibilityOpInterface for affine ops (#22723)
by Max191
· 4 days ago
e1694aa
[tests][e2e] Add more llama related shapes (#22831)
by Muzammiluddin Syed
· 5 days ago
21b6cbb
Unify RISC-V toolchain environment variables and remove default path (#22710)
by Han-Kuan Chen
· 5 days ago
8f7ab2c
[Codegen] Add WorkgroupCountHintOp to defer populating the workgroup count (#22533)
by Quinn Dawkins
· 5 days ago
e1eba96
[Encoding] Add verifier for iree_encoding.layouts (#22850)
by Jorn Tuyls
· 5 days ago
c6a044b
[Stream] Encode packed_storage device and host tensors (#22722)
by Lukas Sommer
· 5 days ago
70b2b45
[Encoding] Improve specialized encoding usage in lit test (#22851)
by Jorn Tuyls
· 5 days ago
d49d410
[SPIRV][Codegen] Use single subgroup when reduction consumer has non-distributable broadcast (#22832)
by Eric Feng
· 5 days ago
8f6b259
Fix deadlock in `ROCMDialect::getMlirUKernels` (#22843)
by Benoit Jacob
· 5 days ago
6a28284
[Codegen] Add vector.to/from_elements to bf16 -> i16 conversion (#22846)
by Quinn Dawkins
· 7 days ago
09f5095
[Codegen][GPU] Allow channel first layouts to lower through direct convolution path (#22840)
by Vivian Zhang
· 7 days ago
70b44fa
[e2e][ukernel] Remove dead/duplicate tests (#22834)
by Zhewen Yu
· 7 days ago
554753b
[Runtime][HIP] Correct O(log n) bound search logic and eliminate O(n) loop(#22733) (#22734)
by NohHyeon Kwon
· 7 days ago
3120a77
ElideAsyncCopiesPass refactoring for SCF/transfer support. (#22739)
by Ben Vanik
· 7 days ago
7f5aca2
Fix tests: `noubsan` was not being honored, and MXFP4 matmul tests are static-shape-only (#22836)
by Benoit Jacob
· 8 days ago
883d466
[Encoding] Use struct directive for TestingAttr assembly format (#22826)
by Jorn Tuyls
· 8 days ago
1ccd5d7
Add jtuyls and Yu-Zhewen to CODEOWNERS for ROCM plugin and Encoding (#22810)
by Jorn Tuyls
· 8 days ago
b1e3812
[Codegen][GPU] Preserve loop domain when collapsing dims in Conv to Matmul conversion (#22821)
by Vivian Zhang
· 8 days ago
196b716
Reland "[LLVMGPU] Unroll elementwise operations #21665" (#22828)
by Alan Li
· 8 days ago
82fc1ac
[Dispatch Creation] Don't fuse if there are no common parallel loops (#22819)
by Ian Wood
· 8 days ago
e2dc7a5
[LLVMGPU] Update seeds for scaled gemm (#22798)
by Muzammiluddin Syed
· 8 days ago
6d742a1
[GlobalOpt] Fix rank-reduced permutation in SinkTransposeThroughExtractSlice (#22754)
by Ziliang Zhang
· 8 days ago
f7e0280
[Codegen][GPU] Replace prefetch_shared_memory with prefetch_num_stages in IREEGPUAttrs (#22818)
by Zhuoran Yin
· 8 days ago
cdc5eee
[GPU] Fix alignment check for scaled matmul (#22737)
by Zhewen Yu
· 9 days ago
68a9309
Add SCF support and fence coverage to ElideTimepointsPass. (#22611)
by Ben Vanik
· 9 days ago
4eff28d
[Encoding] Remove unneeded command line option (#22816)
by Muzammiluddin Syed
· 9 days ago
3e65f15
[GPU] Add M dimension constraints for pingpong ukernel (#22801)
by Ian Wood
· 9 days ago
60058ec
[LLVMGPU] Fix lowering strategy for direct convolution (#22802)
by Vivian Zhang
· 9 days ago
91bf741
Add iree_status_t stack trace support on Linux. (#22796)
by Ben Vanik
· 9 days ago
a6f8dfe
[Codegen][LLVMGPU] Replace TransposeSharedMem pipeline (#21661)
by Quinn Dawkins
· 10 days ago
789b515
[HAL] fix IREE_HAL_MAX_QUEUES to be number of bits in queue affinity type (#22702)
by Stefan Schuermans
· 10 days ago
4928091
[Codegen] Support dynamic offsets in collapse_shape fusion to interface stores (#22800)
by Quinn Dawkins
· 10 days ago
97c9020
Bump llvm-project to @a7c1f467339abd1942c89f2ef8b79083e89e7dad (#22787)
by Max191
· 10 days ago
069c079
[LLVMGPU][Codegen] Reland "Emit packed chain FMA from select multi_reductions and contracts" (#22789)
by Eric Feng
· 10 days ago
5160310
[Codegen][GPU] Enable 3-stage pipelining with hipblaslt compute->write->read ordering (#22788)
by Zhuoran Yin
· 11 days ago
d344073
[CI] Update iree-org/iree-test-suites@132f91e4 (#22784)
by Eric Feng
· 11 days ago
d2117d7
[Codegen][GPU] Add fusion barrier after result promotion (#21709)
by Quinn Dawkins
· 11 days ago
08b6af6
build_tools: fix: ensure that iree-flatcc-cli and iree-c-embed-data are build for the target during cross-compilation (#22755)
by Florian Walbroel
· 11 days ago
caf2352
[Flow] Annotate scaled matmul dispatches (#22773)
by Zhewen Yu
· 12 days ago
cdcbfd3
ScheduleExecution enhancements for timeline-aware scheduling and SCF. (#22483)
by Ben Vanik
· 12 days ago
4bb0c12
[Bazel] Migrate to bzlmod for LLVM compatibility (#22771)
by maxbartel
· 2 weeks ago
6a73711
[tests][e2e] Add custom mxfp4 gemm test to verify shape of interest. (#22775)
by Muzammiluddin Syed
· 2 weeks ago
46fbe05
[Codegen][Tuner] Expose the python bindings for LinalgExt::inferScaledContractionDims and LinalgExt::isaScaledContractionOpInterface (#22763)
by Muzammiluddin Syed
· 2 weeks ago
af241f9
Integrate LLVM @ 356479191ca0 (#22772)
by Alan Li
· 2 weeks ago
fb8d0cc
[Input] Register IREETensorExtDialect for Torch plugin (#22719)
by Ian Wood
· 2 weeks ago
39a15a7
[Encoding] Implement compatibility check for packed_storage (#22757)
by Lukas Sommer
· 2 weeks ago
74ee8f2
Integrate llvm/llvm-project@ebf5d9ef (#22761)
by Vivian Zhang
· 2 weeks ago
34b8187
Bump version to 3.10 after 3.9 release. (#22759)
by Sahil Faizal
· 2 weeks ago
77873f7
Update gfx1250 LDS size (#22760)
by Ivan Butygin
· 2 weeks ago
b346a98
[DispatchCreation] Add FoldExtractSliceOfBroadcast Pattern (#22694)
by Bangtian Liu
· 2 weeks ago
a9cae0b
[Codegen] Test Cleanup 3/8: Common tests (#22746)
by Quinn Dawkins
· 2 weeks ago
edff002
Integrate llvm/llvm-project@c582688b (#22758)
by Vivian Zhang
· 2 weeks ago
645b446
Use `scf::tileAndFuseConsumer` in `GPUFuseAndHoistParallelLoops` (#22617)
by MaheshRavishankar
· 2 weeks ago
1f322ce
[Codegen] Test Cleanup 2/8: Common GPU tests (#22745)
by Quinn Dawkins
· 3 weeks ago
acefc23
[Codegen] Test Cleanup 5/8: LLVMCPU tests (#22748)
by Quinn Dawkins
· 3 weeks ago
2e40437
[Codegen] Test Cleanup 6/8: LLVMGPU tests (#22749)
by Quinn Dawkins
· 3 weeks ago
222940b
[Codegen] Test Cleanup 7/8: SPIRV tests (#22750)
by Quinn Dawkins
· 3 weeks ago
1a66819
[Codegen] Test Cleanup 4/8: Dialect tests (#22747)
by Quinn Dawkins
· 3 weeks ago
3b7ff2d
[Codegen] Test Cleanup 8/8: VMVX tests (#22751)
by Quinn Dawkins
· 3 weeks ago
abc8095
[CI] Bump golden value to 165*1.1=181.5 for prefill benchmark on mi325 (#22752)
by Han-Chung Wang
· 3 weeks ago
b9afdb9
[Codegen] Test Cleanup 1/8: Common CPU tests (#22744)
by Quinn Dawkins
· 3 weeks ago
8c9e329
Integrate llvm/llvm-project@778e104d (#22741)
by Vivian Zhang
· 3 weeks ago
a7b0d0b
Fix incompatible pointer types for macOS build. (#22738)
by Han-Chung Wang
· 3 weeks ago
8ae91eb
Bump actions/checkout from 5.0.1 to 6.0.0 in the github-actions group (#22742)
by dependabot[bot]
· 3 weeks ago
5f1ddc3
[TensorExt] Add Operations/Attributes/Interfaces for specifying ragged tensors. (#22267)
by MaheshRavishankar
· 3 weeks ago
843f9d1
[CI][TorchModels] Update flags for CLIP test. (#22413)
by MaheshRavishankar
· 3 weeks ago
a18c213
Update CODEOWNERS to add more reviewers for GPU codegen pieces (#22721)
by MaheshRavishankar
· 3 weeks ago
3483097
[Dispatch Creation] Add aggressive reshape movement flag (#22707)
by Ian Wood
· 3 weeks ago
9269e03
[Codegen][GPU]Fixing barrier placement for 3+ stages pipelining (#22725)
by Zhuoran Yin
· 3 weeks ago
a8f4791
Revert "[LLVMGPU][Codegen] Emit packed chain FMA from select multi_reductions and contracts" (#22736)
by Han-Chung Wang
· 3 weeks ago
Next »