Sign in
opensecura
/
3p
/
openxla
/
iree
/
c285d58acce8b454a6570a0fe462319bf102d4a0
c285d58
Copy sample code into samples/dynamic_shapes/README.md. (#19699)
by Scott Todd
· 8 weeks ago
3c95042
Re-enable MI250 workflows. (#19705)
by saienduri
· 8 weeks ago
27e7a90
[DT][Encoding] Use layouts to calculate storage size when it is present. (#19686)
by Han-Chung Wang
· 8 weeks ago
a953763
Temporarily Disable MI250 workflow due to machine outage (#19702)
by Akansha Bansal
· 8 weeks ago
c320935
Bump dawidd6/action-download-artifact from 3.1.4 to 7 in the github-actions group (#19692)
by dependabot[bot]
· 8 weeks ago
6fd0fd0
[LinalgExt] Implement PartialReductionOpInterface for OnlineAttentionOp (#19684)
by Kunwar Grover
· 8 weeks ago
3c963dd
Update PyTorch sample notebooks using latest iree-turbine code. (#19658)
by Scott Todd
· 8 weeks ago
01c9f14
[LLVMGPUVectorDistribute] Add support for inter-subgroup multi_reduction (#19596)
by Manupa Karunaratne
· 8 weeks ago
21b0101
[GPU] Disable prefetching for loops with no computation (#19695)
by Nirvedh Meshram
· 8 weeks ago
8d1d867
[GPU] Add thread tile size inference for scatter (#19694)
by Quinn Dawkins
· 8 weeks ago
158c636
Revert "Increase default threshold of TileLargeTensor pass (#19671)" (#19693)
by Nirvedh Meshram
· 9 weeks ago
3e34e03
Bump the github-actions group with 8 updates (#19689)
by dependabot[bot]
· 9 weeks ago
3978ce6
Increase default threshold of TileLargeTensor pass (#19671)
by Nirvedh Meshram
· 9 weeks ago
2452b22
[Codegen][GPU] Let integer range optimization narrow GPU computations to i32 (#19473)
by Krzysztof Drewniak
· 9 weeks ago
2b29155
Update GH actions with Dependabot (#19663)
by Marius Brehler
· 9 weeks ago
9b35412
Run on schedule in iree-org only (#19685)
by Marius Brehler
· 9 weeks ago
d90c505
Reshape propagation to enable broadcast(transpose) -> attention(q, kt, vt) fusion. (#19661)
by MaheshRavishankar
· 9 weeks ago
cac7a96
Update IREE test suite to use iree-org/iree-test-suites@c47d13c (#19617)
by MaheshRavishankar
· 9 weeks ago
40c19e3
Better support multidevice placement with `stream.async.barrier` (#19651)
by Rob Suderman
· 9 weeks ago
88d5f59
Update PkgCI test_amd to use MI300x conductor cluster (#19517)
by yamiyysu
· 9 weeks ago
ae50c5e
[DOCS] Update VectorExt::NestedLayoutAttr docs (#19246)
by Manupa Karunaratne
· 9 weeks ago
1441caa
Enable macOS Tracy CI build. (#19668)
by Scott Todd
· 9 weeks ago
a583b25
[GPU] Teach GPUApplyTilingLevel PartialReduction tiling (#19682)
by Kunwar Grover
· 9 weeks ago
9f93691
[LLVMGPU] Use LLVMGPUDistribute for small input scatters (#19670)
by Quinn Dawkins
· 9 weeks ago
f7a2157
Remove Upcasting schedule from TileAndFuse (#19669)
by Nirvedh Meshram
· 9 weeks ago
039b8b4
Using tracy::GetQueue instead of the sketchy static variable reference. (#19653)
by Ben Vanik
· 9 weeks ago
1d91bec
Supporting file descriptors in iree_io_stream_open. (#19665)
by Ben Vanik
· 9 weeks ago
106371d
Bump torch-mlir to f92c587cb6150e73078f32cf847dc3892be16f93 (#19659)
by jinchen
· 9 weeks ago
a88555c
Add macOS workflow running on M1 (#19656)
by Marius Brehler
· 9 weeks ago
e64cb12
Increase strictness of global isel use for ROCM (#19247)
by Tres
· 9 weeks ago
2aca091
[Codegen][Nearly NFC] Move PropagateDispatchSizeBounds to Common/ (#19650)
by Krzysztof Drewniak
· 9 weeks ago
6245db1
[Stream] Attach layouts to tensor ops in encoding specialization pass. (#19649)
by Han-Chung Wang
· 9 weeks ago
c793f90
[i1] Implement `packed_storage` layout encoding attribute (#19354)
by lialan
· 9 weeks ago
801e2c1
Expand runtime_tracing job to include Windows and macOS. (#19655)
by Scott Todd
· 9 weeks ago
7d21c5d
Revert (2nd) of "Propagate reshapes through generics with reduction" (#19647)
by MaheshRavishankar
· 9 weeks ago
b3ff1ed
Rename `unroll_{m,n,k}` to `intrinsics_{m,n,k}` (#19652)
by Benoit Jacob
· 9 weeks ago
6d6bd6e
[runtime] Fix runtime tracing compile failure on gcc (#19642)
by Ian Wood
· 9 weeks ago
bb1c561
Erase all address spaces and get inlined ukernels (#19646)
by Benoit Jacob
· 9 weeks ago
a7bac5d
[Flow] Fix dispatch naming for dynamic shaped fusions (#19439)
by Quinn Dawkins
· 9 weeks ago
9055c9d
[hip] Fix race in the cleanup of queue read operations. (#19645)
by Andrew Woloszyn
· 9 weeks ago
82e37d6
Fix (cross) compiling for 32-bit targets (#19644)
by Marius Brehler
· 9 weeks ago
02d145e
[Stream] Implement SpecializeEncodings pass (1/n) (#19502)
by Han-Chung Wang
· 9 weeks ago
74f8d3c
[LinalgExt] Scatter fusion by expansion 3/3 (#19588)
by Ian Wood
· 9 weeks ago
2347d9f
Supporting (and renaming) IREE_HAL_WHOLE_BUFFER in binding table resolve. (#19640)
by Ben Vanik
· 9 weeks ago
126f0ac
Add docs for updating release git tags manually. (#19637)
by Scott Todd
· 9 weeks ago
af416b3
Bump version to 3.2.0 after releasing 3.1.0. (#19638)
by Scott Todd
· 9 weeks ago
c484058
[GPU] Add barriers when resolving GPUMappedForall to fix race condition (#19635)
by Nirvedh Meshram
· 9 weeks ago
9b4906e
[DispatchCreation] Drop fusion restriction for stride != 1 conv (#19634)
by Quinn Dawkins
· 9 weeks ago
c75b686
[GPU][Codegen] Allowing mfma for narrow problem config sizes (#19615)
by Zhuoran Yin
· 9 weeks ago
7b9aa28
When dumping intermediates, dump how to reproduce the `.optimized.ll` (#19633)
by Benoit Jacob
· 9 weeks ago
be75a30
Update minor Python versions used to build packages (#19632)
by Marius Brehler
· 9 weeks ago
fb21dd6
Adding experimental Tracy API for TLS-less event recording. (#19625)
by Ben Vanik
· 9 weeks ago
a5c3879
Reapply "Propagate reshapes through generics with reduction… (#18968)
by Ian Wood
· 9 weeks ago
80cbf6b
[GPU] Add a pass to convert accumulating GEMMs to GEMMs (#19587)
by Nirvedh Meshram
· 9 weeks ago
550d88e
[GPU] Add lowering configuration logic for scatter (#19624)
by Quinn Dawkins
· 9 weeks ago
349026b
Add explicit tolerances to SDXL benchmark test times. (#19628)
by Scott Todd
· 9 weeks ago
9a83239
[GPU] Add chained reshape support for scf.forall expand destination pattern (#19597)
by Nirvedh Meshram
· 9 weeks ago
7047cc3
Rollup of minor runtime fixes/cleanup from the AMDGPU branch. (#19621)
by Ben Vanik
· 9 weeks ago
aa06523
[NFC] Comment fixes in iree_bitcode_library.
by Ben Vanik
· 9 weeks ago
66723e4
Cleaning up null HAL driver options.
by Ben Vanik
· 9 weeks ago
2199c1d
Adding iree_arena_block_pool_preallocate.
by Ben Vanik
· 9 weeks ago
ea462c8
Removing some IREE_RETURN_AND_END_ZONE_IF_ERROR usage that was ugly.
by Ben Vanik
· 9 weeks ago
4a04c0a
Adding minor iree/base/ time, string view, and memory utilities.
by Ben Vanik
· 9 weeks ago
a8f7a32
Adding iree_hal_queue_affinity_* utilities.
by Ben Vanik
· 9 weeks ago
c9fb739
Fixing HAL driver CTS test to not assume numerical indices exist.
by Ben Vanik
· 9 weeks ago
1ccabe5
Adding COMPILER_TARGET_DEVICE to iree_hal_cts_test_suite.
by Ben Vanik
· 9 weeks ago
d517661
[runtime][python] Add debug sink to bindings (#19013)
by Boian Petkantchin
· 9 weeks ago
c97b084
Including the .kd symbol suffix in AMDGPU executables.
by Ben Vanik
· 9 weeks ago
d224220
Bump LLVM to llvm/llvm-project@21edac25f09faee23015c6a69d95fcbda287efe2 (#19616)
by MaheshRavishankar
· 9 weeks ago
b245e6b
Delete test_models job using SHARK-TestSuite/iree_tests. (#19614)
by Scott Todd
· 10 weeks ago
1445cef
Set MLIR_LINK_MLIR_DYLIB to not link shared libMLIR (#19613)
by Marius Brehler
· 10 weeks ago
340ffbb
[LinalgExt] Drop the unit dims on scatter ops 2/3 (#19450)
by Ian Wood
· 10 weeks ago
0820f10
[hip] Don't join the status in dispatch_thread. (#19583)
by Andrew Woloszyn
· 10 weeks ago
cdf24b9
[Dispatch] Two fixes for CollapseDimensionsPass (#19598)
by Ian Wood
· 10 weeks ago
763406f
[Codegen][Tuner] skip linking based on the default entry point attribute (#19603)
by Bangtian Liu
· 10 weeks ago
c992d29
[runtime][hip] Fix format errors and conflicting types. (#19607)
by Han-Chung Wang
· 10 weeks ago
e7d4fec
Bump to LLVM with nanobind reverts (#19605)
by Marius Brehler
· 10 weeks ago
71d6de7
[Flow] Improve DeduplicateExecutables bucketing (#19601)
by Ian Wood
· 10 weeks ago
6b64fca
[LinalgExt] Fusion support for LinalgExt ScatterOp 1/3 (#19560)
by Ian Wood
· 10 weeks ago
9cb984f
[Codegen] Fix use after erase bug in optimize-tensor-extract-slices (#19460)
by Kunwar Grover
· 10 weeks ago
89afd4e
Add missing definition of `build` method in `Flow::CallOp` (#19552)
by Twice
· 10 weeks ago
0c6d267
Fix typo in serializeExecutable of CUDA target (#19609)
by Twice
· 10 weeks ago
c7086cf
Bump LLVM to llvm/llvm-project@9f5cefe (#19600)
by MaheshRavishankar
· 10 weeks ago
b5272df
[docs] Update copyright year to 2025 in the website footer. (#19599)
by Scott Todd
· 10 weeks ago
9df82fd
[LinalgExt] Add support for fusing scatter with producers (#19584)
by Quinn Dawkins
· 10 weeks ago
5a97523
[GPU] Disable insert/extract slice lowering from pack/unpack by default (#19590)
by Zhuoran Yin
· 10 weeks ago
1e935c4
[PJRT] Fix ninja not found error while re-building the plugin (#19553)
by Twice
· 10 weeks ago
c2d408f
Update IREE third-party/benchmark for RISC-V Compatibility (#19538)
by Phoebe Chen
· 10 weeks ago
e6ac016
[Codegen][Tuner] verifier for the default tuning spec (#19525)
by Bangtian Liu
· 2 months ago
26b24f2
Bump LLVM to llvm/llvm-project@cbff02b (#19589)
by MaheshRavishankar
· 2 months ago
fc6c518
[docs] Fix diagram rendering on C API reference page. (#19586)
by Scott Todd
· 2 months ago
88848b4
[runtime][hip] Do not error when peered memory access already enabled (#19582)
by Boian Petkantchin
· 2 months ago
c203e6b
[LLVMGPU] Prefer non-scatter ops as the configuration root (#19581)
by Quinn Dawkins
· 2 months ago
fa325c5
[infra] Add `--force` flag to `git submodule update` in script. (#19550)
by Scott Todd
· 2 months ago
f27feff
Update LLVM to llvm/llvm-project@ac8bb735 (#19566)
by Stanley Winata
· 2 months ago
a43d893
[Dispatch] Disable scatter fusion with producers (#19565)
by Ian Wood
· 3 months ago
d746a57
Remove revert for https://github.com/llvm/llvm-project/pull/120115 (#19567)
by Kunwar Grover
· 3 months ago
f1e1866
Update LLVM to llvm/llvm-project@b13592219c421820b (#19554)
by Stanley Winata
· 3 months ago
1f19761
Enable peering among all devices on the system. (#19555)
by Andrew Woloszyn
· 3 months ago
0184eee
[Codegen][RoCDL] Add patterns for lowering bit-width emulation operations to LLVM (#19551)
by MaheshRavishankar
· 3 months ago
Next »