opensecura / 3p / openxla / iree / 206b60ca59c9dbbca5769694df4714c38cecaced
206b60c
[DispatchCreation] Extend multi-use producer fusion (#18551)
by Ian Wood
· 6 months ago
c6056d1
[docs] Update and harmonize guids for deployment (#18762)
by Marius Brehler
· 6 months ago
f9fa934
Integrates LLVM @ a758bcdbd92efb64a3482eb95d2769d74e33f5bb (#18783)
by Vivian
· 6 months ago
2945399
Revert `tensor.cast` to flow reshape conversion (#18772)
by Ian Wood
· 6 months ago
9f93073
[CMake] Don't update compile definitions for imported targets for MSCV (#18766)
by Maksim Levental
· 6 months ago
6295784
Bump torch-mlir to 45bb17e (#18782)
by zjgarvey
· 6 months ago
3ccd4f1
[Codegen] Replace LICM with a version that checks trip count (#18679)
by Quinn Dawkins
· 6 months ago
a3d8ad6
[ROCM] Fix feature flags for gfx1100 and improve flag handling (#18781)
by Kunwar Grover
· 6 months ago
afe18d2
[Codegen][GPU] Add tiling cleanup pattern to fuse pad without zero gaurd (#18748)
by Kunwar Grover
· 6 months ago
7622770
Integrate LLVM @ 7900daaa7ba57b5f9729bbbdb54f4e0599a45cd7 (#18773)
by Vivian
· 6 months ago
ab88871
Add a default lowering config setting for custom_op. (#18737)
by MaheshRavishankar
· 6 months ago
6d8f0b2
[LLVMGPU] Add configuration tests for IGEMM, fix NCHW case (#18734)
by Max191
· 6 months ago
2eddaea
[IGEMM] Generate matmuls with expanded H and W dims (#18735)
by Max191
· 6 months ago
c3d74d0
[Flow] Fold flow reshape with mismatching dyn dims (#18680)
by Ian Wood
· 6 months ago
3e52344
[LLVMCPU] Enable tileDispatchUsingForall for multiTilingExpert (#18730)
by Prashant Kumar
· 6 months ago
1e6bbb8
[LinalgExt] Remove default implementation for getStaticLoopRanges (#18745)
by Kunwar Grover
· 6 months ago
d7378bb
[Codegen][GPU] Add pass for fallback distribution (#18726)
by Quinn Dawkins
· 6 months ago
1f3382d
Bump torch-mlir to ab62f35373c3944b68e564214fd04fff39dd92fc (#18763)
by saienduri
· 6 months ago
1e155cc
Use integer range and divisibility analysis to propagate int assumptions into dispatch executables. (#18755)
by Stella Laurenzo
· 6 months ago
0c69c9e
Adding `iree_vm_context_fork` to fork a context. (#18751)
by Ben Vanik
· 6 months ago
25e5760
[Codegen] Remove unused arguements from attention op (#18743)
by Kunwar Grover
· 6 months ago
726112d
[Codegen] Remove memref optimizations from OptimizeTensorInsertExtractSlices (#18732)
by Kunwar Grover
· 6 months ago
660f388
Enable analysis based integer optimizations. (#18756)
by Stella Laurenzo
· 6 months ago
90a2e1c
[NFC][GPU] remove some dead code in setTileAndFuseLoweringConfig (#18754)
by Nirvedh Meshram
· 6 months ago
164a60e
[ROCM] Disable mixed precision fma instructions that cause numeric issues (#18753)
by Nirvedh Meshram
· 6 months ago
9aa11e5
Bump torch-mlir to 2665ed343b19713ba5c1c555b2366a93de8b9d2b (#18750)
by saienduri
· 6 months ago
ea52c36
Adding `IREE_STATUS_INCOMPATIBLE`. (#18752)
by Ben Vanik
· 6 months ago
dd3f2a3
[Codegen] Remove old attention transformations (#18740)
by Kunwar Grover
· 6 months ago
e50eda3
Integrate LLVM at f8b7a653
by Alan Li
· 6 months ago
fa670d6
[docs] Further improve Python API docs (#18667)
by Marius Brehler
· 6 months ago
5f3f863
Add `AggregatedOpInterface` to `iree_linalg_ext.custom_op` (#18700)
by MaheshRavishankar
· 6 months ago
598a60e
[Im2col] Allow multiple batch, M, and K dimensions on im2col result (#18593)
by Max191
· 6 months ago
ce4f098
[Codegen] Use constant int matchers for interface reshape foldings (#18733)
by Max191
· 6 months ago
d019851
Bump torch-mlir to 94f54109134506005052632af96944ca24068f72. (#18715)
by saienduri
· 6 months ago
01b9521
Add `TilingInterface` implementation of `iree_linalg_ext.custom_op`. (#18676)
by MaheshRavishankar
· 6 months ago
2963600
Integrate LLVM at f59b151f
by Alan Li
· 6 months ago
eb15493
e2e matmul test improvements (#18725)
by Benoit Jacob
· 6 months ago
5270093
Add an integer divisibility analysis. (#18727)
by Stella Laurenzo
· 6 months ago
1b719b3
[Codegen][GPU] Do not promote fill matmul operands (#18722)
by Quinn Dawkins
· 6 months ago
0e16a89
[Codegen][GPU] Disable consumer fusion for multi use cases (#18723)
by Quinn Dawkins
· 6 months ago
0f28d44
Use upstream dataflow tooling to build an arithmetic opt pass. (#18702)
by Stella Laurenzo
· 6 months ago
0889d13
Add the support to yield multiple results (#18717)
by Prashant Kumar
· 6 months ago
7fb28e0
[Codegen][GPU] Fix delinearized index order in forall resolution (#18724)
by Quinn Dawkins
· 6 months ago
4636257
[Encoding][NFC] Moving Encoding attr/enum to Encoding[Types|Attrs].* (#18711)
by Han-Chung Wang
· 6 months ago
e8ff07e
[Codegen][GPU] Stop using redundant transfer hoisting pattern on memrefs (#18721)
by Quinn Dawkins
· 6 months ago
6c095ed
Revert "Revert "[CPU] Enable tileDispatchUsingForall for mmt4d and co… (#18718)
by Prashant Kumar
· 6 months ago
8f3f1e2
[docs] Remove call to `t()` in external param example. (#18706)
by Vinayak Dev
· 6 months ago
e8ab5f8
Integrate LLVM at 634c57d7
by Alan Li
· 6 months ago
f6a9b6b
[Codegen][GPU] Enable destination fusion for unit trip loops (#18674)
by Quinn Dawkins
· 6 months ago
ad68964
[LLVMGPU] Pad to intrinsic shape in LLVMGPUPadAndVectorDistribute pipeline (#18632)
by Kunwar Grover
· 6 months ago
6001f9c
Fix distribution logic when number of parallel loops is greater than 3 (#18714)
by MaheshRavishankar
· 6 months ago
5b0680d
[Codegen] Add option to disable copy vectorization (#18673)
by Quinn Dawkins
· 6 months ago
cc3b28f
[Codegen][GPU] Improve loop fusion pattern verification (#18671)
by Quinn Dawkins
· 6 months ago
88cb0ab
Add @bjacob and myself to Encoding dialect owners. (#18712)
by Han-Chung Wang
· 6 months ago
62f8d7e
Revert "[CPU] Enable tileDispatchUsingForall for mmt4d and convolution pipelines. " (#18707)
by Han-Chung Wang
· 6 months ago
8c9f2cb
Integrate with LLVM 20241007
by Alan Li
· 6 months ago
691aa4f
Bump llvm/torch-mlir@e9ed4af9ced23c201 (#18699)
by Rob Suderman
· 6 months ago
23b63cd
[GPU][DT] Add e2e matmul tests for GPU data tiling (#18627)
by Max191
· 6 months ago
65158ac
Rework util.assume.* ops to util.assume.int and base on attributes. (#18703)
by Stella Laurenzo
· 6 months ago
caacf6c
Honor narrow dims in GPUMaterializeEncoding (#18622)
by Benoit Jacob
· 7 months ago
bb5f2f5
[CPU] Enable tileDispatchUsingForall for mmt4d and convolution pipelines. (#18618)
by Prashant Kumar
· 7 months ago
126e334
[CPU][NFC] Collapsing methods that query a config is enabled. (#18686)
by Han-Chung Wang
· 7 months ago
4710c98
Integrate LLVM at 208f42fd (#18696)
by Benoit Jacob
· 7 months ago
b89ba05
[CPU] Switching to linalg::LinalgOp for MaterializeContractionOp pattern (#18690)
by Han-Chung Wang
· 7 months ago
3801a5d
[GPU][DT] Add support for materializing gemm with batch dimensions. (#18693)
by Han-Chung Wang
· 7 months ago
58dac26
[Encoding] Introduce getElementTypesArray helper in EncodingAttr. (#18692)
by Han-Chung Wang
· 7 months ago
f7a02a0
[DT] Use createOrFold to simplify the logic of creating tensor.cast op. (#18691)
by Han-Chung Wang
· 7 months ago
067ba0e
[GPU][NFC] Updates comments/style/TODO/debug_message for GPU data-tiling (#18688)
by Han-Chung Wang
· 7 months ago
79e979f
Legalize newly added compiler hints ops. (#18689)
by Stella Laurenzo
· 7 months ago
a9f5628
Limit transposition of narrow-N to only CPUMaterializeEncoding (#18684)
by Benoit Jacob
· 7 months ago
c7a9af7
Integrate Stablehlo at d40285e (#18685)
by Benoit Jacob
· 7 months ago
f39884d
Cherry-pick stablehlo build fix for MSVC. (#18683)
by Scott Todd
· 7 months ago
747081a
[Codegen][GPU] Force linalg ops to always use the same vector size (#18654)
by Quinn Dawkins
· 7 months ago
0e8a573
Adding vm.cast.f32.si64 and vm.cast.f32.ui64 ops (#18642)
by Nirvedh Meshram
· 7 months ago
a6043e2
Adding a null HAL driver. (#18675)
by Ben Vanik
· 7 months ago
0bbc432
[VectorDistribution]Add distribution pattern and test mlir file for vector.gather (#18660)
by Bangtian Liu
· 7 months ago
88153eb
[Codegen][GPU] Fuse into destinations for parallel tiling (#18666)
by Quinn Dawkins
· 7 months ago
24ee841
[Codegen][GPU] Fix alloc creation for dynamic outputs in loop fusion (#18665)
by Quinn Dawkins
· 7 months ago
42ca044
Integrate LLVM at e8137503 (#18678)
by Benoit Jacob
· 7 months ago
718b4fd
[docs][pytorch] Add examples for compiling with external weights. (#18658)
by Vinayak Dev
· 7 months ago
206c1f2
[Codegen] Allow vectorizing linalg.copy ops on memrefs (#18672)
by Quinn Dawkins
· 7 months ago
903ab0a
Integrate LLVM at 9fa55ec3 (#18670)
by Benoit Jacob
· 7 months ago
cd48b10
[NFC] Delete dead ops after cloning (#18669)
by Ian Wood
· 7 months ago
7a2705d
Bump stablehlo to `f7f8e4e35` and drop LLVM local reverts (#18668)
by Benoit Jacob
· 7 months ago
d341128
[ExternalInterfaces] Make fill non-hoistableLeafOp, hoist linalg init operands (#18634)
by Max191
· 7 months ago
66c3397
[docs] Update Python API docs (#18662)
by Marius Brehler
· 7 months ago
916bb88
Stopped threads from holding a reference to themselves. (#18636)
by Andrew Woloszyn
· 7 months ago
84ac47b
[LLVMGPU] Switch LLVMGPUVectorDistribute to use iree_gpu.lowering_config (#18651)
by Kunwar Grover
· 7 months ago
462ecb6
[torch] Materialize all derivable bounds and divisor information in the IR. (#18646)
by Stella Laurenzo
· 7 months ago
8de9856
[LinalgExt] Add Interfaces for implementing fusion support for `iree_linalg_ext.custom_op`. (#18647)
by MaheshRavishankar
· 7 months ago
451ef71
[Codegen] Add pass for unrolling annotated for loops (#18641)
by Quinn Dawkins
· 7 months ago
9c39a29
[Codegen][GPU] Fix forall hositing arg use in single trip loops (#18657)
by Nirvedh Meshram
· 7 months ago
c86b621
Switch build_package to arm-hosted runner. (#18656)
by Scott Todd
· 7 months ago
0a71ea5
[Codegen][GPU] Add iree_gpu.multi_mma op to PartitionableLoopsInterface (#18653)
by Max191
· 7 months ago
e45c570
Switch linux_arm64_clang workflow to arm-hosted runner. (#18643)
by Scott Todd
· 7 months ago
839f7f6
[GPU] Use shared memory for data tiled multi_mma ops (#18625)
by Max191
· 7 months ago
4d31d89
[LinalgExt] Avoid rank reduction in Im2Col lowering pattern (#18637)
by Quinn Dawkins
· 7 months ago
0d65b6e
Integrate LLVM at `a86e966a` (#18644)
by Benoit Jacob
· 7 months ago
a7d84f9
[ROCm] Fix known target info for MI300A (#18648)
by Jakub Kuderski
· 7 months ago
20a7638
[ROCm] Always require `--iree-hip-target` (#18645)
by Jakub Kuderski
· 7 months ago