opensecura / 3p / openxla / iree / 5b243a84fc8374181c2fe5c2aaeccf27ad54316c
5b243a8
[Backend][ROCM] Add gfx1150 support. (#17508)
by Stanley Winata
· 10 months ago
aa0bc40
[Codegen][GPU] Add pass to fuse and hoist scf.forall ops (#17505)
by Quinn Dawkins
· 10 months ago
29e70ab
Update onnx package version minimum to 1.16.0. (#17504)
by Scott Todd
· 10 months ago
a6a56a9
Add `LinalgFusionInterface` to support fusion for linalg_ext ops (added `scatter` and `reverse`) (#17428)
by Ian Wood
· 10 months ago
3d1364e
[Codegen][GPU] Add pattern to lower iree_gpu.multi_mma to intrinsics (#17457)
by Quinn Dawkins
· 10 months ago
ab8f668
Revert "Data tiling: transpose narrow-N into narrow-M" (#17503)
by Benoit Jacob
· 10 months ago
e33ca89
[LinalgExt] Split TileAndDecomposeAttention (#17468)
by Kunwar Grover
· 10 months ago
322d688
[Codegen][GPU] Add pattern to drop lead unit dims of multi_mma ops (#17456)
by Quinn Dawkins
· 10 months ago
117cb43
Test 'console' provider in 'tracing' job. (#16454)
by Scott Todd
· 10 months ago
16bdaa9
Data tiling: transpose narrow-N into narrow-M (#17446)
by lialan
· 10 months ago
6c75aa1
[Codegen][GPU] Allow iree_gpu.tensor_barrier to take vectors (#17479)
by Quinn Dawkins
· 10 months ago
1750e2b
Integrate LLVM at 855eef2abd81cb8c7543d4748353d5e378fdd4c2 (#17501)
by Benoit Jacob
· 10 months ago
051c361
NFC: Make a few loop transformations more accessible (#17489)
by Quinn Dawkins
· 10 months ago
9e3d27a
Upgrade to nanobind 2.0. (#17497)
by Stella Laurenzo
· 10 months ago
cad02f9
[Codegen][GPU] Add unrolling pattern for iree_gpu.multi_mma (#17454)
by Quinn Dawkins
· 10 months ago
46c6bf5
[CPU] Add support for pack ukernel preparation. (#17472)
by Han-Chung Wang
· 10 months ago
3d6a8ee
Bump Tracy to https://github.com/wolfpld/tracy/commit/cf2344111. (#17488)
by Scott Todd
· 10 months ago
abdf550
Update IREE onnx import to be in sync with Torch-MLIR (#17476)
by saienduri
· 10 months ago
a842527
[Codegen][GPU] Drop dead PassDetail.h file (#17490)
by Quinn Dawkins
· 10 months ago
440c870
Bump torch-mlir to 5bb1a65 on 2024-05-23 (#17483)
by zjgarvey
· 10 months ago
63dff03
[Codegen][GPU] Add iree_gpu.tensor_barrier op (#17478)
by Quinn Dawkins
· 10 months ago
31e1a30
[Codegen][GPU] Add dictionary based lowering config attribute (#17463)
by Quinn Dawkins
· 10 months ago
3a2617f
[runtime][hip][cuda] Fix semaphore multi-wait, action GPU events and cleanup (#17213)
by Boian Petkantchin
· 10 months ago
ea7d01e
Pin nanobind to 1.9.2 to defer 2.0.0 API changes. (#17481)
by Scott Todd
· 10 months ago
fe3fb24
Allow passwordless sudo in docker images (#17473)
by Boian Petkantchin
· 10 months ago
008add9
[CodeGen][NFC] Rename DecomposeBatchMmt4DOps to CPUPrepareUkernels. (#17471)
by Han-Chung Wang
· 10 months ago
30e0238
Bump LLVM to llvm/llvm-project@bd3f5a4bd3d9d7ee8ae801c24c5081073b20abd4 (#17470)
by MaheshRavishankar
· 10 months ago
9fe159d
[LinalgExt] Generalize attention tiling interface implementation (#17408)
by Kunwar Grover
· 10 months ago
900ec67
Split benchmark jobs into their own independent workflow file. (#17400)
by Scott Todd
· 10 months ago
f7ca45d
[ArmSME][test] Enable TransposeMatmulPass and peeling for e2e matmuls (#17452)
by Benjamin Maxwell
· 10 months ago
1316c92
[Codegen] NFC: Move the lowering config to an attribute interface (#17439)
by Quinn Dawkins
· 10 months ago
e36b355
Update integrate branch and title regexes for new naming. (#17464)
by Scott Todd
· 10 months ago
db8b536
Bump torch-mlir to b870729efe5929b1ee6ff1c7b27d4d1857cdacc7 on 2024-05-21 (#17460)
by zjgarvey
· 10 months ago
02c660c
Log compile and run commands on successful model tests too. (#17290)
by Scott Todd
· 10 months ago
de5760d
Bump LLVM to llvm/llvm-project@1727594 (#17459)
by MaheshRavishankar
· 10 months ago
7813fd3
[CPU] Fix a distribution bug and limiting distribution tile sizes. (#17436)
by Han-Chung Wang
· 10 months ago
d4aa849
[CPU] Add support for pack/unpack ukernels enablement on llvm-cpu path. (#17427)
by Han-Chung Wang
· 10 months ago
9f59514
Add AVX-512 pack ukernel tile function for `16x2xbf16`. (#17432)
by Benoit Jacob
· 10 months ago
01b020e
Re-enable w7900 jobs. (#17445)
by saienduri
· 10 months ago
6c5198d
Folding no-op stream.async.update ops away. (#17458)
by Ben Vanik
· 10 months ago
006af5d
[GPU] Support specifying LLVMGPU backend target features (#17451)
by Lei Zhang
· 10 months ago
a36773a
[Codegen][GPU] Add vectorization pattern for iree_gpu.multi_mma (#17453)
by Quinn Dawkins
· 10 months ago
f6a38ac
[GPU] Thread through a common target description (#17217)
by Lei Zhang
· 10 months ago
62a996b
[Codegen] Add lane distribution for scf.forall (#17373)
by Quinn Dawkins
· 10 months ago
080b1fa
[Codegen][GPU] Add a contraction like operation for mma intrinsics (#17374)
by Quinn Dawkins
· 10 months ago
29d0ceb
Enable a test suite for convolution + winograd. (#17447)
by Han-Chung Wang
· 10 months ago
e0f3c05
[Codegen][GPU] Change iree_gpu.shuffle_tensor to take a region for the read (#17425)
by Quinn Dawkins
· 10 months ago
dc61fcc
Register ShapeDialect in StableHLO plugin. (#17444)
by Scott Todd
· 10 months ago
2a2a4d0
Update various deps to their latest commits. (#17442)
by Scott Todd
· 10 months ago
a3b74bc
[CPU][ArmSME] Update tiling to use all SME accumulators (#16389)
by Benjamin Maxwell
· 11 months ago
4132d2e
[runtime][hal][hip] Implement collectives via RCCL (#17270)
by Boian Petkantchin
· 11 months ago
f849e2f
Integrate LLVM at `502ccd81` (clean) (#17429)
by Ingo Müller
· 11 months ago
98973b3
Add tip for adding new signing key to github (#17420)
by Kunwar Grover
· 11 months ago
6d95f8c
Integrate LLVM at `74a87548` (clean) (#17423)
by Ingo Müller
· 11 months ago
4f8ee51
Moving demotion/promotion passes to input conversion. (#17422)
by Ben Vanik
· 11 months ago
dece30e
[CPU] Do not decompose pack/unpack ops on x86 backends. (#17366)
by Prashant Kumar
· 11 months ago
218b934
Support GGUF version 2 as well as 3. (#17319)
by Scott Todd
· 11 months ago
c1fdd75
Introduce new logo assets. (#17424)
by Scott Todd
· 11 months ago
f2fcbbf
[iree][global] Add conv2d op to demote to bf16 pass (#17410)
by Prashant Kumar
· 11 months ago
3b5b70a
Integrate LLVM at `1650f1b3` (clean) (#17418)
by Ingo Müller
· 11 months ago
b4fc0b4
Implementing the f64 VM extension and flipping the flag by default. (#17416)
by Ben Vanik
· 11 months ago
b716704
Update git-clang-format ref and clang-format version. (#16792)
by Scott Todd
· 11 months ago
8fcab13
[Flow] Improve annotation name for conv (#17417)
by MaheshRavishankar
· 11 months ago
a19fa24
Add tips on signing commits for the DCO check. (#17412)
by Scott Todd
· 11 months ago
356e2b7
[Codegen] Add op for flattening warp and thread ids of forall ops (#17368)
by Quinn Dawkins
· 11 months ago
b17410c
Integrate LLVM at `fb9a028b` (clean) (#17411)
by Ingo Müller
· 11 months ago
90db41a
[LLVMGPU] Add Winograd pipeline for LLVMGPU (#17302)
by Max191
· 11 months ago
05d5710
Integrate LLVM at `c5e67b86` (+1 local revert) (#17409)
by Ingo Müller
· 11 months ago
4021109
[Winograd] Add filtering by annotations for Winograd rewrites (#17332)
by Max191
· 11 months ago
0260947
[GlobalOpt] Simplify the logic used to pick the groups. (#17405)
by MaheshRavishankar
· 11 months ago
bf0fbf0
Fix typo in community/blog/posts/mmt4d.md (#17406)
by Bruce Lai
· 11 months ago
9a294eb
[Winograd] Use output_tile_size for more static output transform tiling (#17200)
by Max191
· 11 months ago
748db31
Fuse Generic Ops Generated by `gather` Lowering (#17341)
by Ian Wood
· 11 months ago
428adf2
[LLVMGPU] Add debug prints for vector distribution config (#17404)
by Jakub Kuderski
· 11 months ago
ecc6983
Drop Tracy from CI benchmarks. (#17383)
by Scott Todd
· 11 months ago
78f5e8d
Integrate torch-mlir@ec6d7aa onnx.resize op (#17358)
by Chi_Liu
· 11 months ago
2a8d681
[CPU] Remove CPUDoubleTilingPeelingExpert (#17329)
by Andrzej Warzyński
· 11 months ago
b0f5521
[GitHub] Add Jakub to codeowners for SPIR-V/Vulkan and LLVMGPU/ROCm (#17399)
by Jakub Kuderski
· 11 months ago
3bac7ec
Add math expand patterns pass (#17395)
by jinchen
· 11 months ago
9f0282b
Fixes double-free in ReorderBroadcastInDimOpAndElementwiseOp. (#17394)
by Ben Vanik
· 11 months ago
29a12f3
[Preprocessing] Remove `input=none` option from TransposeMatmulPass (#17364)
by Benjamin Maxwell
· 11 months ago
a78cee1
Add support for serializing the textual representation of LLVM IR. (#17193)
by Phoebe Chen
· 11 months ago
8d8d18c
[LinalgExt] Simplify Attention unit tests (#17393)
by Kunwar Grover
· 11 months ago
a8404a8
[LLVMGPU] Preserve config dictionary during MapNestedForallToGpuThreadsOp application (#17381)
by Kunwar Grover
· 11 months ago
2ed4778
Integrate LLVM at `a1d43c14d` (+1 revert) (#17380)
by Benoit Jacob
· 11 months ago
06eb43d
Use coalesce loops (#17314)
by MaheshRavishankar
· 11 months ago
01ef465
Bump LLVM to llvm/llvm-project@04ce103 (#17352)
by MaheshRavishankar
· 11 months ago
07d6508
Drop `--retries=2` from pytest to fix `--timeout` behavior. (#17384)
by Scott Todd
· 11 months ago
4bada64
Switch docs and samples from 'tf-nightly' to 'tensorflow'. (#17382)
by Scott Todd
· 11 months ago
bf93db7
Switch TensorFlow test requirement off of tf-nightly. (#17378)
by Scott Todd
· 11 months ago
4f27e64
Generalize overriding llvm func attr flags in translation info (#17365)
by Kunwar Grover
· 11 months ago
ab0258d
Switch pkgci CPU ONNX tests to use standard GitHub runner. (#17375)
by Scott Todd
· 11 months ago
2a701d5
[LLVMGPU] Add translation_info config knobs to disable passes (#17340)
by Jakub Kuderski
· 11 months ago
309831a
Disable all w7900 jobs until the runner is stable. (#17371)
by Scott Todd
· 11 months ago
45ca23e
[CPU] Take native_vector_size into accounts for attention op tiling. (#17349)
by Han-Chung Wang
· 11 months ago
a8930d7
Disable test_amd_w7900 job in ci.yml while runner is unstable. (#17369)
by Scott Todd
· 11 months ago
4cc52f7
Bump jinja2 from 2.11.3 to 3.1.4 in /build_tools/benchmarks/reporting (#17288)
by dependabot[bot]
· 11 months ago
3625c60
Revert "Add math expand patterns pass" (#17367)
by Scott Todd
· 11 months ago
d657082
[LLVMGPU] Switch GPU passes to tablegen definitions. NFC. (#17361)
by Jakub Kuderski
· 11 months ago
a9ca8e6
Add math expand patterns pass (#17324)
by jinchen
· 11 months ago