Log - 5b243a84fc8374181c2fe5c2aaeccf27ad54316c - 3p/openxla/iree

5b243a8 [Backend][ROCM] Add gfx1150 support. (#17508) by Stanley Winata · 10 months ago
aa0bc40 [Codegen][GPU] Add pass to fuse and hoist scf.forall ops (#17505) by Quinn Dawkins · 10 months ago
29e70ab Update onnx package version minimum to 1.16.0. (#17504) by Scott Todd · 10 months ago
a6a56a9 Add `LinalgFusionInterface` to support fusion for linalg_ext ops (added `scatter` and `reverse`) (#17428) by Ian Wood · 10 months ago
3d1364e [Codegen][GPU] Add pattern to lower iree_gpu.multi_mma to intrinsics (#17457) by Quinn Dawkins · 10 months ago
ab8f668 Revert "Data tiling: transpose narrow-N into narrow-M" (#17503) by Benoit Jacob · 10 months ago
e33ca89 [LinalgExt] Split TileAndDecomposeAttention (#17468) by Kunwar Grover · 10 months ago
322d688 [Codegen][GPU] Add pattern to drop lead unit dims of multi_mma ops (#17456) by Quinn Dawkins · 10 months ago
117cb43 Test 'console' provider in 'tracing' job. (#16454) by Scott Todd · 10 months ago
16bdaa9 Data tiling: transpose narrow-N into narrow-M (#17446) by lialan · 10 months ago
6c75aa1 [Codegen][GPU] Allow iree_gpu.tensor_barrier to take vectors (#17479) by Quinn Dawkins · 10 months ago
1750e2b Integrate LLVM at 855eef2abd81cb8c7543d4748353d5e378fdd4c2 (#17501) by Benoit Jacob · 10 months ago
051c361 NFC: Make a few loop transformations more accessible (#17489) by Quinn Dawkins · 10 months ago
9e3d27a Upgrade to nanobind 2.0. (#17497) by Stella Laurenzo · 10 months ago
cad02f9 [Codegen][GPU] Add unrolling pattern for iree_gpu.multi_mma (#17454) by Quinn Dawkins · 10 months ago
46c6bf5 [CPU] Add support for pack ukernel preparation. (#17472) by Han-Chung Wang · 10 months ago
3d6a8ee Bump Tracy to https://github.com/wolfpld/tracy/commit/cf2344111. (#17488) by Scott Todd · 10 months ago
abdf550 Update IREE onnx import to be in sync with Torch-MLIR (#17476) by saienduri · 10 months ago
a842527 [Codegen][GPU] Drop dead PassDetail.h file (#17490) by Quinn Dawkins · 10 months ago
440c870 Bump torch-mlir to 5bb1a65 on 2024-05-23 (#17483) by zjgarvey · 10 months ago
63dff03 [Codegen][GPU] Add iree_gpu.tensor_barrier op (#17478) by Quinn Dawkins · 10 months ago
31e1a30 [Codegen][GPU] Add dictionary based lowering config attribute (#17463) by Quinn Dawkins · 10 months ago
3a2617f [runtime][hip][cuda] Fix semaphore multi-wait, action GPU events and cleanup (#17213) by Boian Petkantchin · 10 months ago
ea7d01e Pin nanobind to 1.9.2 to defer 2.0.0 API changes. (#17481) by Scott Todd · 10 months ago
fe3fb24 Allow passwordless sudo in docker images (#17473) by Boian Petkantchin · 10 months ago
008add9 [CodeGen][NFC] Rename DecomposeBatchMmt4DOps to CPUPrepareUkernels. (#17471) by Han-Chung Wang · 10 months ago
30e0238 Bump LLVM to llvm/llvm-project@bd3f5a4bd3d9d7ee8ae801c24c5081073b20abd4 (#17470) by MaheshRavishankar · 10 months ago
9fe159d [LinalgExt] Generalize attention tiling interface implementation (#17408) by Kunwar Grover · 10 months ago
900ec67 Split benchmark jobs into their own independent workflow file. (#17400) by Scott Todd · 10 months ago
f7ca45d [ArmSME][test] Enable TransposeMatmulPass and peeling for e2e matmuls (#17452) by Benjamin Maxwell · 10 months ago
1316c92 [Codegen] NFC: Move the lowering config to an attribute interface (#17439) by Quinn Dawkins · 10 months ago
e36b355 Update integrate branch and title regexes for new naming. (#17464) by Scott Todd · 10 months ago
db8b536 Bump torch-mlir to b870729efe5929b1ee6ff1c7b27d4d1857cdacc7 on 2024-05-21 (#17460) by zjgarvey · 10 months ago
02c660c Log compile and run commands on successful model tests too. (#17290) by Scott Todd · 10 months ago
de5760d Bump LLVM to llvm/llvm-project@1727594 (#17459) by MaheshRavishankar · 10 months ago
7813fd3 [CPU] Fix a distribution bug and limiting distribution tile sizes. (#17436) by Han-Chung Wang · 10 months ago
d4aa849 [CPU] Add support for pack/unpack ukernels enablement on llvm-cpu path. (#17427) by Han-Chung Wang · 10 months ago
9f59514 Add AVX-512 pack ukernel tile function for `16x2xbf16`. (#17432) by Benoit Jacob · 10 months ago
01b020e Re-enable w7900 jobs. (#17445) by saienduri · 10 months ago
6c5198d Folding no-op stream.async.update ops away. (#17458) by Ben Vanik · 10 months ago
006af5d [GPU] Support specifying LLVMGPU backend target features (#17451) by Lei Zhang · 10 months ago
a36773a [Codegen][GPU] Add vectorization pattern for iree_gpu.multi_mma (#17453) by Quinn Dawkins · 10 months ago
f6a38ac [GPU] Thread through a common target description (#17217) by Lei Zhang · 10 months ago
62a996b [Codegen] Add lane distribution for scf.forall (#17373) by Quinn Dawkins · 10 months ago
080b1fa [Codegen][GPU] Add a contraction like operation for mma intrinsics (#17374) by Quinn Dawkins · 10 months ago
29d0ceb Enable a test suite for convolution + winograd. (#17447) by Han-Chung Wang · 10 months ago
e0f3c05 [Codegen][GPU] Change iree_gpu.shuffle_tensor to take a region for the read (#17425) by Quinn Dawkins · 10 months ago
dc61fcc Register ShapeDialect in StableHLO plugin. (#17444) by Scott Todd · 10 months ago
2a2a4d0 Update various deps to their latest commits. (#17442) by Scott Todd · 10 months ago
a3b74bc [CPU][ArmSME] Update tiling to use all SME accumulators (#16389) by Benjamin Maxwell · 11 months ago
4132d2e [runtime][hal][hip] Implement collectives via RCCL (#17270) by Boian Petkantchin · 11 months ago
f849e2f Integrate LLVM at `502ccd81` (clean) (#17429) by Ingo Müller · 11 months ago
98973b3 Add tip for adding new signing key to github (#17420) by Kunwar Grover · 11 months ago
6d95f8c Integrate LLVM at `74a87548` (clean) (#17423) by Ingo Müller · 11 months ago
4f8ee51 Moving demotion/promotion passes to input conversion. (#17422) by Ben Vanik · 11 months ago
dece30e [CPU] Do not decompose pack/unpack ops on x86 backends. (#17366) by Prashant Kumar · 11 months ago
218b934 Support GGUF version 2 as well as 3. (#17319) by Scott Todd · 11 months ago
c1fdd75 Introduce new logo assets. (#17424) by Scott Todd · 11 months ago
f2fcbbf [iree][global] Add conv2d op to demote to bf16 pass (#17410) by Prashant Kumar · 11 months ago
3b5b70a Integrate LLVM at `1650f1b3` (clean) (#17418) by Ingo Müller · 11 months ago
b4fc0b4 Implementing the f64 VM extension and flipping the flag by default. (#17416) by Ben Vanik · 11 months ago
b716704 Update git-clang-format ref and clang-format version. (#16792) by Scott Todd · 11 months ago
8fcab13 [Flow] Improve annotation name for conv (#17417) by MaheshRavishankar · 11 months ago
a19fa24 Add tips on signing commits for the DCO check. (#17412) by Scott Todd · 11 months ago
356e2b7 [Codegen] Add op for flattening warp and thread ids of forall ops (#17368) by Quinn Dawkins · 11 months ago
b17410c Integrate LLVM at `fb9a028b` (clean) (#17411) by Ingo Müller · 11 months ago
90db41a [LLVMGPU] Add Winograd pipeline for LLVMGPU (#17302) by Max191 · 11 months ago
05d5710 Integrate LLVM at `c5e67b86` (+1 local revert) (#17409) by Ingo Müller · 11 months ago
4021109 [Winograd] Add filtering by annotations for Winograd rewrites (#17332) by Max191 · 11 months ago
0260947 [GlobalOpt] Simplify the logic used to pick the groups. (#17405) by MaheshRavishankar · 11 months ago
bf0fbf0 Fix typo in community/blog/posts/mmt4d.md (#17406) by Bruce Lai · 11 months ago
9a294eb [Winograd] Use output_tile_size for more static output transform tiling (#17200) by Max191 · 11 months ago
748db31 Fuse Generic Ops Generated by `gather` Lowering (#17341) by Ian Wood · 11 months ago
428adf2 [LLVMGPU] Add debug prints for vector distribution config (#17404) by Jakub Kuderski · 11 months ago
ecc6983 Drop Tracy from CI benchmarks. (#17383) by Scott Todd · 11 months ago
78f5e8d Integrate torch-mlir@ec6d7aa onnx.resize op (#17358) by Chi_Liu · 11 months ago
2a8d681 [CPU] Remove CPUDoubleTilingPeelingExpert (#17329) by Andrzej Warzyński · 11 months ago
b0f5521 [GitHub] Add Jakub to codeowners for SPIR-V/Vulkan and LLVMGPU/ROCm (#17399) by Jakub Kuderski · 11 months ago
3bac7ec Add math expand patterns pass (#17395) by jinchen · 11 months ago
9f0282b Fixes double-free in ReorderBroadcastInDimOpAndElementwiseOp. (#17394) by Ben Vanik · 11 months ago
29a12f3 [Preprocessing] Remove `input=none` option from TransposeMatmulPass (#17364) by Benjamin Maxwell · 11 months ago
a78cee1 Add support for serializing the textual representation of LLVM IR. (#17193) by Phoebe Chen · 11 months ago
8d8d18c [LinalgExt] Simplify Attention unit tests (#17393) by Kunwar Grover · 11 months ago
a8404a8 [LLVMGPU] Preserve config dictionary during MapNestedForallToGpuThreadsOp application (#17381) by Kunwar Grover · 11 months ago
2ed4778 Integrate LLVM at `a1d43c14d` (+1 revert) (#17380) by Benoit Jacob · 11 months ago
06eb43d Use coalesce loops (#17314) by MaheshRavishankar · 11 months ago
01ef465 Bump LLVM to llvm/llvm-project@04ce103 (#17352) by MaheshRavishankar · 11 months ago
07d6508 Drop `--retries=2` from pytest to fix `--timeout` behavior. (#17384) by Scott Todd · 11 months ago
4bada64 Switch docs and samples from 'tf-nightly' to 'tensorflow'. (#17382) by Scott Todd · 11 months ago
bf93db7 Switch TensorFlow test requirement off of tf-nightly. (#17378) by Scott Todd · 11 months ago
4f27e64 Generalize overriding llvm func attr flags in translation info (#17365) by Kunwar Grover · 11 months ago
ab0258d Switch pkgci CPU ONNX tests to use standard GitHub runner. (#17375) by Scott Todd · 11 months ago
2a701d5 [LLVMGPU] Add translation_info config knobs to disable passes (#17340) by Jakub Kuderski · 11 months ago
309831a Disable all w7900 jobs until the runner is stable. (#17371) by Scott Todd · 11 months ago
45ca23e [CPU] Take native_vector_size into accounts for attention op tiling. (#17349) by Han-Chung Wang · 11 months ago
a8930d7 Disable test_amd_w7900 job in ci.yml while runner is unstable. (#17369) by Scott Todd · 11 months ago
4cc52f7 Bump jinja2 from 2.11.3 to 3.1.4 in /build_tools/benchmarks/reporting (#17288) by dependabot[bot] · 11 months ago
3625c60 Revert "Add math expand patterns pass" (#17367) by Scott Todd · 11 months ago
d657082 [LLVMGPU] Switch GPU passes to tablegen definitions. NFC. (#17361) by Jakub Kuderski · 11 months ago
a9ca8e6 Add math expand patterns pass (#17324) by jinchen · 11 months ago