Sign in
opensecura
/
3p
/
openxla
/
iree
/
f5660ee72bd8e71910d00cec726375fb4b4a6de8
f5660ee
Harden how ConstEval uses llvm-cpu and the runtime libraries. (#17075)
by Scott Todd
· 12 months ago
d12291f
Add Colab notebook showing Hugging Face import via Turbine. (#17093)
by Scott Todd
· 12 months ago
36b3ce1
[runtime][cts] Add test to wait multiple times for the same semaphore value (#17125)
by Boian Petkantchin
· 12 months ago
46f03af
[CodeGen] Add a basic unit test for the TilingConfig class (#17072)
by Andrzej Warzyński
· 12 months ago
07d88b1
[CPU][NFC] CPU/KernelDispatch cleanups (#17124)
by Benoit Jacob
· 12 months ago
0d947f3
Removing the `iree.compiler.consteval` attr. (#17056)
by Ben Vanik
· 12 months ago
cbb4257
Update PckgCI cpu testing to include SDXL model testing + benchmark (#17117)
by saienduri
· 12 months ago
44ccc22
[runtime][hip][cuda] Fix bug where zombified actions may not get cleaned up (#17107)
by Boian Petkantchin
· 12 months ago
f4bee0c
[runtime] Make the runtime more TSan-friendly (#17051)
by Boian Petkantchin
· 12 months ago
0f6bc24
[runtime][hip][cuda] Fix mutex locking when waiting on a semaphore (#17118)
by Boian Petkantchin
· 12 months ago
afd7cab
Add MAINTAINERS.md and RELEASING.md. (#17122)
by Stella Laurenzo
· 12 months ago
f50de8c
Add Python 3.12 to Windows and MacOS builds.
by Stella Laurenzo
· 12 months ago
fd79fca
[Codegen] NFC: Cleanup common transform dialect ops (#17120)
by Quinn Dawkins
· 12 months ago
d861372
[Codegen] Drop iree.fold_arith_ext_into_contraction for upstream variant (#17119)
by Quinn Dawkins
· 12 months ago
e8f4948
[DT] Teach encoding about padding. (#17077)
by Han-Chung Wang
· 1 year ago
78005ef
[runtime][cts] add test where 2 batches wait on different semaphore values (#17091)
by Boian Petkantchin
· 1 year ago
0ec9166
Add f32_to_i2 and i2_to_f32 e2e tests. (#17074)
by Han-Chung Wang
· 1 year ago
074cbf3
[runtime] Refactor semaphore submission CTS (#17108)
by Boian Petkantchin
· 1 year ago
cda70e8
Integrate LLVM at llvm/llvm-project@08163cd9d82690e808c28515523b5fd0923d7b38 (#17116)
by Stanley Winata
· 1 year ago
36b3891
Skip unused check test compilation in riscv + emscripten jobs. (#17114)
by Scott Todd
· 1 year ago
8e86156
[LLVMGPU] Modify layouts to be able to handle dequant operation. (#17113)
by Stanley Winata
· 1 year ago
e87ff17
[LLVMGPU] allow multiple m and n dims in contraction distribution (#16943)
by Quinn Dawkins
· 1 year ago
9f97989
[CodeGen] Fix MLIR types in the function comment. (#17109)
by Han-Chung Wang
· 1 year ago
3f51a55
Replace openxla/iree with iree-org/iree across the project. (#17110)
by Scott Todd
· 1 year ago
04b9f76
Update instructions for getting write-access in iree-org. (#17112)
by Scott Todd
· 1 year ago
6295074
Replace openxla with iree-org in Github runner configs and scripts (#17065)
by Jerry Wu
· 1 year ago
5ed2fec
[GlobalOptimization] Add a pass to do horizontal fusion of contraction operations with a common operand. (#17059)
by Prashant Kumar
· 1 year ago
a2476ce
[metal] Disable failing semaphore submission test until fixing (#17100)
by Lei Zhang
· 1 year ago
0e1e6bf
Clarify fusion heuristic (#17098)
by MaheshRavishankar
· 1 year ago
125f420
[CodeGen] Add a pattern to fold extract_slice consumer into xfer.write. (#17067)
by Han-Chung Wang
· 1 year ago
f755b42
[Codegen] Add folding in createBoundedTileSize for partially dynamic wgSize. (#17089)
by Stanley Winata
· 1 year ago
a0b4853
Fixes to enable out-of-tree plugin builds. (#17095)
by MaheshRavishankar
· 1 year ago
bd1b106
[CodeGen] Drop encoding for HAL and Flow ops when DT is not supported. (#17081)
by Han-Chung Wang
· 1 year ago
886c416
[Winograd] Use TilingInterface for all levels of winograd op tiling (#17061)
by Max191
· 1 year ago
2441959
Add GPU dialect dependencies to C/Python bindings. (#17090)
by Scott Todd
· 1 year ago
ab4babe
[ConstEval] Add flag to adjust tensor size limit for hoisting (#17064)
by Max191
· 1 year ago
eec081c
[LLVMGPU] Fallback if dynamic dim found on vector distribute. (#17085)
by Stanley Winata
· 1 year ago
f86f21c
[python] Expose MLIR python bindings for gpu and transform (#17088)
by Martin Paul Lücke
· 1 year ago
4778f5f
Categorize matmul/metvec like generic ops for dispatches (#17084)
by Lei Zhang
· 1 year ago
f32a87c
[Flow] Move elementwise op fusion and bubble up expand shapes patterns into their own pass. (#17068)
by MaheshRavishankar
· 1 year ago
3677fbc
[runtime] Add semaphore test where 2 batches wait on a former batch amongst 2 (#17080)
by Boian Petkantchin
· 1 year ago
ff624dd
Categorize dispatch name better for linalg.generic cases (#16677)
by Lei Zhang
· 1 year ago
d284154
Making FlattenFullFillToSplat more conservative. (#17079)
by Ben Vanik
· 1 year ago
a8731a3
Set top level token permissions (#16744)
by Marius Brehler
· 1 year ago
cd282de
[runtime][hip][cuda] Fix waiting on a semaphore on the host (#17073)
by Boian Petkantchin
· 1 year ago
fdfe344
[LinalgExt] Moving encoding utils to EncodingAttr builtin or LinalgExt/IR (#17053)
by Han-Chung Wang
· 1 year ago
c83f9ba
NFC: Add VectorLayoutInterface method for getting the layout rank (#17071)
by Quinn Dawkins
· 1 year ago
5adae9e
[Codegen] Fix `TilingConfig::getTilingLevelForVectorDimPosition()` for <= 2 tiling levels (#17050)
by Benjamin Maxwell
· 1 year ago
ace6397
Add boolean option to do RELU in mlp plugin (#17058)
by Nirvedh Meshram
· 1 year ago
872f0b6
Fix crash in transform dialect script when using attention script with ToT IREE (#17066)
by MaheshRavishankar
· 1 year ago
56541e4
[NFC] Switching to not use using-directives in UtilExternalModels.cpp (#17063)
by Han-Chung Wang
· 1 year ago
5e75105
[Flow] Implement ValueBoundsOpInterface for flow.dispatch.tensor.load op (#17062)
by Han-Chung Wang
· 1 year ago
06f41ce
[Preprocessing] Add pad to MMA intrinsic size pass (#17057)
by Jakub Kuderski
· 1 year ago
6d4a99c
Use `iree-turbine` in PyTorch docs and samples. (#17036)
by Scott Todd
· 1 year ago
915b42e
Add DataLayoutPropagation pass to bubble up/push down pack and unpack (#16731)
by Jerry Wu
· 1 year ago
503edab
[GlobalOpt][DT] Simplify logics in SetEncoding pass. (#17040)
by Han-Chung Wang
· 1 year ago
0bd3c1d
[stablehlo] Update stablehlo to 341e063f0924fc1350538dc53a92c21ec5e022a3 (#17026)
by Balaji V. Iyer
· 1 year ago
e36844f
Add ability to call the same custom dispatch multiple times when using pdl patterns. (#16967)
by Nirvedh Meshram
· 1 year ago
1d00b50
Update Discord invite link. (#17052)
by Scott Todd
· 1 year ago
22faa15
Update README.md with other discord link.
by Stella Laurenzo
· 1 year ago
72aeee2
Update README.md with new Discord link.
by Stella Laurenzo
· 1 year ago
2039d56
Remove the fixed point iteration in the global opt pipeline. (#17049)
by Ben Vanik
· 1 year ago
529826f
Add missing line continuation slash to recently updated page. (#17048)
by Scott Todd
· 1 year ago
459fab6
[runtime][hip][cuda] Fix waiting on wait semaphores before executing actions (#17025)
by Boian Petkantchin
· 1 year ago
3d626b5
[Preprocessing] NFC: Finish migrating passes to use new tablegen (#17047)
by Quinn Dawkins
· 1 year ago
39091a7
[Flow] Switch to new pass generation tablegen definitions (#17046)
by Quinn Dawkins
· 1 year ago
1c49d6a
[runtime][hip][cuda] Add tracing in graph execution mode (#16894)
by Boian Petkantchin
· 1 year ago
080657f
Fix failing transform dialect CUDA tests. (#17042)
by MaheshRavishankar
· 1 year ago
bdbf42a
Clone `tensor.empty` operations on dispatches with only `linalg.generic` ops. (#17043)
by MaheshRavishankar
· 1 year ago
c2abb93
Disable TD tests on CUDA backends due to failure. (#17041)
by MaheshRavishankar
· 1 year ago
55fafcf
Forking dynamic behavior from flow.tensor.constant. (#17034)
by Ben Vanik
· 1 year ago
954cb36
Move Codegen pass pipelines to nest on `FunctionOpInterface`. (#16665)
by MaheshRavishankar
· 1 year ago
699d244
Integrate llvm 20240412 (#17031)
by Vivian
· 1 year ago
f616123
[python] Expose python bindings for amdgpu in iree.compiler.dialects (#17028)
by Martin Paul Lücke
· 1 year ago
40f2533
[HIP] Add inline execution mode (#16951)
by Nithin Meganathan
· 1 year ago
390f865
Integrate llvm 20240411 at 56954a53e58282d7584e31ec14a2b1052cd861e8 (#17027)
by Vivian
· 1 year ago
76e9cfe
Allowing flow.tensor.constant to be used for constants. (#17024)
by Ben Vanik
· 1 year ago
fbd31b0
[Vulkan] Fix coop matrix property initialization (#17023)
by Jakub Kuderski
· 1 year ago
94971b4
[LLVMGPU] Fit mma schedules inside shared memory limits (#16927)
by Kunwar Grover
· 1 year ago
4437c43
[CPU RISCV] Avoid using pre-configured tile sizes as input vector sizes (#17018)
by Bruce Lai
· 1 year ago
26f77de
[CPU] Fix FusionOfTensorOps nullptr (#17015)
by Diego Caballero
· 1 year ago
7dca44b
[GPU] Overhaul reduce shared memory bank conflicts pass for gfx9 (#17010)
by Jakub Kuderski
· 1 year ago
0a92a71
Integrate llvm at 9760872b537ba8e6eee2e68eb81b7d26af5b40e4 (#17011)
by Vivian
· 1 year ago
2566f15
Re-enable ROCm ONNX tests, running one test at a time. (#17014)
by Scott Todd
· 1 year ago
67e234c
Update "bindings" reference page to reflect current support status. (#17005)
by Scott Todd
· 1 year ago
2780fd5
Fix typo in benchmark tracing warning message.
by Ben Vanik
· 1 year ago
336ba12
Fill in details on parameter formats (IRPA, GGUF, safetensors). (#17006)
by Scott Todd
· 1 year ago
e006a05
Document recently added utility compiler passes. (#16983)
by Scott Todd
· 1 year ago
b4273a4
Increase pip install timeout for pkgci venv setup. (#17008)
by Scott Todd
· 1 year ago
92f10a7
[CodeGen] Add fpowi pattern to PolynomialApproximationPass (#17003)
by Daniel Garvey
· 1 year ago
8a94d5c
Refresh "profiling with Tracy" developer docs. (#16939)
by Scott Todd
· 1 year ago
8ee0ada
[GlobalOpt][DT] Add a flag to disable early materialization. (#16997)
by Han-Chung Wang
· 1 year ago
d42e457
Add e2e tests for FA2 (#16953)
by erman-gurses
· 1 year ago
ab949ef
[Codegen] Add support for vectorizing tensor.unpack ops with masking. (#16664)
by Han-Chung Wang
· 1 year ago
5a95fd4
Avoid distributing loops that are statically known to be unit trip count (#16985)
by MaheshRavishankar
· 1 year ago
a144ff6
Fix a “Library not loaded” issue on macos (#16987)
by Atomoper
· 1 year ago
190d959
Allow users to specify riscv cpu and get hardware features (#16902)
by Alex Chiang
· 1 year ago
bb7e536
Disable out of tree ROCm tests again. (#16994)
by Scott Todd
· 1 year ago
dcc8e19
Adds a flag to enable/disable vector contract custom kernels in `LLVMCPUMmt4dVectorLoweringPass` (#16867)
by Kojo Acquah
· 1 year ago
39bf204
Integrate LLVM at 9708d0900311503aa4685d6810d8caf0412e15d7 (#16988)
by Benoit Jacob
· 1 year, 1 month ago
Next »