Sign in
opensecura
/
3p
/
openxla
/
iree
/
cc2ccc31d7926a7813588c44491b4449af747f87
cc2ccc3
Use our private copy of Vulkan header files explicitly. (#13346)
by Scott Todd
· 2 years ago
0b77bfd
Unify the two plugin examples to use the same micro-kernel. (#13324)
by MaheshRavishankar
· 2 years ago
935b110
[hal][cts] Add more tests for drivers device creation APIs (#12064)
by Lei Zhang
· 2 years ago
1cbb0fe
benchmark: doubling batch count, set bytes processed (#13269)
by bjacob
· 2 years ago
3cff105
Update tflite bindings to accept both identifier forms (#13195)
by Jacques Pienaar
· 2 years ago
e312348
[StableHLO] Port lowering to LinalgExt (#13331)
by Jakub Kuderski
· 2 years ago
b8a8f5c
Update python bindings for `iree-benchmark-module` to use `--module=-` (#13345)
by Kojo Acquah
· 2 years ago
72c6169
Making iree_hal_channel_provider_t a ref object and exposing on devices. (#13317)
by Ben Vanik
· 2 years ago
8f03e8d
[StableHLO] Port control flow legalization (#13337)
by Jakub Kuderski
· 2 years ago
50a6c17
Add prefetches to fix performance regression on ARM Cortex-X2 (#13342)
by bjacob
· 2 years ago
5d15caf
Removing a verifier check on equality between threadblock K shape and problem K shape (#13310)
by Manish Gupta
· 2 years ago
6c2f27d
[StableHLO] Port shape computation legalization (#13328)
by Jakub Kuderski
· 2 years ago
475af42
[spirv][vulkan] Add f16 e2e matmul tests (#13327)
by Jakub Kuderski
· 2 years ago
d318c54
Rework iree-run-mlir to operate against the IREE compiler C API. (#12715)
by Stella Laurenzo
· 2 years ago
c792591
nonfatal failures in ukernel tests (#13316)
by bjacob
· 2 years ago
4208fee
Refactor OOBI to save framework agnostic results (#13206)
by Phoenix Meadowlark
· 2 years ago
cca2173
[CI] Fix buildkite pipeline (#13322)
by Jakub Kuderski
· 2 years ago
9f09ae4
Add trailer to skip default benchmarking on LLVM integration PR (#13314)
by Jerry Wu
· 2 years ago
af5f4c3
[spirv][vulkan] Do not run nvidia gpu tests on buildkite (#13319)
by Jakub Kuderski
· 2 years ago
3912546
Add a sample that uses the iree_codegen.ukernel.generic operation to call a microkernel supplied as a plugin. (#13132)
by MaheshRavishankar
· 2 years ago
02f85ea
Moving MPI library loading to hal/utils. (#13152)
by Calin Cascaval
· 2 years ago
32a92b6
Post GCS links in CI summary (#13273)
by Jerry Wu
· 2 years ago
ad65e56
Always run benchmarks on LLVM integration PR (#13298)
by Jerry Wu
· 2 years ago
c6ba2a8
polish ukernel test cpu features (#13266)
by bjacob
· 2 years ago
dc1684d
[spirv][vulkan] Run e2 i8 matmul tests in CI (#13312)
by Jakub Kuderski
· 2 years ago
5cd6e1d
Update sample Colab notebooks following TF importer changes. (#13311)
by Scott Todd
· 2 years ago
ebf8490
Integrate llvm/llvm-project@be9c91843bab (#13296)
by Lei Zhang
· 2 years ago
f0fac24
Move benchmark config generation to build_e2e_test_artifacts (#13291)
by Jerry Wu
· 2 years ago
7a86df4
[SPIRV] Add ability to specify transform dialect codegen spec file (#13267)
by Quinn Dawkins
· 2 years ago
e0dc18e
Remove --split-input-file from iree-run-mlir. (#13302)
by Ben Vanik
· 2 years ago
6852a34
[StableHLO][NFC] Port remaining op tests (#13297)
by Jakub Kuderski
· 2 years ago
6677e81
[spirv][vulkan] Enable integer dot prod on desktop GPUs (#13279)
by Jakub Kuderski
· 2 years ago
ddb3223
[NFC] Organize `compiler/Utils/TracingUtils.cpp`. (#13087)
by Scott Todd
· 2 years ago
26f9cdf
Unify typed VMVX entry points into untyped functions. (#13270)
by bjacob
· 2 years ago
4f5d55b
Handle supported ImportOptions in tf.py and fix Windows compatibility. (#13287)
by Scott Todd
· 2 years ago
eca606e
[spirv][webgpu] Expand extended multiplication ops (#13274)
by Jakub Kuderski
· 2 years ago
3537df7
Stop preserving `"encoding"` attribute on `pack` op. (#13293)
by bjacob
· 2 years ago
550b0c4
Swap tile sizes adjustment about pack/unpack to a proper order. (#13272)
by Han-Chung Wang
· 2 years ago
b476624
Add support for computing dispatch workgroup count using program slices. (#13038)
by MaheshRavishankar
· 2 years ago
34e0770
Update pjrt-plugin link in jax.md. (#13284)
by Scott Todd
· 2 years ago
5ad8800
Update mmperf (#13221)
by mariecwhite
· 2 years ago
31de207
Disable failing ukernel tests on Windows (#13276)
by bjacob
· 2 years ago
8cf1db2
Using FindThreads instead of hardcode `-lpthread`. (#13118)
by Rechie Kho
· 2 years ago
49d0123
ukernels: pack: move the generation of i64 padding_value to codegen (#13264)
by bjacob
· 2 years ago
1fd449b
ukernels: fold type enums into flags (#13260)
by bjacob
· 2 years ago
9a8e63e
Ukernel interface: take offsets, reorder fields (#13235)
by bjacob
· 2 years ago
47da9cf
Disable TF Bert-Large due to #13211 (#13212)
by mariecwhite
· 2 years ago
5f16489
ukernels: let `pack` take `padding_value` by value (#13233)
by bjacob
· 2 years ago
6bd3211
ukernels: drop the unused `i8` case in `unpack`. (#13231)
by bjacob
· 2 years ago
4bca308
Ukernels: separate public vs internal headers (#13230)
by bjacob
· 2 years ago
772a335
[runtime] Fix std::array parameter unpacking (#13222)
by Eugene Zhulenev
· 2 years ago
52861ca
Integrate llvm-project at 9e9b1effac34 and bump dependencies (#13232)
by Kojo Acquah
· 2 years ago
c6c713c
[StableHLO] Add initial input conversion pipeline and e2e tests (#13223)
by Jakub Kuderski
· 2 years ago
786e1a0
Add missing canonicalization to LLVMCPUVectorLowering (#13169)
by Diego Caballero
· 2 years ago
84d938e
Make python dependency in lit test configurable (#13224)
by Tori Baker
· 2 years ago
ddbb3dd
Relax the application conditions for the matmul tensorcore transform … (#13228)
by Nicolas Vasilache
· 2 years ago
c59cf83
NFC - Improve debug messages around transform dialect matching (#13226)
by Nicolas Vasilache
· 2 years ago
585d387
Plumb compiler version info through to built Python wheels. (#13214)
by Stella Laurenzo
· 2 years ago
e199a08
Split mhlo.optimization_barrier lowering into several util.optimization_barrier ops if there are multiple inputs (#13210)
by NatashaKnk
· 2 years ago
69236fa
Remove outdated python version requirement for Pytype (#13209)
by Phoenix Meadowlark
· 2 years ago
43ceb15
Update matmul tensorcore strategy to properly trigger pipelining with… (#13194)
by Nicolas Vasilache
· 2 years ago
fe604d3
[StableHLO][NFC] Trim down stablehlo to linalg conversion deps (#13180)
by Jakub Kuderski
· 2 years ago
978754a
tidy up elementwise ukernels (#13204)
by bjacob
· 2 years ago
4dd4eac
[llvmgpu] check if operands can be distributable for shmem promotion (#13201)
by Okwan Kwon
· 2 years ago
ab37989
Integrate llvm-project at 571eaead173b and bump dependencies (#13200)
by Kojo Acquah
· 2 years ago
e1a4a2b
Remove the `matmul` ukernel (#13175)
by bjacob
· 2 years ago
e6b7fae
port `pack` and `unpack` to `ukernel.generic` (#13173)
by bjacob
· 2 years ago
0d82968
Use git directly instead of `actions/checkout` on Windows (#13186)
by Geoffrey Martin-Noble
· 2 years ago
0ab01b6
Use GCS for Windows ccache (#13183)
by Geoffrey Martin-Noble
· 2 years ago
de2ecca
Remove Pixel 4 GPU benchmarks (#13188)
by Jerry Wu
· 2 years ago
eafc042
Fix ASAN issue casting to uint32 (#13193)
by Tori Baker
· 2 years ago
0e9c462
Add lowering for BF16 arith/math operations to F32 with truncf/extf wrappers (#12841)
by Rob Suderman
· 2 years ago
302a289
Generalize matmul tensorcore strategy to work with arbitrary unaligned f32 tens… (#13192)
by Nicolas Vasilache
· 2 years ago
3730042
Refactor strategy towards generalization (#13190)
by Nicolas Vasilache
· 2 years ago
a806149
Disable models due to recent regression: https://github.com/openxla/iree/issues/13189
by mariecwhite
· 2 years ago
7758993
Remove pull_request
by mariecwhite
· 2 years ago
af46cd8
Add Bert-Large TF models in various batch sizes
by mariecwhite
· 2 years, 1 month ago
ed174dc
Fix batch_matmul + transpose using nvgpu.mma.sync.1688.f32 (#13075) (#13170)
by Manish Gupta
· 2 years ago
ab5bcbd
Add compiler C API `ireeCompilerGetRevision`. (#13184)
by Stella Laurenzo
· 2 years ago
8a1182c
Put ccache hint in docs on one line. (#13185)
by Stella Laurenzo
· 2 years ago
20e014b
Enable Shared Memory Swizzle Optimization for Batched Matmul (#13060)
by Manish Gupta
· 2 years ago
4880d00
[Dev Container] - Adding VSCode Dev Container Capability (#12921)
by Jonathan DEKHTIAR
· 2 years ago
8548216
Fix exportTargetAttr construction in InlineExecutablesPass. (#13144)
by Scott Todd
· 2 years ago
6a7f69f
Fixing vm::ref operator& after type consistency changes. (#13178)
by Ben Vanik
· 2 years ago
b8527c6
NFC - Parameterize the matmul tensor core strategy towards generaliza… (#13177)
by Nicolas Vasilache
· 2 years ago
04d2ba5
Updating architecture diagram with "plugins" in a few places. (#12615)
by Ben Vanik
· 2 years ago
d2f5a7c
Update `experimental/web/sample_dynamic/` after runtime changes. (#13174)
by Scott Todd
· 2 years ago
56c4eff
Update TensorFlow package in `samples` Dockerfile to 2.12.0. (#13171)
by Scott Todd
· 2 years ago
e4e2398
Add CI workflow to run android tests on mobile phones (#13024)
by Jerry Wu
· 2 years ago
81cf28c
Simplifying iree-run-mlir by making it run only a single function. (#13149)
by Ben Vanik
· 2 years ago
0d5f837
[xla] Drop arg & res attributes on XLA import (#13154)
by Jacques Pienaar
· 2 years ago
9040c43
Opt in to `-Wc++20-extensions` warning in clang. (#13158)
by Scott Todd
· 2 years ago
48d7a42
Replace `ukernel.mmt4d` by `ukernel.generic`. (#13143)
by bjacob
· 2 years ago
479900a
[mhlo] Remove redundant preprocessing pattern. (#13153)
by Jacques Pienaar
· 2 years ago
abbe3da
[StableHLO][NFC] Do not alias the stablehlo namespace (#13150)
by Jakub Kuderski
· 2 years ago
cc783ff
[StableHLO] Port convolution to linalg lowering (#13151)
by Jakub Kuderski
· 2 years ago
73f29cf
Appease Windows compiler (#13155)
by Nicolas Vasilache
· 2 years ago
cf3d786
Add a transform dialect CUDA strategy for unaligned f32 matmul that are still divisible by 4 (#13098)
by Thomas
· 2 years ago
8d1a790
Integrate llvm-project at c2c49f464435 and bump dependencies (#13135)
by Kojo Acquah
· 2 years ago
0b92d79
Enable passing tests on CPU. (#13147)
by Han-Chung Wang
· 2 years ago
Next »