Log - cc2ccc31d7926a7813588c44491b4449af747f87 - 3p/openxla/iree

cc2ccc3 Use our private copy of Vulkan header files explicitly. (#13346) by Scott Todd · 2 years ago
0b77bfd Unify the two plugin examples to use the same micro-kernel. (#13324) by MaheshRavishankar · 2 years ago
935b110 [hal][cts] Add more tests for drivers device creation APIs (#12064) by Lei Zhang · 2 years ago
1cbb0fe benchmark: doubling batch count, set bytes processed (#13269) by bjacob · 2 years ago
3cff105 Update tflite bindings to accept both identifier forms (#13195) by Jacques Pienaar · 2 years ago
e312348 [StableHLO] Port lowering to LinalgExt (#13331) by Jakub Kuderski · 2 years ago
b8a8f5c Update python bindings for `iree-benchmark-module` to use `--module=-` (#13345) by Kojo Acquah · 2 years ago
72c6169 Making iree_hal_channel_provider_t a ref object and exposing on devices. (#13317) by Ben Vanik · 2 years ago
8f03e8d [StableHLO] Port control flow legalization (#13337) by Jakub Kuderski · 2 years ago
50a6c17 Add prefetches to fix performance regression on ARM Cortex-X2 (#13342) by bjacob · 2 years ago
5d15caf Removing a verifier check on equality between threadblock K shape and problem K shape (#13310) by Manish Gupta · 2 years ago
6c2f27d [StableHLO] Port shape computation legalization (#13328) by Jakub Kuderski · 2 years ago
475af42 [spirv][vulkan] Add f16 e2e matmul tests (#13327) by Jakub Kuderski · 2 years ago
d318c54 Rework iree-run-mlir to operate against the IREE compiler C API. (#12715) by Stella Laurenzo · 2 years ago
c792591 nonfatal failures in ukernel tests (#13316) by bjacob · 2 years ago
4208fee Refactor OOBI to save framework agnostic results (#13206) by Phoenix Meadowlark · 2 years ago
cca2173 [CI] Fix buildkite pipeline (#13322) by Jakub Kuderski · 2 years ago
9f09ae4 Add trailer to skip default benchmarking on LLVM integration PR (#13314) by Jerry Wu · 2 years ago
af5f4c3 [spirv][vulkan] Do not run nvidia gpu tests on buildkite (#13319) by Jakub Kuderski · 2 years ago
3912546 Add a sample that uses the iree_codegen.ukernel.generic operation to call a microkernel supplied as a plugin. (#13132) by MaheshRavishankar · 2 years ago
02f85ea Moving MPI library loading to hal/utils. (#13152) by Calin Cascaval · 2 years ago
32a92b6 Post GCS links in CI summary (#13273) by Jerry Wu · 2 years ago
ad65e56 Always run benchmarks on LLVM integration PR (#13298) by Jerry Wu · 2 years ago
c6ba2a8 polish ukernel test cpu features (#13266) by bjacob · 2 years ago
dc1684d [spirv][vulkan] Run e2 i8 matmul tests in CI (#13312) by Jakub Kuderski · 2 years ago
5cd6e1d Update sample Colab notebooks following TF importer changes. (#13311) by Scott Todd · 2 years ago
ebf8490 Integrate llvm/llvm-project@be9c91843bab (#13296) by Lei Zhang · 2 years ago
f0fac24 Move benchmark config generation to build_e2e_test_artifacts (#13291) by Jerry Wu · 2 years ago
7a86df4 [SPIRV] Add ability to specify transform dialect codegen spec file (#13267) by Quinn Dawkins · 2 years ago
e0dc18e Remove --split-input-file from iree-run-mlir. (#13302) by Ben Vanik · 2 years ago
6852a34 [StableHLO][NFC] Port remaining op tests (#13297) by Jakub Kuderski · 2 years ago
6677e81 [spirv][vulkan] Enable integer dot prod on desktop GPUs (#13279) by Jakub Kuderski · 2 years ago
ddb3223 [NFC] Organize `compiler/Utils/TracingUtils.cpp`. (#13087) by Scott Todd · 2 years ago
26f9cdf Unify typed VMVX entry points into untyped functions. (#13270) by bjacob · 2 years ago
4f5d55b Handle supported ImportOptions in tf.py and fix Windows compatibility. (#13287) by Scott Todd · 2 years ago
eca606e [spirv][webgpu] Expand extended multiplication ops (#13274) by Jakub Kuderski · 2 years ago
3537df7 Stop preserving `"encoding"` attribute on `pack` op. (#13293) by bjacob · 2 years ago
550b0c4 Swap tile sizes adjustment about pack/unpack to a proper order. (#13272) by Han-Chung Wang · 2 years ago
b476624 Add support for computing dispatch workgroup count using program slices. (#13038) by MaheshRavishankar · 2 years ago
34e0770 Update pjrt-plugin link in jax.md. (#13284) by Scott Todd · 2 years ago
5ad8800 Update mmperf (#13221) by mariecwhite · 2 years ago
31de207 Disable failing ukernel tests on Windows (#13276) by bjacob · 2 years ago
8cf1db2 Using FindThreads instead of hardcode `-lpthread`. (#13118) by Rechie Kho · 2 years ago
49d0123 ukernels: pack: move the generation of i64 padding_value to codegen (#13264) by bjacob · 2 years ago
1fd449b ukernels: fold type enums into flags (#13260) by bjacob · 2 years ago
9a8e63e Ukernel interface: take offsets, reorder fields (#13235) by bjacob · 2 years ago
47da9cf Disable TF Bert-Large due to #13211 (#13212) by mariecwhite · 2 years ago
5f16489 ukernels: let `pack` take `padding_value` by value (#13233) by bjacob · 2 years ago
6bd3211 ukernels: drop the unused `i8` case in `unpack`. (#13231) by bjacob · 2 years ago
4bca308 Ukernels: separate public vs internal headers (#13230) by bjacob · 2 years ago
772a335 [runtime] Fix std::array parameter unpacking (#13222) by Eugene Zhulenev · 2 years ago
52861ca Integrate llvm-project at 9e9b1effac34 and bump dependencies (#13232) by Kojo Acquah · 2 years ago
c6c713c [StableHLO] Add initial input conversion pipeline and e2e tests (#13223) by Jakub Kuderski · 2 years ago
786e1a0 Add missing canonicalization to LLVMCPUVectorLowering (#13169) by Diego Caballero · 2 years ago
84d938e Make python dependency in lit test configurable (#13224) by Tori Baker · 2 years ago
ddbb3dd Relax the application conditions for the matmul tensorcore transform … (#13228) by Nicolas Vasilache · 2 years ago
c59cf83 NFC - Improve debug messages around transform dialect matching (#13226) by Nicolas Vasilache · 2 years ago
585d387 Plumb compiler version info through to built Python wheels. (#13214) by Stella Laurenzo · 2 years ago
e199a08 Split mhlo.optimization_barrier lowering into several util.optimization_barrier ops if there are multiple inputs (#13210) by NatashaKnk · 2 years ago
69236fa Remove outdated python version requirement for Pytype (#13209) by Phoenix Meadowlark · 2 years ago
43ceb15 Update matmul tensorcore strategy to properly trigger pipelining with… (#13194) by Nicolas Vasilache · 2 years ago
fe604d3 [StableHLO][NFC] Trim down stablehlo to linalg conversion deps (#13180) by Jakub Kuderski · 2 years ago
978754a tidy up elementwise ukernels (#13204) by bjacob · 2 years ago
4dd4eac [llvmgpu] check if operands can be distributable for shmem promotion (#13201) by Okwan Kwon · 2 years ago
ab37989 Integrate llvm-project at 571eaead173b and bump dependencies (#13200) by Kojo Acquah · 2 years ago
e1a4a2b Remove the `matmul` ukernel (#13175) by bjacob · 2 years ago
e6b7fae port `pack` and `unpack` to `ukernel.generic` (#13173) by bjacob · 2 years ago
0d82968 Use git directly instead of `actions/checkout` on Windows (#13186) by Geoffrey Martin-Noble · 2 years ago
0ab01b6 Use GCS for Windows ccache (#13183) by Geoffrey Martin-Noble · 2 years ago
de2ecca Remove Pixel 4 GPU benchmarks (#13188) by Jerry Wu · 2 years ago
eafc042 Fix ASAN issue casting to uint32 (#13193) by Tori Baker · 2 years ago
0e9c462 Add lowering for BF16 arith/math operations to F32 with truncf/extf wrappers (#12841) by Rob Suderman · 2 years ago
302a289 Generalize matmul tensorcore strategy to work with arbitrary unaligned f32 tens… (#13192) by Nicolas Vasilache · 2 years ago
3730042 Refactor strategy towards generalization (#13190) by Nicolas Vasilache · 2 years ago
a806149 Disable models due to recent regression: https://github.com/openxla/iree/issues/13189 by mariecwhite · 2 years ago
7758993 Remove pull_request by mariecwhite · 2 years ago
af46cd8 Add Bert-Large TF models in various batch sizes by mariecwhite · 2 years, 1 month ago
ed174dc Fix batch_matmul + transpose using nvgpu.mma.sync.1688.f32 (#13075) (#13170) by Manish Gupta · 2 years ago
ab5bcbd Add compiler C API `ireeCompilerGetRevision`. (#13184) by Stella Laurenzo · 2 years ago
8a1182c Put ccache hint in docs on one line. (#13185) by Stella Laurenzo · 2 years ago
20e014b Enable Shared Memory Swizzle Optimization for Batched Matmul (#13060) by Manish Gupta · 2 years ago
4880d00 [Dev Container] - Adding VSCode Dev Container Capability (#12921) by Jonathan DEKHTIAR · 2 years ago
8548216 Fix exportTargetAttr construction in InlineExecutablesPass. (#13144) by Scott Todd · 2 years ago
6a7f69f Fixing vm::ref operator& after type consistency changes. (#13178) by Ben Vanik · 2 years ago
b8527c6 NFC - Parameterize the matmul tensor core strategy towards generaliza… (#13177) by Nicolas Vasilache · 2 years ago
04d2ba5 Updating architecture diagram with "plugins" in a few places. (#12615) by Ben Vanik · 2 years ago
d2f5a7c Update `experimental/web/sample_dynamic/` after runtime changes. (#13174) by Scott Todd · 2 years ago
56c4eff Update TensorFlow package in `samples` Dockerfile to 2.12.0. (#13171) by Scott Todd · 2 years ago
e4e2398 Add CI workflow to run android tests on mobile phones (#13024) by Jerry Wu · 2 years ago
81cf28c Simplifying iree-run-mlir by making it run only a single function. (#13149) by Ben Vanik · 2 years ago
0d5f837 [xla] Drop arg & res attributes on XLA import (#13154) by Jacques Pienaar · 2 years ago
9040c43 Opt in to `-Wc++20-extensions` warning in clang. (#13158) by Scott Todd · 2 years ago
48d7a42 Replace `ukernel.mmt4d` by `ukernel.generic`. (#13143) by bjacob · 2 years ago
479900a [mhlo] Remove redundant preprocessing pattern. (#13153) by Jacques Pienaar · 2 years ago
abbe3da [StableHLO][NFC] Do not alias the stablehlo namespace (#13150) by Jakub Kuderski · 2 years ago
cc783ff [StableHLO] Port convolution to linalg lowering (#13151) by Jakub Kuderski · 2 years ago
73f29cf Appease Windows compiler (#13155) by Nicolas Vasilache · 2 years ago
cf3d786 Add a transform dialect CUDA strategy for unaligned f32 matmul that are still divisible by 4 (#13098) by Thomas · 2 years ago
8d1a790 Integrate llvm-project at c2c49f464435 and bump dependencies (#13135) by Kojo Acquah · 2 years ago
0b92d79 Enable passing tests on CPU. (#13147) by Han-Chung Wang · 2 years ago