Log - 31de207c4ffaf6f4dfb5d4f9e219201aeda5b26e - 3p/openxla/iree

31de207 Disable failing ukernel tests on Windows (#13276) by bjacob · 2 years ago
8cf1db2 Using FindThreads instead of hardcode `-lpthread`. (#13118) by Rechie Kho · 2 years ago
49d0123 ukernels: pack: move the generation of i64 padding_value to codegen (#13264) by bjacob · 2 years ago
1fd449b ukernels: fold type enums into flags (#13260) by bjacob · 2 years ago
9a8e63e Ukernel interface: take offsets, reorder fields (#13235) by bjacob · 2 years ago
47da9cf Disable TF Bert-Large due to #13211 (#13212) by mariecwhite · 2 years ago
5f16489 ukernels: let `pack` take `padding_value` by value (#13233) by bjacob · 2 years ago
6bd3211 ukernels: drop the unused `i8` case in `unpack`. (#13231) by bjacob · 2 years ago
4bca308 Ukernels: separate public vs internal headers (#13230) by bjacob · 2 years ago
772a335 [runtime] Fix std::array parameter unpacking (#13222) by Eugene Zhulenev · 2 years ago
52861ca Integrate llvm-project at 9e9b1effac34 and bump dependencies (#13232) by Kojo Acquah · 2 years ago
c6c713c [StableHLO] Add initial input conversion pipeline and e2e tests (#13223) by Jakub Kuderski · 2 years ago
786e1a0 Add missing canonicalization to LLVMCPUVectorLowering (#13169) by Diego Caballero · 2 years ago
84d938e Make python dependency in lit test configurable (#13224) by Tori Baker · 2 years ago
ddbb3dd Relax the application conditions for the matmul tensorcore transform … (#13228) by Nicolas Vasilache · 2 years ago
c59cf83 NFC - Improve debug messages around transform dialect matching (#13226) by Nicolas Vasilache · 2 years ago
585d387 Plumb compiler version info through to built Python wheels. (#13214) by Stella Laurenzo · 2 years ago
e199a08 Split mhlo.optimization_barrier lowering into several util.optimization_barrier ops if there are multiple inputs (#13210) by NatashaKnk · 2 years ago
69236fa Remove outdated python version requirement for Pytype (#13209) by Phoenix Meadowlark · 2 years ago
43ceb15 Update matmul tensorcore strategy to properly trigger pipelining with… (#13194) by Nicolas Vasilache · 2 years ago
fe604d3 [StableHLO][NFC] Trim down stablehlo to linalg conversion deps (#13180) by Jakub Kuderski · 2 years ago
978754a tidy up elementwise ukernels (#13204) by bjacob · 2 years ago
4dd4eac [llvmgpu] check if operands can be distributable for shmem promotion (#13201) by Okwan Kwon · 2 years ago
ab37989 Integrate llvm-project at 571eaead173b and bump dependencies (#13200) by Kojo Acquah · 2 years ago
e1a4a2b Remove the `matmul` ukernel (#13175) by bjacob · 2 years ago
e6b7fae port `pack` and `unpack` to `ukernel.generic` (#13173) by bjacob · 2 years ago
0d82968 Use git directly instead of `actions/checkout` on Windows (#13186) by Geoffrey Martin-Noble · 2 years ago
0ab01b6 Use GCS for Windows ccache (#13183) by Geoffrey Martin-Noble · 2 years ago
de2ecca Remove Pixel 4 GPU benchmarks (#13188) by Jerry Wu · 2 years ago
eafc042 Fix ASAN issue casting to uint32 (#13193) by Tori Baker · 2 years ago
0e9c462 Add lowering for BF16 arith/math operations to F32 with truncf/extf wrappers (#12841) by Rob Suderman · 2 years ago
302a289 Generalize matmul tensorcore strategy to work with arbitrary unaligned f32 tens… (#13192) by Nicolas Vasilache · 2 years ago
3730042 Refactor strategy towards generalization (#13190) by Nicolas Vasilache · 2 years ago
a806149 Disable models due to recent regression: https://github.com/openxla/iree/issues/13189 by mariecwhite · 2 years ago
7758993 Remove pull_request by mariecwhite · 2 years ago
af46cd8 Add Bert-Large TF models in various batch sizes by mariecwhite · 2 years, 1 month ago
ed174dc Fix batch_matmul + transpose using nvgpu.mma.sync.1688.f32 (#13075) (#13170) by Manish Gupta · 2 years ago
ab5bcbd Add compiler C API `ireeCompilerGetRevision`. (#13184) by Stella Laurenzo · 2 years ago
8a1182c Put ccache hint in docs on one line. (#13185) by Stella Laurenzo · 2 years ago
20e014b Enable Shared Memory Swizzle Optimization for Batched Matmul (#13060) by Manish Gupta · 2 years ago
4880d00 [Dev Container] - Adding VSCode Dev Container Capability (#12921) by Jonathan DEKHTIAR · 2 years ago
8548216 Fix exportTargetAttr construction in InlineExecutablesPass. (#13144) by Scott Todd · 2 years ago
6a7f69f Fixing vm::ref operator& after type consistency changes. (#13178) by Ben Vanik · 2 years ago
b8527c6 NFC - Parameterize the matmul tensor core strategy towards generaliza… (#13177) by Nicolas Vasilache · 2 years ago
04d2ba5 Updating architecture diagram with "plugins" in a few places. (#12615) by Ben Vanik · 2 years ago
d2f5a7c Update `experimental/web/sample_dynamic/` after runtime changes. (#13174) by Scott Todd · 2 years ago
56c4eff Update TensorFlow package in `samples` Dockerfile to 2.12.0. (#13171) by Scott Todd · 2 years ago
e4e2398 Add CI workflow to run android tests on mobile phones (#13024) by Jerry Wu · 2 years ago
81cf28c Simplifying iree-run-mlir by making it run only a single function. (#13149) by Ben Vanik · 2 years ago
0d5f837 [xla] Drop arg & res attributes on XLA import (#13154) by Jacques Pienaar · 2 years ago
9040c43 Opt in to `-Wc++20-extensions` warning in clang. (#13158) by Scott Todd · 2 years ago
48d7a42 Replace `ukernel.mmt4d` by `ukernel.generic`. (#13143) by bjacob · 2 years ago
479900a [mhlo] Remove redundant preprocessing pattern. (#13153) by Jacques Pienaar · 2 years ago
abbe3da [StableHLO][NFC] Do not alias the stablehlo namespace (#13150) by Jakub Kuderski · 2 years ago
cc783ff [StableHLO] Port convolution to linalg lowering (#13151) by Jakub Kuderski · 2 years ago
73f29cf Appease Windows compiler (#13155) by Nicolas Vasilache · 2 years ago
cf3d786 Add a transform dialect CUDA strategy for unaligned f32 matmul that are still divisible by 4 (#13098) by Thomas · 2 years ago
8d1a790 Integrate llvm-project at c2c49f464435 and bump dependencies (#13135) by Kojo Acquah · 2 years ago
0b92d79 Enable passing tests on CPU. (#13147) by Han-Chung Wang · 2 years ago
1859381 [StableHLO] Port rng to linalg lowering (#13137) by Jakub Kuderski · 2 years ago
9461d3b Adding support for loading VM modules from dynamic libraries. (#13112) by Ben Vanik · 2 years ago
62153df Generalize `ukernel.generic` with `strided_outer_dims` attribute (#13138) by bjacob · 2 years ago
169e6a4 Update mmperf docker image by mariecwhite · 2 years ago
26a145a Update benchmark documentation to store artifacts in build directory (#13141) by Phoenix Meadowlark · 2 years ago
5f81fd3 [StableHLO] Port reduce_window to linalg lowering (#13128) by Jakub Kuderski · 2 years ago
fe53bc6 [NVIDIA Container] Fix `PATH` to include NVIDIA binaries (#13126) by Jonathan DEKHTIAR · 2 years ago
27179e2 Use MPI for NCCL unique ID exchange by default (#12902) by Okwan Kwon · 2 years ago
c205355 fatal non-numerical errors in e2e matmul test (#13139) by bjacob · 2 years ago
0c3a30e Revert "Reorder ukernel operands to match what `ukernel.generic_raw` can generate" (#13136) by bjacob · 2 years ago
6f3c0a4 [codeGen] Added math.exp2 expansion. (#13124) by Balaji V. Iyer · 2 years ago
4b09dc5 Add warpDims argument to buildMapToBlockAndThreads (#13134) by Nicolas Vasilache · 2 years ago
5abc05f Integrate llvm-project at de088dd3a0aa and bump dependencies (#13130) by Kojo Acquah · 2 years ago
63fee48 Adding support for passing processor_id/data to dynamic imports. (#13109) by Ben Vanik · 2 years ago
1b1571d [StableHLO][NFC] Port select_and_scatter tests (#13122) by Jakub Kuderski · 2 years ago
e4c0a11 [StableHLO][NFC] Port reduction tests (#13121) by Jakub Kuderski · 2 years ago
7171c45 [TransformExtensions] Add a `pad(extract_slice(fill))` -> `fill` pattern to `tiling_canonicalization` (#13093) by qcolombet · 2 years ago
e19fc8e Adding a local executable plugin mechanism. (#12625) by Ben Vanik · 2 years ago
51fbedf Update KernelConfig.cpp to properly tile batch dim for convolutions. (#13108) by Kojo Acquah · 2 years ago
70985b5 Expand the CUDA header file pattern for building with CUDA 12.x. (#13106) by pjannaty · 2 years ago
e68e0ba [ci] Make vendor/arch definitions in benchmarks better (#13099) by Lei Zhang · 2 years ago
3f1c154 Reorder ukernel operands to match what `ukernel.generic_raw` can generate (#13103) by bjacob · 2 years ago
f98b5d8 Compute reduction across batch dims (#13091) by harsh-nod · 2 years ago
b798319 Fix MSVC warning: wrong pointer type in `_mm_prefetch` (#13102) by bjacob · 2 years ago
e2ce623 `ukernel.generic_raw` op passing only buffer and offset for memref operands (#13053) by bjacob · 2 years ago
ff19816 Integrate LLVM at llvm/llvm-project@75f1f158 (#13094) by Thomas · 2 years ago
c582ec7 Make boot disks bigger instead of futzing with ramdisk and local ssd (#13079) by Geoffrey Martin-Noble · 2 years ago
544f785 [TransformExtensions] Fix the vector_to_mma_conversion op (#13092) by qcolombet · 2 years ago
245a4f2 Some of the recently added tests require sm80 (#13077) by Tori Baker · 2 years ago
1b06f74 Allow more general accumulators when distributing reduction + broadcast + transpose (#13039) by harsh-nod · 2 years ago
186cdd4 [ROCm] Fix package name prefix and package root dir (#13088) by Boian Petkantchin · 2 years ago
644820c Add support to emit masked cp.async with 2-D masks (#13084) by Nicolas Vasilache · 2 years ago
09630d6 Finally moving VM type registration to iree_vm_instance_t. (#12650) by Ben Vanik · 2 years ago
8be370c [StableHLO] Port patterns to handle scalar op to arith lowering (#13083) by Jakub Kuderski · 2 years ago
df166ed Adding flow.tensor.alloc op for unique allocations. (#13081) by Ben Vanik · 2 years ago
5c005b7 [StableHLO] Re-enable linalg.map lowering tests for non-pointwise ops (#13082) by Jakub Kuderski · 2 years ago
87ef655 Add a structured op matcher for 2d convolutions (#12805) by Quinn Dawkins · 2 years ago
40ecb03 [StableHLO] Port more non-pointwise linalg lowering tests (#13080) by Jakub Kuderski · 2 years ago
586f19e Add codegen side plumbing for calling micro kernels from codegen backends. (#12831) by MaheshRavishankar · 2 years ago
e635b2f Disable gather vectorization (#13067) by Jacques Pienaar · 2 years ago
60b8ef4 [StableHLO][NFC] Enable FileCheck variable scope in linalg lowering tests (#13063) by Jakub Kuderski · 2 years ago