- 573ff1f Integrate LLVM at llvm/llvm-project@362aa434cc31ccca96749a6db8cd97f5b7d71206 (#16960) by Benoit Jacob · 1 year, 1 month ago
- 180e458 Add PDL pre-processing pass to compiler (#16945) by Nirvedh Meshram · 1 year, 1 month ago
- 29db67c Revert "[LLVMCPU][ArmSME] Add `2d-scalable-to-1d-scalable` pass" (#16963) by Scott Todd · 1 year, 1 month ago
- 782ac9d [LLVMCPU][ArmSME] Add `2d-scalable-to-1d-scalable` pass (#16712) by Benjamin Maxwell · 1 year, 1 month ago
- e913ff1 [CPU] Set vectorization options for Mmt4dTilingExpert pipeline. (#16954) by Han-Chung Wang · 1 year, 1 month ago
- 3fa9fbd Integrate LLVM at llvm/llvm-project@a6d932bca8875198fbf34564cda8a8d1640cdcbc (#16944) by Benoit Jacob · 1 year, 1 month ago
- 2c88e49 [LLVMGPU] Wmma layout for LLVMGPU vector distribute pipeline (#16928) by Stanley Winata · 1 year, 1 month ago
- d1eef77 Skip custom hip kernel sample if it would fail to build. (#16949) by Scott Todd · 1 year, 1 month ago
- e34c979 [torch-mlir] Cherrypick fix to fx_importer causing issues with int types. (#16950) by Stella Laurenzo · 1 year, 1 month ago
- c60dcc1 Fix bug in Horner's rule (#16865) by Pawel Paruzel · 1 year, 1 month ago
- 05ff73f [Flow] Do not propagate reshape when it's blocking unpack+generic fusion (#16930) by Han-Chung Wang · 1 year, 1 month ago
- 6ef1cfe Disable LLVM optional deps. (#16942) by Stella Laurenzo · 1 year, 1 month ago
- 52c8d52 Integrate torch-mlir at head (5325d3e6e6e0722ba78e14725b93107e0915710a). (#16940) by Stella Laurenzo · 1 year, 1 month ago
- 1204192 [GPU] Add workgroup transpose strategy to workgroup reordering pass (#16938) by Jakub Kuderski · 1 year, 1 month ago
- be9f097 [hip] Introduce options to control load of libamdhip64.so. (#16766) by Stella Laurenzo · 1 year, 1 month ago
- 41e312a [doc] Add documentation for HIP HAL backend (#16931) by Nithin Meganathan · 1 year, 1 month ago
- cc2ef92 [LLVMGPU] Allow workgroup reordering on ROCm (#16934) by Jakub Kuderski · 1 year, 1 month ago
- 623c1c8 [NFC] Simplify repeated vector push backs (#16936) by Jakub Kuderski · 1 year, 1 month ago
- 12a0b56 [NFC] Simplify type checks with isa predicates (#16935) by Jakub Kuderski · 1 year, 1 month ago
- 5cd0a0c Update Github runner to 2.315 (#16929) by Jerry Wu · 1 year, 1 month ago
- d884c54 [LLVMGPU][ROCm] Tweak preferred tile sizes in the MatmulSimt pipeline (#16923) by Jakub Kuderski · 1 year, 1 month ago
- 719a8a9 [LLVMGPU] Allow sending expanded convolutions down mfma pipeline (#16917) by Quinn Dawkins · 1 year, 1 month ago
- 501cb20 [VectorDistribution] Add better verifiers for anchors in layout analysis (#16924) by Kunwar Grover · 1 year, 1 month ago
- abe9aed [NFC] Fixing typo double and (#16904) by Jose Manuel Monsalve Diaz · 1 year, 1 month ago
- e8f8888 [Flow][Transforms] Add dynamic dim capture support to `scf.for` (#16889) by Markus Böck · 1 year, 1 month ago
- e942406 [cmake] Require runtime tracing for compiler tracing (#16922) by Jakub Kuderski · 1 year, 1 month ago
- 5acacb7 [Codegen] Fix layout analysis for vector.transpose (#16820) (#16921) by Quinn Dawkins · 1 year, 1 month ago
- bfdbd16 Cherrypick llvm/llvm-project@c43932ebdc40. (#16920) by Scott Todd · 1 year, 1 month ago
- c46daa6 Fix for iree-run-module with --input=not-a-path.bin (#16919) by James Newling · 1 year, 1 month ago
- e44cf32 Move external test suite configs out of experimental. (#16907) by Scott Todd · 1 year, 1 month ago
- ff820d6 Re-land "start testing real weight models ..." (#16918) by Scott Todd · 1 year, 1 month ago
- aacdd33 Tighten up the `lower_to_ukernel_ops.mlir` test (#16883) by Benoit Jacob · 1 year, 1 month ago
- daee1dd Ukernels: replace boilerplate by tables. (#16879) by Benoit Jacob · 1 year, 1 month ago
- ab1a65b Integrate LLVM at llvm/llvm-project@a9d1fead9614 (#16891) by Scott Todd · 1 year, 1 month ago
- d6357b4 [TD][Preprocessing] Speed up `match.cast_compatible_dag_from_root` (#16914) by Jakub Kuderski · 1 year, 1 month ago
- 831fcfa [docs][website] Fix up indexing (#16912) by Jakub Kuderski · 1 year, 1 month ago
- 7bda2ec Bump torch-mlir to HEAD (e2343cf4ce9a13e8fa09d6c5ade6524fa7cf2b02). (#16911) by Stella Laurenzo · 1 year, 1 month ago
- c160cb4 [LLVMGPU] Send skinny matmuls to the gpu reduction pipeline (#16898) by Jakub Kuderski · 1 year, 1 month ago
- cd1068b Revert "Start testing real weight models from external test suite." (#16910) by Scott Todd · 1 year, 1 month ago
- de65adf [docs][website] Add subsection on profiling with perf and pprof (#16908) by Jakub Kuderski · 1 year, 1 month ago
- 03749e7 Fix conv preprocessing filtering logic (#16897) by Jakub Kuderski · 1 year, 1 month ago
- 8ab68b6 Start testing real weight models from external test suite. (#16801) by Scott Todd · 1 year, 1 month ago
- 61a1f2e Mark regression tests as passing that now pass. (#16900) by Stella Laurenzo · 1 year, 1 month ago
- 07a854c [CPU][ArmSME] Add `-arm-sme-vector-legalization` to ArmSME pipeline (#16881) by Benjamin Maxwell · 1 year, 1 month ago
- e3ced3a [CodeGen][NFC] Remove unused encoding utils. (#16892) by Han-Chung Wang · 1 year, 1 month ago
- b96adf6 Bump torch-mlir to HEAD (17eeac880af409c6c0473c5930a2c08e25209f4c). (#16896) by Stella Laurenzo · 1 year, 1 month ago
- 2cdf145 fix build errors when tracing mode = 1 (#16884) by Okwan Kwon · 1 year, 1 month ago
- aa72368 Bump TF to nightly dev20240207 (#16871) by Julian Walker · 1 year, 1 month ago
- f3b6bcd Address comments by mariecwhite · 1 year, 1 month ago
- 76515a7 Add i8*i4 matmul microbenchmark by mariecwhite · 1 year, 1 month ago
- 5f2743b [hip] Move into hal/drivers and build by default (#16706) by Lei Zhang · 1 year, 1 month ago
- f19780a Add AMDGPU dialect to registerMlirDialects. (#16859) by Han-Chung Wang · 1 year, 1 month ago
- 565225e [CPU] Add data-tiling for s8s4s32 Arm64 ukernels by mariecwhite · 1 year, 1 month ago
- 767a611 [tracing] fix build errors when tracing mode = 1 (#16853) by Okwan Kwon · 1 year, 1 month ago
- 9e95c38 [Flow] Fix exponential blowup when optimizing dynamic `tensor.dim`s (#16847) by Markus Böck · 1 year, 1 month ago
- e7eef08 Adds a nullptr check around optional implementation in dynamic modules. (#16845) by Stella Laurenzo · 1 year, 1 month ago
- 253881e Fix verifier on stream.async.call to allow call to unknown lifetime. (#16844) by Stella Laurenzo · 1 year, 1 month ago
- ed59bed Enabling external HIP HSACO support ala CUDA external PTX support. (#16830) by Ben Vanik · 1 year, 1 month ago
- 21e067b Integrate LLVM at llvm/llvm-project@1a6ec906fb37 (#16753) by Han-Chung Wang · 1 year, 1 month ago
- 77cc34b Bump numpy dep up to the 2.0 prerelease. (#16800) by Scott Todd · 1 year, 1 month ago
- 2eae9b3 [CPU] Remove 8x8x16 i8mm microkernel by mariecwhite · 1 year, 1 month ago
- a2ed5d1 Trace allocate/deallocate in rocm_allocator. (#16822) by Scott Todd · 1 year, 1 month ago
- 92fe572 [CPU] Add s8s4s32 i8mm ukernel (#16678) by mariecwhite · 1 year, 1 month ago
- b61a918 Enable LTO optimization by default for runtime releases. (#16811) by Stella Laurenzo · 1 year, 1 month ago
- ee32fc7 [rocm] Fix crash when executable source information is missing (#16805) by Lei Zhang · 1 year, 1 month ago
- 2395046 [rocm] Excises (almost) dependence on /opt/rocm from the compiler. (#16803) by Stella Laurenzo · 1 year, 1 month ago
- d05b4a1 Splitting stack trace support out of status.c. (#16791) by Ben Vanik · 1 year, 1 month ago
- 94a0108 Improving fixupGlobalMutability in IREE::VM::GlobalInitializationPass. (#16783) by Ben Vanik · 1 year, 1 month ago
- 15d9039 Embedding executable source contents in binaries for tracing. (#16757) by Ben Vanik · 1 year, 1 month ago
- 05ff3e2 Don't link `opencl.bc` when compiling for ROCm. (#16778) by Scott Todd · 1 year, 1 month ago
- 3b30ab4 Revert "Ukernels: enable limited debug information that is useful in profilers like Tracy." (#16779) by Benoit Jacob · 1 year, 1 month ago
- cdff01f Ukernels: enable limited debug information that is useful in profilers like Tracy. (#15756) by Benoit Jacob · 1 year, 1 month ago
- e074a44 [ROCm] Add MI300 and MI300A target chips to doc (#16767) by Boian Petkantchin · 1 year, 1 month ago
- 20913f8 Read LLVM_VERSION_MAJOR as a directory property (#16771) by Benoit Jacob · 1 year, 1 month ago
- d2542cd Update tracy docs regarding `--iree-hal-dump-executable-sources-to=` (#15814) by Benoit Jacob · 1 year, 1 month ago
- 50aa9a3 Update 'iree-samples' -> 'iree-experimental' after rename. (#16761) by Scott Todd · 1 year, 1 month ago
- e9ee873 Static link when building RISC-V Linux benchmark tools (#16752) by Jerry Wu · 1 year, 1 month ago
- 8711c81 [EmitC] Fix some of the TODOs introduced in #16357 (#16759) by Simon Camphausen · 1 year, 1 month ago
- 858cce6 [LLVMGPU] Fix fused elementwise broadcasts in mfma pipeline (#16756) by Quinn Dawkins · 1 year, 1 month ago
- 5497435 Fixing iree-run-mlir error messages. (#16749) by Ben Vanik · 1 year, 1 month ago
- 26924e4 Adding `iree.tensor.trace` support for printf debugging. (#16746) by Ben Vanik · 1 year, 1 month ago
- e12ab47 Replace `unaryOperator` by EmitC LogicalNotOp (#16730) by Marius Brehler · 1 year, 1 month ago
- 0b8a096 Use EmitCBuilder for VariableOp (#16740) by Marius Brehler · 1 year, 1 month ago
- e9f7ecd [cuda][hip] Drop name_literal reference like Vulkan side (#16742) by Lei Zhang · 1 year, 1 month ago
- 3c296a5 Disabling inlining on the torch async function. (#16739) by Ben Vanik · 1 year, 1 month ago
- de398c3 Let RISCV_LINKER_FLAGS_EXE can be assigned by user (#16738) by Yun Hsiang · 1 year, 1 month ago
- a3603c6 [ROCM] Fix build with runtime tracing enabled (#16737) by Quinn Dawkins · 1 year, 1 month ago
- 18d73c7 [rocm] Fix IREE_ROCM_TRACE_ZONE symbol (#16736) by Lei Zhang · 1 year, 1 month ago
- c8081fd Adding legacy ROCM tracing zones. (#16735) by Ben Vanik · 1 year, 1 month ago
- 0077030 [pkgci] Enable on sdxl feature branch. by Stella Laurenzo · 1 year, 1 month ago
- 12fae0e Update JAX and TFLite MLIR artifacts for benchmarking by mariecwhite · 1 year, 1 month ago
- 7f9d97b Add optional attribute to set MFMA read layout (#16733) by harsh-nod · 1 year, 1 month ago
- 331801c bump torch to 80c7bc3f7ae12413836a2f610a6491794b4dbb08 (#16717) by Daniel Garvey · 1 year, 1 month ago
- 3baa82b [CUDA] Fix CUDA transform tests for generalized mmt (#16729) by Jakub Kuderski · 1 year, 1 month ago
- 27b837c Replace `binaryOperator` by EmitC ops (#16728) by Marius Brehler · 1 year, 1 month ago
- 69432b6 Delete compiler/src/iree/compiler/Dialect/Flow/Transforms/SubsetInsertionOpInterfaceImpl.cpp by Ben Vanik · 1 year, 1 month ago
- fc684f4 [EmitC] Fix error message in compiler driver (#16727) by Simon Camphausen · 1 year, 1 month ago
- 7c2b48c [LLVMGPU][SPIR-V] Run named op generalization early in configuration pipeline (#16726) by Jakub Kuderski · 1 year, 1 month ago
- d153b1c [LLVMGPU] Add shared memory prefetching (#16723) by Kunwar Grover · 1 year, 1 month ago
- 50714ae Retry failed pytest cases to try limiting flakes. (#16718) by Scott Todd · 1 year, 1 month ago