- abe9aed [NFC] Fixing typo double and (#16904) by Jose Manuel Monsalve Diaz · 1 year, 1 month ago
- e8f8888 [Flow][Transforms] Add dynamic dim capture support to `scf.for` (#16889) by Markus Böck · 1 year, 1 month ago
- e942406 [cmake] Require runtime tracing for compiler tracing (#16922) by Jakub Kuderski · 1 year, 1 month ago
- 5acacb7 [Codegen] Fix layout analysis for vector.transpose (#16820) (#16921) by Quinn Dawkins · 1 year, 1 month ago
- bfdbd16 Cherrypick llvm/llvm-project@c43932ebdc40. (#16920) by Scott Todd · 1 year, 1 month ago
- c46daa6 Fix for iree-run-module with --input=not-a-path.bin (#16919) by James Newling · 1 year, 1 month ago
- e44cf32 Move external test suite configs out of experimental. (#16907) by Scott Todd · 1 year, 1 month ago
- ff820d6 Re-land "start testing real weight models ..." (#16918) by Scott Todd · 1 year, 1 month ago
- aacdd33 Tighten up the `lower_to_ukernel_ops.mlir` test (#16883) by Benoit Jacob · 1 year, 1 month ago
- daee1dd Ukernels: replace boilerplate by tables. (#16879) by Benoit Jacob · 1 year, 1 month ago
- ab1a65b Integrate LLVM at llvm/llvm-project@a9d1fead9614 (#16891) by Scott Todd · 1 year, 1 month ago
- d6357b4 [TD][Preprocessing] Speed up `match.cast_compatible_dag_from_root` (#16914) by Jakub Kuderski · 1 year, 1 month ago
- 831fcfa [docs][website] Fix up indexing (#16912) by Jakub Kuderski · 1 year, 1 month ago
- 7bda2ec Bump torch-mlir to HEAD (e2343cf4ce9a13e8fa09d6c5ade6524fa7cf2b02). (#16911) by Stella Laurenzo · 1 year, 1 month ago
- c160cb4 [LLVMGPU] Send skinny matmuls to the gpu reduction pipeline (#16898) by Jakub Kuderski · 1 year, 1 month ago
- cd1068b Revert "Start testing real weight models from external test suite." (#16910) by Scott Todd · 1 year, 1 month ago
- de65adf [docs][website] Add subsection on profiling with perf and pprof (#16908) by Jakub Kuderski · 1 year, 1 month ago
- 03749e7 Fix conv preprocessing filtering logic (#16897) by Jakub Kuderski · 1 year, 1 month ago
- 8ab68b6 Start testing real weight models from external test suite. (#16801) by Scott Todd · 1 year, 1 month ago
- 61a1f2e Mark regression tests as passing that now pass. (#16900) by Stella Laurenzo · 1 year, 1 month ago
- 07a854c [CPU][ArmSME] Add `-arm-sme-vector-legalization` to ArmSME pipeline (#16881) by Benjamin Maxwell · 1 year, 1 month ago
- e3ced3a [CodeGen][NFC] Remove unused encoding utils. (#16892) by Han-Chung Wang · 1 year, 1 month ago
- b96adf6 Bump torch-mlir to HEAD (17eeac880af409c6c0473c5930a2c08e25209f4c). (#16896) by Stella Laurenzo · 1 year, 1 month ago
- 2cdf145 fix build errors when tracing mode = 1 (#16884) by Okwan Kwon · 1 year, 1 month ago
- aa72368 Bump TF to nightly dev20240207 (#16871) by Julian Walker · 1 year, 1 month ago
- f3b6bcd Address comments by mariecwhite · 1 year, 1 month ago
- 76515a7 Add i8*i4 matmul microbenchmark by mariecwhite · 1 year, 1 month ago
- 5f2743b [hip] Move into hal/drivers and build by default (#16706) by Lei Zhang · 1 year, 1 month ago
- f19780a Add AMDGPU dialect to registerMlirDialects. (#16859) by Han-Chung Wang · 1 year, 1 month ago
- 565225e [CPU] Add data-tiling for s8s4s32 Arm64 ukernels by mariecwhite · 1 year, 1 month ago
- 767a611 [tracing] fix build errors when tracing mode = 1 (#16853) by Okwan Kwon · 1 year, 1 month ago
- 9e95c38 [Flow] Fix exponential blowup when optimizing dynamic `tensor.dim`s (#16847) by Markus Böck · 1 year, 1 month ago
- e7eef08 Adds a nullptr check around optional implementation in dynamic modules. (#16845) by Stella Laurenzo · 1 year, 1 month ago
- 253881e Fix verifier on stream.async.call to allow call to unknown lifetime. (#16844) by Stella Laurenzo · 1 year, 1 month ago
- ed59bed Enabling external HIP HSACO support ala CUDA external PTX support. (#16830) by Ben Vanik · 1 year, 1 month ago
- 21e067b Integrate LLVM at llvm/llvm-project@1a6ec906fb37 (#16753) by Han-Chung Wang · 1 year, 1 month ago
- 77cc34b Bump numpy dep up to the 2.0 prerelease. (#16800) by Scott Todd · 1 year, 1 month ago
- 2eae9b3 [CPU] Remove 8x8x16 i8mm microkernel by mariecwhite · 1 year, 1 month ago
- a2ed5d1 Trace allocate/deallocate in rocm_allocator. (#16822) by Scott Todd · 1 year, 1 month ago
- 92fe572 [CPU] Add s8s4s32 i8mm ukernel (#16678) by mariecwhite · 1 year, 1 month ago
- b61a918 Enable LTO optimization by default for runtime releases. (#16811) by Stella Laurenzo · 1 year, 1 month ago
- ee32fc7 [rocm] Fix crash when executable source information is missing (#16805) by Lei Zhang · 1 year, 1 month ago
- 2395046 [rocm] Excises (almost) dependence on /opt/rocm from the compiler. (#16803) by Stella Laurenzo · 1 year, 1 month ago
- d05b4a1 Splitting stack trace support out of status.c. (#16791) by Ben Vanik · 1 year, 1 month ago
- 94a0108 Improving fixupGlobalMutability in IREE::VM::GlobalInitializationPass. (#16783) by Ben Vanik · 1 year, 1 month ago
- 15d9039 Embedding executable source contents in binaries for tracing. (#16757) by Ben Vanik · 1 year, 1 month ago
- 05ff3e2 Don't link `opencl.bc` when compiling for ROCm. (#16778) by Scott Todd · 1 year, 1 month ago
- 3b30ab4 Revert "Ukernels: enable limited debug information that is useful in profilers like Tracy." (#16779) by Benoit Jacob · 1 year, 1 month ago
- cdff01f Ukernels: enable limited debug information that is useful in profilers like Tracy. (#15756) by Benoit Jacob · 1 year, 1 month ago
- e074a44 [ROCm] Add MI300 and MI300A target chips to doc (#16767) by Boian Petkantchin · 1 year, 1 month ago
- 20913f8 Read LLVM_VERSION_MAJOR as a directory property (#16771) by Benoit Jacob · 1 year, 1 month ago
- d2542cd Update tracy docs regarding `--iree-hal-dump-executable-sources-to=` (#15814) by Benoit Jacob · 1 year, 1 month ago
- 50aa9a3 Update 'iree-samples' -> 'iree-experimental' after rename. (#16761) by Scott Todd · 1 year, 1 month ago
- e9ee873 Static link when building RISC-V Linux benchmark tools (#16752) by Jerry Wu · 1 year, 1 month ago
- 8711c81 [EmitC] Fix some of the TODOs introduced in #16357 (#16759) by Simon Camphausen · 1 year, 1 month ago
- 858cce6 [LLVMGPU] Fix fused elementwise broadcasts in mfma pipeline (#16756) by Quinn Dawkins · 1 year, 1 month ago
- 5497435 Fixing iree-run-mlir error messages. (#16749) by Ben Vanik · 1 year, 1 month ago
- 26924e4 Adding `iree.tensor.trace` support for printf debugging. (#16746) by Ben Vanik · 1 year, 1 month ago
- e12ab47 Replace `unaryOperator` by EmitC LogicalNotOp (#16730) by Marius Brehler · 1 year, 1 month ago
- 0b8a096 Use EmitCBuilder for VariableOp (#16740) by Marius Brehler · 1 year, 1 month ago
- e9f7ecd [cuda][hip] Drop name_literal reference like Vulkan side (#16742) by Lei Zhang · 1 year, 1 month ago
- 3c296a5 Disabling inlining on the torch async function. (#16739) by Ben Vanik · 1 year, 1 month ago
- de398c3 Let RISCV_LINKER_FLAGS_EXE can be assigned by user (#16738) by Yun Hsiang · 1 year, 1 month ago
- a3603c6 [ROCM] Fix build with runtime tracing enabled (#16737) by Quinn Dawkins · 1 year, 1 month ago
- 18d73c7 [rocm] Fix IREE_ROCM_TRACE_ZONE symbol (#16736) by Lei Zhang · 1 year, 1 month ago
- c8081fd Adding legacy ROCM tracing zones. (#16735) by Ben Vanik · 1 year, 1 month ago
- 0077030 [pkgci] Enable on sdxl feature branch. by Stella Laurenzo · 1 year, 1 month ago
- 12fae0e Update JAX and TFLite MLIR artifacts for benchmarking by mariecwhite · 1 year, 1 month ago
- 7f9d97b Add optional attribute to set MFMA read layout (#16733) by harsh-nod · 1 year, 1 month ago
- 331801c bump torch to 80c7bc3f7ae12413836a2f610a6491794b4dbb08 (#16717) by Daniel Garvey · 1 year, 1 month ago
- 3baa82b [CUDA] Fix CUDA transform tests for generalized mmt (#16729) by Jakub Kuderski · 1 year, 1 month ago
- 27b837c Replace `binaryOperator` by EmitC ops (#16728) by Marius Brehler · 1 year, 1 month ago
- 69432b6 Delete compiler/src/iree/compiler/Dialect/Flow/Transforms/SubsetInsertionOpInterfaceImpl.cpp by Ben Vanik · 1 year, 1 month ago
- fc684f4 [EmitC] Fix error message in compiler driver (#16727) by Simon Camphausen · 1 year, 1 month ago
- 7c2b48c [LLVMGPU][SPIR-V] Run named op generalization early in configuration pipeline (#16726) by Jakub Kuderski · 1 year, 1 month ago
- d153b1c [LLVMGPU] Add shared memory prefetching (#16723) by Kunwar Grover · 1 year, 1 month ago
- 50714ae Retry failed pytest cases to try limiting flakes. (#16718) by Scott Todd · 1 year, 1 month ago
- b7c4152 [LinalgExt] Add scaling to attention op (#16679) by Kunwar Grover · 1 year, 1 month ago
- 46326ef Integrate LLVM at llvm/llvm-project@abfac56 (#16710) by Jakub Kuderski · 1 year, 1 month ago
- 4691fc5 Use subgroup size when doing shuffles (#16698) by harsh-nod · 1 year, 1 month ago
- 7da8af6 Cleanup a few docs/references to old paths in HAL/Target. (#16716) by Scott Todd · 1 year, 1 month ago
- a283044 [hip] Make graph command buffer as default for initialization (#16707) by Lei Zhang · 1 year, 1 month ago
- de02c1d Disable external test suite on ROCm while flaky. (#16705) by Scott Todd · 1 year, 1 month ago
- c2a3245 Convert LLVMCPU compiler target to a plugin. (#16704) by Scott Todd · 1 year, 1 month ago
- b5af996 Prefer broadcasting RHS over LHS in AVX-512 multiply-accumulate instructions (#16709) by Benoit Jacob · 1 year, 1 month ago
- 47dfaa5 If negative tolerance in numerical test, then emit error (#16694) by James Newling · 1 year, 1 month ago
- 7884dc8 Revert adding unit dim folding to GlobalOps (#16708) by Max191 · 1 year, 1 month ago
- f513fe2 Add timeouts to pytest cases in pkgci/iree_tests. (#16703) by Scott Todd · 1 year, 1 month ago
- 0545746 [hip] Mark device local + host visible as low performance (#16701) by Lei Zhang · 1 year, 1 month ago
- b027da4 Convert VulkanSPIRV compiler target into a plugin. (#16699) by Scott Todd · 1 year, 1 month ago
- a86b8bf [Preprocessing] Change nesting of FoldUnitExtentDims (#16697) by Max191 · 1 year, 1 month ago
- 2f1d32d [linalg] Add the lowering of quantized_batch_matmul op. (#16615) by Prashant Kumar · 1 year, 1 month ago
- 6d03805 [CPU] Centralize pipeline lowering options and apply them consistently. (#16690) by Han-Chung Wang · 1 year, 2 months ago
- c344e26 Cleanup compiler plugin directory and include paths. (#16691) by Scott Todd · 1 year, 2 months ago
- c87eafe Update external test suite version pin and XFAIL sets. (#16675) by Scott Todd · 1 year, 2 months ago
- bb9409f [VectorDistribution] Emit diagnostics for invalid layouts (#16688) by Jakub Kuderski · 1 year, 2 months ago
- c07d110 [GlobalOpt][Flow] Add GlobalOp folding to FoldUnitExtentDims (#16611) by Max191 · 1 year, 2 months ago
- e612e91 Use createOrFold for linalg_ext dim queries. (#16685) by Ben Vanik · 1 year, 2 months ago
- f812ce2 Drop lists of VulkanSPIRV flags. (#16680) by Scott Todd · 1 year, 2 months ago
- 3bdb45b Use correct 'webgpu-spirv' flag name in samples. (#16681) by Scott Todd · 1 year, 2 months ago