Log - c6924b619d22276d5148ca5c608b986bdf51b217 - 3p/openxla/iree

c6924b6 [EmitC] Remove array related macros from `ops_emitc.h` (#18335) by Simon Camphausen · 7 months ago
8da4564 Revert "[Flow] Convert from tensor.cast to flow.tensor.reshape early …(#18256)" (#18331) by Nirvedh Meshram · 7 months ago
c44d29b [compiler] Make cuda/hip/vulkan target cl options consistent (#17710) by Lei Zhang · 7 months ago
d1ccc8c [Codegen] Disable transform dialect jit by default (#18305) by Nithin Meganathan · 7 months ago
292f2d4 [Stream] fix ordering issue in EmplaceAllocations (#18321) by Nirvedh Meshram · 7 months ago
8e42839 [Codegen][GPU] Add pass to combine value_barrier ops (#18298) by Kunwar Grover · 7 months ago
6ca0613 Integrate LLVM at llvm/llvm-project@7aa22f013e24d20291aad745368ff907baa9dfa4 (#18286) by Kunwar Grover · 7 months ago
1c0c5a6 [Flow] Convert from tensor.cast to flow.tensor.reshape early (#18256) by Nirvedh Meshram · 7 months ago
8dd1db3 Bubble expand shapes through `AttentionOp`s (#18074) by Ian Wood · 7 months ago
6a92fb7 Add onnx to InputType enum (#18310) by Tim Gymnich · 7 months ago
7c8fedc Remove PyYAML dependency from Python bindings. (#18262) by Scott Todd · 7 months ago
3a06719 [compiler][stream] Avoid circular dependencies between partitions in execution scheduling (#18217) by Boian Petkantchin · 7 months ago
cea581f Move LinalgQuantized* passes to GlobalOptimization (#18287) by Quinn Dawkins · 7 months ago
3af05b9 [Flow] Make CollapseDimensions iterative (#18203) by Ian Wood · 7 months ago
137e365 [Codegen][GPU] Add pass to annotate memory spaces on allocations (#18251) by Quinn Dawkins · 7 months ago
5beb9ad [LinalgExt] Better doc for FP8 attention clamping. (#18301) by Stanley Winata · 7 months ago
5ba9a89 [LinalgExt] Reland QK scaling for attention decomp and further optimizations of it. (#18293) by Stanley Winata · 7 months ago
ab0d4c6 [Codegen][GPU] Rework scf.forall fusion to support different thread counts (#18280) by Quinn Dawkins · 7 months ago
87084d5 [Codegen][GPU] Remove insert_slice semantics from shuffle_tensor (#18279) by Quinn Dawkins · 7 months ago
95d5562 [vulkan] Update default RDNA GPU subgroup size to 32 (#18207) by Nithin Meganathan · 7 months ago
30040c7 [Codegen] Add pass to tile and distribute workgroups using `scf.forall` op. (#18266) by MaheshRavishankar · 7 months ago
ed512a2 [HAL][Loader] Switch to tablegen pass generation (#18282) by Marius Brehler · 7 months ago
da958c4 [HAL][Inline] Switch to tablegen pass generation (#18278) by Marius Brehler · 7 months ago
f224bc8 [ConstEval] Switch to tablegen pass generation (#18228) by Marius Brehler · 7 months ago
d5c6370 Make `iree_gpu.value_barrier` accept multiple operands (and return multiple results) (#18192) by Kunwar Grover · 7 months ago
aeda149 [InputConversion] Switch to tablegen pass generation (#18245) by Marius Brehler · 7 months ago
10ba28d [Codegen][GPU] Add kernel config for LLVMGPUTileAndFuse (#17791) by Quinn Dawkins · 7 months ago
7cf3fc6 [Codegen][GPU] Fix allocation space in iree_gpu.shuffle_tensor lowering (#18250) by Quinn Dawkins · 7 months ago
a884b93 Bump LLVM to llvm/llvm-project@ddda37a (#18258) by Stanley Winata · 7 months ago
0e152d2 [Codegen] Add `DeviceMappingAttr` that maps to workgroup IDs. (#18264) by MaheshRavishankar · 7 months ago
40258db [CodeGen][DT] Make the TypeConverter carry targetAttr info. (#18242) by Han-Chung Wang · 7 months ago
d25712c [VMVX] Switch to tablegen pass generation (#18248) by Marius Brehler · 7 months ago
8a1d78b [Codegen][CPU] Enable scalable transfer lowerings (#18170) by Benjamin Maxwell · 7 months ago
551cd54 [TOSA] Switch to tablegen pass generation (#18227) by Marius Brehler · 7 months ago
878a99b [torch] Switch to tablegen pass generation (#18226) by Marius Brehler · 7 months ago
41f1f49 [Codegen] Add a pass option to control input -> dest pattern (#18240) by Quinn Dawkins · 7 months ago
78f54c2 [Codegen][GPU] Add a pass for basic distribution verification (#18236) by Quinn Dawkins · 7 months ago
66ed138 [CPU] Make VectorPreProcStrategy consider undefined behaviors (#18146) by lialan · 8 months ago
b144e90 [test] Check depthwise conv is vectorized in test (NFC) (#18225) by Benjamin Maxwell · 8 months ago
53a7bc4 Replace `iree_compiler::VscaleRange` with `vector::VscaleRange` (NFC) (#18218) by Benjamin Maxwell · 8 months ago
300af39 [codegen] Add max_workgroup_counts to TargetWgpAttr (#17771) by Krzysztof Drewniak · 8 months ago
7d60397 [LinalgExt] Switch to new pass generation tablegen definitions. (#18216) by Han-Chung Wang · 8 months ago
fe638b0 [Codegen][CPU] Eliminate all-true vector masks after vectorization (#18190) by Benjamin Maxwell · 8 months ago
c71fe1a [WGSL][NFC] Switch to new pass generation tablegen definitions. (#18215) by Han-Chung Wang · 8 months ago
7cac1b2 [SPIRV] Switch to new pass generation tablegen definitions. (#18214) by Han-Chung Wang · 8 months ago
a72e78b [LLVMGPU] Switch to new pass generation tablegen definitions. (#18213) by Han-Chung Wang · 8 months ago
0c2f51b [LLVMGPU] Drop WorkgroupSpecializationPass (#18212) by Nirvedh Meshram · 8 months ago
868f41e [ROCM] fix layout for WMMA_F16_16x16x16_F16 intrinsic (#18206) by Nirvedh Meshram · 8 months ago
08583d5 Bump LLVM to llvm/llvm-project@6b7afaa9db8f (#18197) by Stanley Winata · 8 months ago
b297d5b [Codegen][GPU] Add bank conflict reduction pass to TileAndFuse (#18204) by Quinn Dawkins · 8 months ago
2ea9b14 [Codegen] Add support for memref.expand_shape to propagation util (#18202) by Quinn Dawkins · 8 months ago
9c951ca [Flow] Generalize horizontal contraction fusion to cover more cases. (#17880) by MaheshRavishankar · 8 months ago
7812c77 [Codegen][GPU] Add support for all other intrinsics to TileAndFuse (#18179) by Quinn Dawkins · 8 months ago
3901e62 [GPU][NFC] Update the comment of intrinsic format. (#18194) by Han-Chung Wang · 8 months ago
ad2f0f8 [LLVMCPU] Add option `onlyFuseProducerInputOperands` to tileRootFuseConsumerProducer Pass (#18114) by Prashant Kumar · 8 months ago
6ac6be6 [GlobalOpt] Improve unary elementwise propagation to consider broadcasted operands (#17903) by Quinn Dawkins · 8 months ago
8dc6820 Adding simplified HAL dispatch methods. (#18189) by Ben Vanik · 8 months ago
3483893 [CodeGen][Common] Switch to new pass generation tablegen definitions. (#18166) by Han-Chung Wang · 8 months ago
49198a9 [EmitC] Remove unsused code from builders (#18191) by Simon Camphausen · 8 months ago
67b0e25 Remove leftovers from the old CppEmitter (#18175) by Marius Brehler · 8 months ago
b06bf6a [Codegen] Query `#iree_gpu.target` for shared memory limit (#18184) by Nithin Meganathan · 8 months ago
5a48912 [GPU] Add check for contractionOpInterface in setMatmulLoweringConfig (#18178) by Max191 · 8 months ago
ab12a4e [compiler][python] Make target_backends optional (#18151) by Boian Petkantchin · 8 months ago
f0e8cda [Codegen][IGEMM] Add new pass for IGEMM transformation with reshape propagation (#18161) by Max191 · 8 months ago
1fddcd6 [Codegen][CPU] Add MaterializeEncoding conversions for parallel generic ops (#18071) by Max191 · 8 months ago
50f18f1 [NFC][Encoding] Outline encodings in lit tests (#18165) by Max191 · 8 months ago
df3d588 Erase shape_assertion ops (#18167) by Jacques Pienaar · 8 months ago
6f88125 [Codegen] Lower `hal.interface.workgroup.size` in GPU codegen (#18145) by Nithin Meganathan · 8 months ago
2695fe9 [GlobalOpt] Switch to new pass generation tablegen definitions. (#18163) by Han-Chung Wang · 8 months ago
8545650 [MLIR][EmitC] Remove struct related macros from ops_emitc.h (#18081) by Simon Camphausen · 8 months ago
4bea50e [VMVX] Switch to new pass generation tablegen definitions (#18149) by Han-Chung Wang · 8 months ago
050a449 [CPU] Switch to new pass generation tablegen definitions (#18132) by Han-Chung Wang · 8 months ago
7ab66ff [Codegen][GPU] Move conversion to multi_mma to PackToIntrinsics (#18141) by Quinn Dawkins · 8 months ago
643f719 Add canonicalization pass for torch import (#18150) by Rob Suderman · 8 months ago
e9e24f8 [GPU] Follow the official naming convention for WMMA attributes. (#18147) by Han-Chung Wang · 8 months ago
235e110 [Codegen][GPU] Add pass to expand multi_mma op shapes to intrinsic layout (#18139) by Max191 · 8 months ago
352e05f Integrate LLVM at llvm/llvm-project@f7b2c2e4 (#18143) by Han-Chung Wang · 8 months ago
e341692 [Flow] Add pattern to canonicalize consecutive pads (#17878) by Quinn Dawkins · 8 months ago
c067270 [Flow] Fix error in CollapseDimensionsPass (#18128) by Ian Wood · 8 months ago
de679c9 Creating reusable command buffers in stream->hal lowering. (#18100) by Ben Vanik · 8 months ago
cc5566c [stream] SinkAwaitToFirstConsumer could break domination (#18131) by Rob Suderman · 8 months ago
4716f68 [Codegen][DT] Remove tensor.pad logics entirely from materialization. (#18130) by Han-Chung Wang · 8 months ago
18e86ab [Codegen][GPU] Add tiling interface implementation for iree_gpu.multi_mma (#17984) by Quinn Dawkins · 8 months ago
b76f89c [Codegen][GPU] Add producer fusion pattern to loop fusion and hoisting pass (#18118) by Quinn Dawkins · 8 months ago
82012e6 [GPU][NFC] Follow the official convention to define mfma/wmma attributes (#18127) by Han-Chung Wang · 8 months ago
71f1e20 Revert "Optimize `fp8` `linalg_ext.attention` by rework Q@K scaling" (#18112) by Stanley Winata · 8 months ago
5d8362c [Codegen][GPU] Add kernel config for LLVMGPUTileAndFuse for targeting mma (#18105) by Quinn Dawkins · 8 months ago
3d8ebc1 Integrate LLVM at llvm/llvm-project@b0329206 (#18102) by Han-Chung Wang · 8 months ago
98a9ca2 [Codegen] Support dynamic/scalable sizes when folding insert_slice into xfer_write (#17963) by Benjamin Maxwell · 8 months ago
1c50edd [LLVMGPU] Support i8 MFMA intrinsics in GPUTileAndFuse pipeline (#18104) by Max191 · 8 months ago
3a29039 [LLVMGPU] Remove redundant vector distribution tests (#18116) by Kunwar Grover · 8 months ago
b324f2a [VectorExt] Teach vectorization to to_layout (#18092) by Kunwar Grover · 8 months ago
e22b78d [GlobalOpt] Improve reshape/empty cleanup in transpose propagation (#17905) by Quinn Dawkins · 8 months ago
95fb6cb [LLVMCPU] Fix test (#18113) by Prashant Kumar · 8 months ago
113fae8 [LLVMCPU] Tile root and fuse consumer producer pass (#17804) by Prashant Kumar · 8 months ago
4a1f619 [Codegen][GPU] Add pass to unroll to native mma widths (#18101) by Quinn Dawkins · 8 months ago
2193406 Attaching pipeline layout to hal.interface.binding.subspan & co. (#18098) by Ben Vanik · 8 months ago
345e655 [EmitC][NFC] Use builder with default arguments for opaque_call ops (#17600) by Simon Camphausen · 8 months ago
ba9ea85 [LLVMGPU] Add im2col pipeline for convolution codegen (#18086) by Max191 · 8 months ago
74790bd Fix assert syntax for macOS compiler builds. by Scott Todd · 8 months ago