Sign in
opensecura
/
3p
/
openxla
/
iree
/
c6924b619d22276d5148ca5c608b986bdf51b217
/
compiler
c6924b6
[EmitC] Remove array related macros from `ops_emitc.h` (#18335)
by Simon Camphausen
· 7 months ago
8da4564
Revert "[Flow] Convert from tensor.cast to flow.tensor.reshape early …(#18256)" (#18331)
by Nirvedh Meshram
· 7 months ago
c44d29b
[compiler] Make cuda/hip/vulkan target cl options consistent (#17710)
by Lei Zhang
· 7 months ago
d1ccc8c
[Codegen] Disable transform dialect jit by default (#18305)
by Nithin Meganathan
· 7 months ago
292f2d4
[Stream] fix ordering issue in EmplaceAllocations (#18321)
by Nirvedh Meshram
· 7 months ago
8e42839
[Codegen][GPU] Add pass to combine value_barrier ops (#18298)
by Kunwar Grover
· 7 months ago
6ca0613
Integrate LLVM at llvm/llvm-project@7aa22f013e24d20291aad745368ff907baa9dfa4 (#18286)
by Kunwar Grover
· 7 months ago
1c0c5a6
[Flow] Convert from tensor.cast to flow.tensor.reshape early (#18256)
by Nirvedh Meshram
· 7 months ago
8dd1db3
Bubble expand shapes through `AttentionOp`s (#18074)
by Ian Wood
· 7 months ago
6a92fb7
Add onnx to InputType enum (#18310)
by Tim Gymnich
· 7 months ago
7c8fedc
Remove PyYAML dependency from Python bindings. (#18262)
by Scott Todd
· 7 months ago
3a06719
[compiler][stream] Avoid circular dependencies between partitions in execution scheduling (#18217)
by Boian Petkantchin
· 7 months ago
cea581f
Move LinalgQuantized* passes to GlobalOptimization (#18287)
by Quinn Dawkins
· 7 months ago
3af05b9
[Flow] Make CollapseDimensions iterative (#18203)
by Ian Wood
· 7 months ago
137e365
[Codegen][GPU] Add pass to annotate memory spaces on allocations (#18251)
by Quinn Dawkins
· 7 months ago
5beb9ad
[LinalgExt] Better doc for FP8 attention clamping. (#18301)
by Stanley Winata
· 7 months ago
5ba9a89
[LinalgExt] Reland QK scaling for attention decomp and further optimizations of it. (#18293)
by Stanley Winata
· 7 months ago
ab0d4c6
[Codegen][GPU] Rework scf.forall fusion to support different thread counts (#18280)
by Quinn Dawkins
· 7 months ago
87084d5
[Codegen][GPU] Remove insert_slice semantics from shuffle_tensor (#18279)
by Quinn Dawkins
· 7 months ago
95d5562
[vulkan] Update default RDNA GPU subgroup size to 32 (#18207)
by Nithin Meganathan
· 7 months ago
30040c7
[Codegen] Add pass to tile and distribute workgroups using `scf.forall` op. (#18266)
by MaheshRavishankar
· 7 months ago
ed512a2
[HAL][Loader] Switch to tablegen pass generation (#18282)
by Marius Brehler
· 7 months ago
da958c4
[HAL][Inline] Switch to tablegen pass generation (#18278)
by Marius Brehler
· 7 months ago
f224bc8
[ConstEval] Switch to tablegen pass generation (#18228)
by Marius Brehler
· 7 months ago
d5c6370
Make `iree_gpu.value_barrier` accept multiple operands (and return multiple results) (#18192)
by Kunwar Grover
· 7 months ago
aeda149
[InputConversion] Switch to tablegen pass generation (#18245)
by Marius Brehler
· 7 months ago
10ba28d
[Codegen][GPU] Add kernel config for LLVMGPUTileAndFuse (#17791)
by Quinn Dawkins
· 7 months ago
7cf3fc6
[Codegen][GPU] Fix allocation space in iree_gpu.shuffle_tensor lowering (#18250)
by Quinn Dawkins
· 7 months ago
a884b93
Bump LLVM to llvm/llvm-project@ddda37a (#18258)
by Stanley Winata
· 7 months ago
0e152d2
[Codegen] Add `DeviceMappingAttr` that maps to workgroup IDs. (#18264)
by MaheshRavishankar
· 7 months ago
40258db
[CodeGen][DT] Make the TypeConverter carry targetAttr info. (#18242)
by Han-Chung Wang
· 7 months ago
d25712c
[VMVX] Switch to tablegen pass generation (#18248)
by Marius Brehler
· 7 months ago
8a1d78b
[Codegen][CPU] Enable scalable transfer lowerings (#18170)
by Benjamin Maxwell
· 7 months ago
551cd54
[TOSA] Switch to tablegen pass generation (#18227)
by Marius Brehler
· 7 months ago
878a99b
[torch] Switch to tablegen pass generation (#18226)
by Marius Brehler
· 7 months ago
41f1f49
[Codegen] Add a pass option to control input -> dest pattern (#18240)
by Quinn Dawkins
· 7 months ago
78f54c2
[Codegen][GPU] Add a pass for basic distribution verification (#18236)
by Quinn Dawkins
· 7 months ago
66ed138
[CPU] Make VectorPreProcStrategy consider undefined behaviors (#18146)
by lialan
· 8 months ago
b144e90
[test] Check depthwise conv is vectorized in test (NFC) (#18225)
by Benjamin Maxwell
· 8 months ago
53a7bc4
Replace `iree_compiler::VscaleRange` with `vector::VscaleRange` (NFC) (#18218)
by Benjamin Maxwell
· 8 months ago
300af39
[codegen] Add max_workgroup_counts to TargetWgpAttr (#17771)
by Krzysztof Drewniak
· 8 months ago
7d60397
[LinalgExt] Switch to new pass generation tablegen definitions. (#18216)
by Han-Chung Wang
· 8 months ago
fe638b0
[Codegen][CPU] Eliminate all-true vector masks after vectorization (#18190)
by Benjamin Maxwell
· 8 months ago
c71fe1a
[WGSL][NFC] Switch to new pass generation tablegen definitions. (#18215)
by Han-Chung Wang
· 8 months ago
7cac1b2
[SPIRV] Switch to new pass generation tablegen definitions. (#18214)
by Han-Chung Wang
· 8 months ago
a72e78b
[LLVMGPU] Switch to new pass generation tablegen definitions. (#18213)
by Han-Chung Wang
· 8 months ago
0c2f51b
[LLVMGPU] Drop WorkgroupSpecializationPass (#18212)
by Nirvedh Meshram
· 8 months ago
868f41e
[ROCM] fix layout for WMMA_F16_16x16x16_F16 intrinsic (#18206)
by Nirvedh Meshram
· 8 months ago
08583d5
Bump LLVM to llvm/llvm-project@6b7afaa9db8f (#18197)
by Stanley Winata
· 8 months ago
b297d5b
[Codegen][GPU] Add bank conflict reduction pass to TileAndFuse (#18204)
by Quinn Dawkins
· 8 months ago
2ea9b14
[Codegen] Add support for memref.expand_shape to propagation util (#18202)
by Quinn Dawkins
· 8 months ago
9c951ca
[Flow] Generalize horizontal contraction fusion to cover more cases. (#17880)
by MaheshRavishankar
· 8 months ago
7812c77
[Codegen][GPU] Add support for all other intrinsics to TileAndFuse (#18179)
by Quinn Dawkins
· 8 months ago
3901e62
[GPU][NFC] Update the comment of intrinsic format. (#18194)
by Han-Chung Wang
· 8 months ago
ad2f0f8
[LLVMCPU] Add option `onlyFuseProducerInputOperands` to tileRootFuseConsumerProducer Pass (#18114)
by Prashant Kumar
· 8 months ago
6ac6be6
[GlobalOpt] Improve unary elementwise propagation to consider broadcasted operands (#17903)
by Quinn Dawkins
· 8 months ago
8dc6820
Adding simplified HAL dispatch methods. (#18189)
by Ben Vanik
· 8 months ago
3483893
[CodeGen][Common] Switch to new pass generation tablegen definitions. (#18166)
by Han-Chung Wang
· 8 months ago
49198a9
[EmitC] Remove unsused code from builders (#18191)
by Simon Camphausen
· 8 months ago
67b0e25
Remove leftovers from the old CppEmitter (#18175)
by Marius Brehler
· 8 months ago
b06bf6a
[Codegen] Query `#iree_gpu.target` for shared memory limit (#18184)
by Nithin Meganathan
· 8 months ago
5a48912
[GPU] Add check for contractionOpInterface in setMatmulLoweringConfig (#18178)
by Max191
· 8 months ago
ab12a4e
[compiler][python] Make target_backends optional (#18151)
by Boian Petkantchin
· 8 months ago
f0e8cda
[Codegen][IGEMM] Add new pass for IGEMM transformation with reshape propagation (#18161)
by Max191
· 8 months ago
1fddcd6
[Codegen][CPU] Add MaterializeEncoding conversions for parallel generic ops (#18071)
by Max191
· 8 months ago
50f18f1
[NFC][Encoding] Outline encodings in lit tests (#18165)
by Max191
· 8 months ago
df3d588
Erase shape_assertion ops (#18167)
by Jacques Pienaar
· 8 months ago
6f88125
[Codegen] Lower `hal.interface.workgroup.size` in GPU codegen (#18145)
by Nithin Meganathan
· 8 months ago
2695fe9
[GlobalOpt] Switch to new pass generation tablegen definitions. (#18163)
by Han-Chung Wang
· 8 months ago
8545650
[MLIR][EmitC] Remove struct related macros from ops_emitc.h (#18081)
by Simon Camphausen
· 8 months ago
4bea50e
[VMVX] Switch to new pass generation tablegen definitions (#18149)
by Han-Chung Wang
· 8 months ago
050a449
[CPU] Switch to new pass generation tablegen definitions (#18132)
by Han-Chung Wang
· 8 months ago
7ab66ff
[Codegen][GPU] Move conversion to multi_mma to PackToIntrinsics (#18141)
by Quinn Dawkins
· 8 months ago
643f719
Add canonicalization pass for torch import (#18150)
by Rob Suderman
· 8 months ago
e9e24f8
[GPU] Follow the official naming convention for WMMA attributes. (#18147)
by Han-Chung Wang
· 8 months ago
235e110
[Codegen][GPU] Add pass to expand multi_mma op shapes to intrinsic layout (#18139)
by Max191
· 8 months ago
352e05f
Integrate LLVM at llvm/llvm-project@f7b2c2e4 (#18143)
by Han-Chung Wang
· 8 months ago
e341692
[Flow] Add pattern to canonicalize consecutive pads (#17878)
by Quinn Dawkins
· 8 months ago
c067270
[Flow] Fix error in CollapseDimensionsPass (#18128)
by Ian Wood
· 8 months ago
de679c9
Creating reusable command buffers in stream->hal lowering. (#18100)
by Ben Vanik
· 8 months ago
cc5566c
[stream] SinkAwaitToFirstConsumer could break domination (#18131)
by Rob Suderman
· 8 months ago
4716f68
[Codegen][DT] Remove tensor.pad logics entirely from materialization. (#18130)
by Han-Chung Wang
· 8 months ago
18e86ab
[Codegen][GPU] Add tiling interface implementation for iree_gpu.multi_mma (#17984)
by Quinn Dawkins
· 8 months ago
b76f89c
[Codegen][GPU] Add producer fusion pattern to loop fusion and hoisting pass (#18118)
by Quinn Dawkins
· 8 months ago
82012e6
[GPU][NFC] Follow the official convention to define mfma/wmma attributes (#18127)
by Han-Chung Wang
· 8 months ago
71f1e20
Revert "Optimize `fp8` `linalg_ext.attention` by rework Q@K scaling" (#18112)
by Stanley Winata
· 8 months ago
5d8362c
[Codegen][GPU] Add kernel config for LLVMGPUTileAndFuse for targeting mma (#18105)
by Quinn Dawkins
· 8 months ago
3d8ebc1
Integrate LLVM at llvm/llvm-project@b0329206 (#18102)
by Han-Chung Wang
· 8 months ago
98a9ca2
[Codegen] Support dynamic/scalable sizes when folding insert_slice into xfer_write (#17963)
by Benjamin Maxwell
· 8 months ago
1c50edd
[LLVMGPU] Support i8 MFMA intrinsics in GPUTileAndFuse pipeline (#18104)
by Max191
· 8 months ago
3a29039
[LLVMGPU] Remove redundant vector distribution tests (#18116)
by Kunwar Grover
· 8 months ago
b324f2a
[VectorExt] Teach vectorization to to_layout (#18092)
by Kunwar Grover
· 8 months ago
e22b78d
[GlobalOpt] Improve reshape/empty cleanup in transpose propagation (#17905)
by Quinn Dawkins
· 8 months ago
95fb6cb
[LLVMCPU] Fix test (#18113)
by Prashant Kumar
· 8 months ago
113fae8
[LLVMCPU] Tile root and fuse consumer producer pass (#17804)
by Prashant Kumar
· 8 months ago
4a1f619
[Codegen][GPU] Add pass to unroll to native mma widths (#18101)
by Quinn Dawkins
· 8 months ago
2193406
Attaching pipeline layout to hal.interface.binding.subspan & co. (#18098)
by Ben Vanik
· 8 months ago
345e655
[EmitC][NFC] Use builder with default arguments for opaque_call ops (#17600)
by Simon Camphausen
· 8 months ago
ba9ea85
[LLVMGPU] Add im2col pipeline for convolution codegen (#18086)
by Max191
· 8 months ago
74790bd
Fix assert syntax for macOS compiler builds.
by Scott Todd
· 8 months ago
Next »