1. ae70815 Pipeclean byo_llvm, add docs and support requested features. (#14164) by Stella Laurenzo · 1 year, 10 months ago
  2. 88d92bf [LLVMCPU] Drop unit dims on memory transfers (#13340) by Jerry Wu · 1 year, 10 months ago
  3. 6c016ca StableHLOToStableHLO rewriter for RngBitGenerator f32 conversion. (#14162) by jvstokes · 1 year, 10 months ago
  4. 59c17a3 Decomposition of `chlo.tan` for `complex` types (#14190) by Rob Suderman · 1 year, 10 months ago
  5. 7f9ddc2 Fixing fmaxf for CPU 16 bit (#13808) by Ranvir Singh Virk · 1 year, 10 months ago
  6. 14452a1 Strip IREE prefix from tablegen doc files. (#14189) by Scott Todd · 1 year, 10 months ago
  7. 389e9cb Delete old e2e tests with massive checked-in IR files (#14179) by Geoffrey Martin-Noble · 1 year, 10 months ago
  8. 7ca13e3 Add MLIR dialects to website "Reference" section. (#14117) by Scott Todd · 1 year, 10 months ago
  9. 7758970 Fix memory effects of transform flow extension ops (#14185) by Oleksandr "Alex" Zinenko · 1 year, 10 months ago
  10. 72995e1 avoid Clang <= 10 bug with -fno-lax-vector-conversions (#14168) by bjacob · 1 year, 10 months ago
  11. 928a590 Integrate llvm-project at 88f07a31 (#14165) by Matthias Springer · 1 year, 10 months ago
  12. ea67a76 Extract a generic CopyMapping logic from the MatmulTensorCore strategy. (#14083) by Nicolas Vasilache · 1 year, 10 months ago
  13. 3459833 Create dispatches for scalar computation using program slices. (#13711) by MaheshRavishankar · 1 year, 10 months ago
  14. 5757feb [NFC] Drop format_diff.py script (#14180) by Jakub Kuderski · 1 year, 10 months ago
  15. bb10bfa Regenerate bleeding-edge images (#14177) by Jerry Wu · 1 year, 10 months ago
  16. d322009 Correctly tag matmul tests requiring sm80 (#14173) by Geoffrey Martin-Noble · 1 year, 10 months ago
  17. e49a1de Update lint script to run black (#14178) by Jakub Kuderski · 1 year, 10 months ago
  18. 6db40b3 Add .git-blame-ignore-revs file for large-scale changes (#14175) by Jakub Kuderski · 1 year, 10 months ago
  19. b700ddb [CI] Do not run black when no files changed (#14172) by Jakub Kuderski · 1 year, 10 months ago
  20. e737184 Tag CTS tests with the driver they use (#14170) by Geoffrey Martin-Noble · 1 year, 10 months ago
  21. 7e1326c Update scripting to use T4 for GPU runners (#14171) by Geoffrey Martin-Noble · 1 year, 10 months ago
  22. be24f02 Use Black to format Python files (#14161) by Jakub Kuderski · 1 year, 10 months ago
  23. e6d6c39 Add diagrams to ml-frameworks website pages. (#14145) by Scott Todd · 1 year, 10 months ago
  24. 39d45a1 Add riscv64 baremetal runtime (#14105) by Yun Hsiang · 1 year, 10 months ago
  25. cd0d1f1 [LLVMCPU] Add multi lowering_config support for SplitReductionPass. (#14061) by Han-Chung Wang · 1 year, 10 months ago
  26. 089f3d6 Check timestamp instead of canceling benchmark trigger (#14149) by Jerry Wu · 1 year, 10 months ago
  27. 347c550 [CPU] Optimize transposed masks (#14141) by Diego Caballero · 1 year, 10 months ago
  28. bbf1c1e Fix `stablehlo.convolution` padding with `complex<f32>` (#14123) by Rob Suderman · 1 year, 10 months ago
  29. 84b379e Fix the ukernels system build on older toolchains. (#14155) by bjacob · 1 year, 10 months ago
  30. 05c9b0d Use cuGetProcAddress to load CUDA entry points (#14056) by Trevor Morris · 1 year, 10 months ago
  31. 89a41d9 Adds memory mapping and alignment controls to VmModule construction. (#14153) by Stella Laurenzo · 1 year, 10 months ago
  32. d478a0e Introduce `iree_codegen.query_tile_sizes` replacing `vmvx.query_tile_sizes` (#14147) by bjacob · 1 year, 10 months ago
  33. 0c9c8ae [CPU] Scalarize vector loads used in a scalar fashion (#14150) by Diego Caballero · 1 year, 10 months ago
  34. b662b5a [SPIRV] Add pattern for breaking down vector.bitcast (#14151) by Quinn Dawkins · 1 year, 10 months ago
  35. 413e3a0 Add transform dialect registration in CUDA target. (#14146) by frgossen · 1 year, 10 months ago
  36. 8ad1c0d Fix transform dialect registration by Frederik Gossen · 1 year, 10 months ago
  37. 59fd431 Make iree-lld tool optional in python bindings (#14144) by Eugene Zhulenev · 1 year, 10 months ago
  38. ffeaa26 Use macOS 13 for x86_64 macOS runners (#14136) by Lei Zhang · 1 year, 10 months ago
  39. df119bd Test the bring-your-own-LLVM path. (#14035) by bjacob · 1 year, 10 months ago
  40. 9303825 [CPU] Introduce TilingConfig class (#14082) by Diego Caballero · 1 year, 10 months ago
  41. 77eda48 Update docs to explain benchmark presets and usages (#14095) by Jerry Wu · 1 year, 10 months ago
  42. 84427e6 Refresh website glossary with entries for StableHLO + OpenXLA. (#14134) by Scott Todd · 1 year, 10 months ago
  43. ceedf20 Fix cmake typo that disabled many e2e tests. (#14137) by bjacob · 1 year, 10 months ago
  44. db85f2d [NFC] Reformat python integrate scripts (#14132) by Jakub Kuderski · 1 year, 10 months ago
  45. e9061b3 Revert "Add VmModule.mmap() to Python API. (#14124)" by Stella Laurenzo · 1 year, 10 months ago
  46. 60b0764 Allow defining `IREE_HOST_SIZE_T` to other types. (#14040) by Scott Todd · 1 year, 10 months ago
  47. 2e4487d [iree-dialects][test] Remove dead code from the transfrom script (#14127) by qcolombet · 1 year, 10 months ago
  48. 3345b76 Add VmModule.mmap() to Python API. (#14124) by Stella Laurenzo · 1 year, 10 months ago
  49. 5e6c0ce Revert NVPTX buggy commit in LLVM (#14125) by Diego Caballero · 1 year, 10 months ago
  50. 3c7c6c2 Fix `stream.tensor.constant` for `complex<f32>` crash (#14120) by Rob Suderman · 1 year, 10 months ago
  51. eb26576 Use always-enabled vmvx targets in assign_target_devices test. (#14119) by Scott Todd · 1 year, 10 months ago
  52. aef121e Add iota->broadcast case to iota->sort->slice mapping for TopK (#14043) by NatashaKnk · 1 year, 10 months ago
  53. 6034421 Remove old / no longer needed flags from mnist_training notebook. (#14118) by Scott Todd · 1 year, 10 months ago
  54. 7ed4f4b Adding a console tracing provider and support for external ones. (#14113) by Ben Vanik · 1 year, 10 months ago
  55. 8af37c0 [ci] Update Xcode command line tools path on x86_64 macOS runners (#14108) by Lei Zhang · 1 year, 10 months ago
  56. 3546f2a [StableHLO] Use stablehlo submodule (#14094) by Jakub Kuderski · 1 year, 10 months ago
  57. a219cb5 Remove IREE apply_patterns op (#14054) by Matthias Springer · 1 year, 10 months ago
  58. 11388b8 [metal] Implement a Metal HAL driver (#12713) by Lei Zhang · 1 year, 10 months ago
  59. 028acfb [metal] Improve error handling in command buffer create/destroy by Lei Zhang · 1 year, 10 months ago
  60. 7c82a3d [metal] Avoid resource set leak in queue execution by Lei Zhang · 1 year, 10 months ago
  61. 3097b3a [metal] Use pipeline layout to query set and binding count by Lei Zhang · 1 year, 10 months ago
  62. 9c384b6 [metal] Unify pipeline object creation in MTLLibrary and source paths by Lei Zhang · 1 year, 10 months ago
  63. f7d9642 [metal] Use the last command buffer for semaphore signaling by Lei Zhang · 1 year, 10 months ago
  64. 1fa5da6 [metal] Use one resource set to handle queue execution resources by Lei Zhang · 1 year, 10 months ago
  65. eba9f5a [metal] Manage staging buffer refcount in command buffer lifetime by Lei Zhang · 1 year, 10 months ago
  66. d7fb981 [metal] Use the kernel layout to query push constant count by Lei Zhang · 1 year, 10 months ago
  67. e8679ad [metal] NFC: Make code in buffer fill less branchy by Lei Zhang · 1 year, 10 months ago
  68. ec93093 [metal] Cache a command buffer descriptor in device to deduplicate by Lei Zhang · 1 year, 10 months ago
  69. 02c14aa [metal] Improve order in device creation by Lei Zhang · 1 year, 10 months ago
  70. 52a8d0c [metal] Return early with IREE macro to flatten status check by Lei Zhang · 1 year, 10 months ago
  71. 4faf549 [metal] Check and return failure earlier in buffer allocation by Lei Zhang · 1 year, 10 months ago
  72. bba889c [metal] Keep track of queue in buffer construction for macOS by Lei Zhang · 1 year, 10 months ago
  73. 54818a3 [metal] Order host_allocator and const-ify various query APIs by Lei Zhang · 1 year, 11 months ago
  74. 6f32048 [metal] Use string view and byte span for compilation functions by Lei Zhang · 1 year, 11 months ago
  75. d90e80f [metal] Use separate lists for different descriptor sets by Lei Zhang · 2 years ago
  76. d4aef98 [metal] NFC: Create struct for state-related command buffer fields by Lei Zhang · 2 years ago
  77. e32cfa6 [metal] Add some TODOs for expected changes in command buffer by Lei Zhang · 2 years ago
  78. aba08b4 [metal] Add technical details README file by Lei Zhang · 2 years ago
  79. f598fd2 [metal] Use staging buffer for argument buffers and update sources by Lei Zhang · 2 years ago
  80. 0ec791e [metal] Construct argument buffer at dispatch recording time by Lei Zhang · 2 years ago
  81. 4307ba2 [metal] Drop some unnecessary error checks by Lei Zhang · 2 years ago
  82. c0ad0ea [metal] Switch to use command segments for recording command buffer by Lei Zhang · 2 years ago
  83. ffb40b1 [metal] Signal and wait MTLEvent for execution only barriers by Lei Zhang · 2 years ago
  84. 30d52cd [metal] Use MTLEvent for synchronizing when switching encoders by Lei Zhang · 2 years ago
  85. 2412d43 [metal] Implement buffer invalidate range for managed storage by Lei Zhang · 2 years ago
  86. 4ce8551 [metal] Tidy up buffer compatibility and storage mode management by Lei Zhang · 2 years ago
  87. ab5fed3 [metal] Use iree_status_t annotation for compute pipeline errors by Lei Zhang · 2 years ago
  88. c4d9cda [metal] Use resource set to manage wait/signal semaphores by Lei Zhang · 2 years ago
  89. a56531f [metal] Return iree_ok_status for device queue flush by Lei Zhang · 2 years ago
  90. f231e81 [metal] Drop IREE HAL event via Metal fence implementation by Lei Zhang · 2 years ago
  91. ab848fc [metal] Enable real async execution on GPU by Lei Zhang · 2 years, 1 month ago
  92. 472bbbe [metal] Support different target platforms including iOS by Lei Zhang · 2 years, 1 month ago
  93. 59e67a7 [metal] Enable compiling to Metal library when possible by Lei Zhang · 2 years, 1 month ago
  94. 6e93f00 [metal] Retain MTLSharedEvent in wait/signal command buffers by Lei Zhang · 2 years, 1 month ago
  95. 696145c [metal] Upload initial data for non-shared storage mode buffers by Lei Zhang · 2 years, 1 month ago
  96. 0adc461 [metal] Use MTLFence to synchronize encoders in command buffer by Lei Zhang · 2 years, 1 month ago
  97. 3f64a11 [metal] Add option for strong resource reference in command buffers by Lei Zhang · 2 years, 1 month ago
  98. a658aa5 [metal] Specify dispatch queue QoS class as USER_INITIATED by Lei Zhang · 2 years, 1 month ago
  99. 6c2477b [metal] Add option to enable resource hazard tracking by Lei Zhang · 2 years, 1 month ago
  100. 64588d6 [metal] Support profiling via GPU frame captures by Lei Zhang · 2 years, 1 month ago