)]}'
{
  "commit": "a9e91ecb68b23241349acfbb2ede00afa4c107b7",
  "tree": "2184ecf847f3902e0b389ccd909d5216de49dfa3",
  "parents": [
    "b32ca40857b9e91ed99f91ba86a730a1feb9ff49"
  ],
  "author": {
    "name": "WMC",
    "email": "tnwilly@gmail.com",
    "time": "Tue Jun 09 14:35:50 2026 +0800"
  },
  "committer": {
    "name": "GitHub",
    "email": "noreply@github.com",
    "time": "Tue Jun 09 08:35:50 2026 +0200"
  },
  "message": "[LLVMGPU] Add sm_120 TileAndFuse config coverage, consolidate MMA sync pipeline tests (#24577)\n\n## Description\nAdds LLVMGPU TileAndFuse config-selection coverage for the CUDA `sm_120`\ntarget and consolidates MMA sync pipeline-lowering coverage across\nNVIDIA\nGPU architectures.\n\nThe `sm_120` config-selection test verifies that f16 matmul cases select\nthe\nNVIDIA MMA TileAndFuse path before any architecture-specific tuning is\nintroduced. The pipeline-lowering tests are kept target-independent\nbecause\nthey start from IR with an attached `mma_sync` intrinsic and only verify\nthat\nthe selected lowering pipeline generates the corresponding\n`nvgpu.mma.sync`\nops.\n\nCovered `sm_120` config-selection cases:\n- `f16xf16 -\u003e f32` matmul selects TileAndFuse with `NV_MMA_SYNC`\n- `f16xf16 -\u003e f16` matmul selects TileAndFuse with `NV_MMA_SYNC`\n- `f16xf16 -\u003e f32` matmul accumulate sets `convert_acc_gemm`\n\nCovered target-independent pipeline-lowering cases:\n- `f16xf16 -\u003e f32` TileAndFuse lowering generates `nvgpu.mma.sync`\n- `f16xf16 -\u003e f16` TileAndFuse lowering generates `nvgpu.mma.sync`\n- `f16xf16 -\u003e f32` VectorDistribute lowering generates `nvgpu.mma.sync`\n- `f16xf16 -\u003e f16` VectorDistribute lowering generates `nvgpu.mma.sync`\n\nThis is baseline regression coverage only; it does not introduce new\ntile\nheuristics or `sm_120`-specific tuning.\n\n## Test\n- `llvm-lit -v \\\n\ncompiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/config_tile_and_fuse_sm120.mlir\n\\\n\ncompiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/config_tile_and_fuse_sm80.mlir\n\\\n\ncompiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_tile_and_fuse_mma_sync.mlir\n\\\n\ncompiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_vector_distribute_mma_sync.mlir`\n\n\n```\n-- Testing: 4 tests, 4 workers --\nPASS: IREE :: src/iree/compiler/Codegen/LLVMGPU/test/NVVM/config_tile_and_fuse_sm120.mlir (1 of 4)\nPASS: IREE :: src/iree/compiler/Codegen/LLVMGPU/test/NVVM/config_tile_and_fuse_sm80.mlir (2 of 4)\nPASS: IREE :: src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_vector_distribute_mma_sync.mlir (3 of 4)\nPASS: IREE :: src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_tile_and_fuse_mma_sync.mlir (4 of 4)\n\nTesting Time: 0.14s\n\nTotal Discovered Tests: 4\n  Passed: 4 (100.00%)\n```\n\n---------\n\nSigned-off-by: weimin023 \u003ctnwilly@gmail.com\u003e",
  "tree_diff": [
    {
      "type": "modify",
      "old_id": "07b0ee68f2aa5fbbe83acd9d05dce05d1a8c9e2e",
      "old_mode": 33188,
      "old_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/BUILD.bazel",
      "new_id": "e5ded3077e318915a91ab96a7afc334e9b8f06a0",
      "new_mode": 33188,
      "new_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/BUILD.bazel"
    },
    {
      "type": "modify",
      "old_id": "1608e8badb8d9cc8777abaf6e3245925634b3d35",
      "old_mode": 33188,
      "old_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/CMakeLists.txt",
      "new_id": "5d58dde3a93e8096b51a82d300a8057b2ae53658",
      "new_mode": 33188,
      "new_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/CMakeLists.txt"
    },
    {
      "type": "add",
      "old_id": "0000000000000000000000000000000000000000",
      "old_mode": 0,
      "old_path": "/dev/null",
      "new_id": "89cba205c09e2a206a496cdbe3488b3d36abf6e0",
      "new_mode": 33188,
      "new_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/config_tile_and_fuse_sm120.mlir"
    },
    {
      "type": "modify",
      "old_id": "ed215abe5d0dcc19cfa8729818329846db7a19f2",
      "old_mode": 33188,
      "old_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/config_tile_and_fuse_sm80.mlir",
      "new_id": "69517d0c97ddea4e7c4875ec63838151d278f114",
      "new_mode": 33188,
      "new_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/config_tile_and_fuse_sm80.mlir"
    },
    {
      "type": "rename",
      "old_id": "bf861d4099519137177ef4ba11e167c0505d63de",
      "old_mode": 33188,
      "old_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_tile_and_fuse_sm80.mlir",
      "new_id": "5d99b252718ac75dd21602cd902d4ff100a4046c",
      "new_mode": 33188,
      "new_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_tile_and_fuse_mma_sync.mlir",
      "score": 97
    },
    {
      "type": "rename",
      "old_id": "543944c623ad202095139f9a35811824f8cb7769",
      "old_mode": 33188,
      "old_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_vector_distribute_sm80.mlir",
      "new_id": "8a80b02aa0edc3ca4fffbe9a37b9e159b6105068",
      "new_mode": 33188,
      "new_path": "compiler/src/iree/compiler/Codegen/LLVMGPU/test/NVVM/pipeline_vector_distribute_mma_sync.mlir",
      "score": 98
    }
  ]
}
