[S-TIR][Dlight] Add layered fall back strategy to handle missing attr `max_shared_memory_per_block` by cchung100m · Pull Request #19453 · apache/tvm

cchung100m · 2026-04-27T13:11:41Z

Hi Committers,

This PR tries to fix issue #19419. Any suggestions would be appreciated.

Root Cause

An auto-detected CUDA target might lack the `max_shared_memory_per_block` attribute, and reading it then raises a KeyError.
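The failure mode can be illustrated with a minimal sketch. It uses a plain dict as a stand-in for the target's attributes; the keys, values, and the helper name `read_shared_memory_limit` are assumptions for illustration, not code from the PR.

```python
# Plain dict standing in for the attributes of an auto-detected CUDA target.
# The keys and values here are illustrative assumptions.
detected_attrs = {"arch": "sm_80", "max_num_threads": 1024}


def read_shared_memory_limit(attrs):
    """Direct lookup with no fallback: raises KeyError when the attribute is absent."""
    return int(attrs["max_shared_memory_per_block"])


try:
    read_shared_memory_limit(detected_attrs)
except KeyError as err:
    # The attribute was never populated, so the lookup fails here.
    print(f"missing attribute: {err}")
```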

Solutions

Add a layered fallback strategy to handle the missing attr `max_shared_memory_per_block`.


gemini-code-assist

Code Review

This pull request introduces a robust fallback mechanism for determining the maximum shared memory per block across various GPU targets. It adds the get_max_shared_memory_per_block utility function in common_analysis.py, which provides default values for CUDA, ROCm, Metal, OpenCL, and Vulkan when the target attribute is not explicitly defined. The gemv and low_batch_gemv schedules were updated to utilize this new function. Feedback suggests moving the hardcoded mapping dictionary to a module-level constant to optimize performance and improve code structure.
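A rough sketch of what such a layered lookup might look like is shown below. It is not the merged implementation: the default values, the generic last-resort layer, and the `SimpleNamespace` stand-ins for a target object are all assumptions for illustration. Only the function name, the five backends, the module-level mapping, and the use of `logger.warning` come from this thread.

```python
import logging
from types import SimpleNamespace

logger = logging.getLogger(__name__)

# Illustrative per-backend defaults in bytes. These numbers are assumptions
# for this sketch, not the values merged in the PR.
_DEFAULT_MAX_SHARED_MEMORY_PER_BLOCK = {
    "cuda": 49152,
    "rocm": 65536,
    "metal": 32768,
    "opencl": 32768,
    "vulkan": 32768,
}

# Hypothetical last-resort value for target kinds not listed above.
_GENERIC_FALLBACK = 16384


def get_max_shared_memory_per_block(target):
    """Return the shared memory limit per block in bytes, using layered fallbacks."""
    # Layer 1: trust an explicitly defined target attribute.
    value = target.attrs.get("max_shared_memory_per_block")
    if value is not None:
        return int(value)

    # Layer 2: fall back to a per-backend default and warn about it.
    kind = target.kind.name
    if kind in _DEFAULT_MAX_SHARED_MEMORY_PER_BLOCK:
        default = _DEFAULT_MAX_SHARED_MEMORY_PER_BLOCK[kind]
        logger.warning(
            "max_shared_memory_per_block missing for %s; using default %d", kind, default
        )
        return default

    # Layer 3 (an assumption of this sketch): a conservative generic value.
    logger.warning(
        "max_shared_memory_per_block missing for unknown kind %s; using %d",
        kind,
        _GENERIC_FALLBACK,
    )
    return _GENERIC_FALLBACK


# Stand-in objects that mimic only the two fields the sketch reads.
explicit = SimpleNamespace(
    kind=SimpleNamespace(name="cuda"),
    attrs={"max_shared_memory_per_block": 101376},
)
detected = SimpleNamespace(kind=SimpleNamespace(name="cuda"), attrs={})

print(get_max_shared_memory_per_block(explicit))  # explicit attribute wins
print(get_max_shared_memory_per_block(detected))  # per-backend default is used
```

Keeping the mapping at module level, as the review feedback suggests, avoids rebuilding the dictionary on every call and gives the defaults a single place to be audited.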

cchung100m · 2026-04-28T12:05:50Z

Hi @tlopex @mshr-h

This PR tries to fix issue #19419. Any suggestions would be appreciated. 😄

cchung100m · 2026-04-29T15:47:29Z

Thanks to @mshr-h 😄

[S-TIR][Dlight] Add layered fall back strategy to handle missing attr…

6724198

… 'max_shared_memory_per_block'

gemini-code-assist Bot reviewed Apr 27, 2026


Comment thread python/tvm/s_tir/dlight/analysis/common_analysis.py Outdated

cchung100m added 2 commits April 27, 2026 21:34

[S-TIR][Dlight] Refactor the get_max_shared_memory_per_block

cacc811

[S-TIR][Dlight] Add test cases

49d7848

cchung100m marked this pull request as ready for review April 27, 2026 15:59

mshr-h reviewed Apr 28, 2026


Comment thread python/tvm/s_tir/dlight/analysis/common_analysis.py Outdated

cchung100m added 2 commits April 28, 2026 21:09

[S-TIR][Dlight] Add logger.warning(...)

4bd725a

[S-TIR][Dlight] Fix import error

4393d72

cchung100m marked this pull request as draft April 28, 2026 13:40

cchung100m marked this pull request as ready for review April 28, 2026 15:02

mshr-h approved these changes Apr 29, 2026


mshr-h merged commit 7ecf466 into apache:main Apr 29, 2026
11 checks passed

cchung100m deleted the issue-19419 branch April 29, 2026 15:46

ysh329 mentioned this pull request May 6, 2026

[Release] v0.24.0 Release Candidate Notes #19513

Closed


mshr-h merged 5 commits into apache:main from cchung100m:issue-19419


2 participants
