[MetaSchedule][M4a] Schedule Rule: Multi-Level-Tiling #10043
- None on CPU
- [blockIdx.x, vthread.x, threadIdx.x] on GPU
use_tensor_core : bool
    Whether to apply tensor core wmma intrinsic for the computation
This is probably a broader question, but are we going to bake Tensor Core-specific logic into this core schedule rule? What if I want to support other matrix cores from Intel or AMD?
I feel the design should decouple backend-specific rules from the main "driver" that does the actual tiling, like the one introduced in this PR. cc @junrushao1994 @vinx13
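To make the suggested decoupling concrete, here is a minimal sketch in plain Python. It is not TVM code: `TensorIntrin`, `TilingPlan`, `register_intrin`, `plan_tiling`, and the intrinsic names are all hypothetical and exist only for illustration. The idea is that the tiling driver knows nothing about any particular vendor; it only looks up an intrinsic description by name and falls back to generic tiling when none is requested or the shape does not fit.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class TensorIntrin:
    """Hypothetical description of a hardware matrix intrinsic (m x n x k tile)."""
    name: str
    m: int
    n: int
    k: int


@dataclass(frozen=True)
class TilingPlan:
    """Result of the driver: the innermost tile and the intrinsic used, if any."""
    inner_tile: Optional[Tuple[int, int, int]]
    intrin: Optional[str]


# Hypothetical registry: each backend registers its own intrinsics here,
# so the driver below never mentions a specific vendor.
_REGISTRY: Dict[str, TensorIntrin] = {}


def register_intrin(intrin: TensorIntrin) -> None:
    _REGISTRY[intrin.name] = intrin


def plan_tiling(m: int, n: int, k: int,
                tensor_intrin: Optional[str] = None) -> TilingPlan:
    """Backend-agnostic driver: use the named intrinsic only if the shape fits."""
    if tensor_intrin is None:
        return TilingPlan(None, None)
    intrin = _REGISTRY[tensor_intrin]
    if m % intrin.m or n % intrin.n or k % intrin.k:
        # Shape is not divisible by the intrinsic tile: fall back to generic tiling.
        return TilingPlan(None, None)
    return TilingPlan((intrin.m, intrin.n, intrin.k), intrin.name)


# Two made-up backends register their intrinsics independently of the driver.
register_intrin(TensorIntrin("vendor_a_16x16x16", 16, 16, 16))
register_intrin(TensorIntrin("vendor_b_8x8x4", 8, 8, 4))

print(plan_tiling(64, 64, 64))
print(plan_tiling(64, 64, 64, "vendor_a_16x16x16"))
print(plan_tiling(24, 24, 12, "vendor_b_8x8x4"))
```

With this shape, supporting another matrix unit would mean one more `register_intrin` call rather than a change to the driver.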
Yes. We want to support auto tensorization for all hardware platforms. `use_tensor_core` is probably not a good name here; `use_tensor_intrin` might be a better one.
I brought this up because I saw the upcoming code in https://github.com/junrushao1994/tvm/blob/meta-schedule/src/meta_schedule/schedule_rule/multi_level_tiling.cc where tensorcore stuff is hardcoded all over the place. I expect non-trivial refactoring before we can land this on main.
Well, you are right. We might want to remove all the tensor core hard-coding here and focus on AutoScheduler alignment first during upstreaming.
73fe864 to 09f9d14
2a1de66 to 32940bc
* multi level tiling
* remove tensor core related code
* pylint
* fix

Co-authored-by: Junru Shao <junrushao1994@gmail.com>
This PR adds one of the schedule rules for MetaSchedule: multi-level tiling.
The rule does not support auto tensorization for now.
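As a rough illustration of what a multi-level tiling rule produces, the sketch below expands a tiling structure string into a loop order. It is plain Python, not the TVM implementation. It assumes the common notation in which `S` marks a level of spatial tiles and `R` a level of reduction tiles; the function name `tiled_loop_order` and the example structure `"SSRSRS"` are chosen here for illustration only.

```python
from typing import List


def tiled_loop_order(structure: str,
                     spatial: List[str],
                     reduction: List[str]) -> List[str]:
    """Expand a structure string such as "SSRSRS" into an ordered list of tiled loops.

    Each "S" emits one tile level of every spatial loop, each "R" one tile
    level of every reduction loop. Tile levels are numbered from the outside in.
    """
    s_level = 0
    r_level = 0
    order: List[str] = []
    for kind in structure:
        if kind == "S":
            order.extend(f"{name}{s_level}" for name in spatial)
            s_level += 1
        elif kind == "R":
            order.extend(f"{name}{r_level}" for name in reduction)
            r_level += 1
        else:
            raise ValueError(f"unknown tile kind {kind!r}; expected 'S' or 'R'")
    return order


# Matrix multiply C[i, j] += A[i, k] * B[k, j]: i and j are spatial, k is a reduction.
print(tiled_loop_order("SSRSRS", ["i", "j"], ["k"]))
# → ['i0', 'j0', 'i1', 'j1', 'k0', 'i2', 'j2', 'k1', 'i3', 'j3']
```

The sketch only fixes the order of the tiled loops. Choosing the tile sizes, and binding the outer spatial levels to thread axes on GPU, is the part left to the search.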
Co-authored-by: Junru Shao <junrushao1994@gmail.com>
Co-authored-by: Xiyou Zhou <xiyou@octoml.ai>
Co-authored-by: Bohan Hou <spectrometerh@gmail.com>
Co-authored-by: Siyuan Feng <Hzfengsy@sjtu.edu.cn>
Co-authored-by: Ruihang Lai <lairuihangdongdong@qq.com>
Co-authored-by: Wuwei Lin <wuwei@apache.org>
@junrushao1994 @Hzfengsy @comaniac