[TensorIR] Cross-Thread Reduction#9360
Conversation
|
I will do another round of review next week! |
|
Will do the review tomorrow |
|
Finally got some time for a detailed code review! Will take over this PR and try to get it merged! |
3ca0e56 to
a560a9b
Compare
|
Did a pass over analysis and misc changes |
|
Love the PR and very comprehensively tested implementation ❤️ |
4ac6c65 to
6a162d7
Compare
|
@Hzfengsy @MasterJH5574 Should be good to go. Please take another look :-) |
|
@Hzfengsy Could you take another look? Junru's polishing looks very good, but I myself as the author cannot approve this PR 😅. |
Hzfengsy
left a comment
There was a problem hiding this comment.
LGTM. Thanks @MasterJH5574 for such a great effort on this PR.
* [TensorIR] Cross-Thread Reduction * Code revision on analysis and misc * Refactor TransformReductionBlock * Refactor code organization * Address comment * Use `std::make_tuple` Co-authored-by: Junru Shao <junrushao1994@gmail.com>
* [TensorIR] Cross-Thread Reduction * Code revision on analysis and misc * Refactor TransformReductionBlock * Refactor code organization * Address comment * Use `std::make_tuple` Co-authored-by: Junru Shao <junrushao1994@gmail.com>
* [TensorIR] Cross-Thread Reduction * Code revision on analysis and misc * Refactor TransformReductionBlock * Refactor code organization * Address comment * Use `std::make_tuple` Co-authored-by: Junru Shao <junrushao1994@gmail.com>
* [TensorIR] Cross-Thread Reduction * Code revision on analysis and misc * Refactor TransformReductionBlock * Refactor code organization * Address comment * Use `std::make_tuple` Co-authored-by: Junru Shao <junrushao1994@gmail.com>
* [TensorIR] Cross-Thread Reduction * Code revision on analysis and misc * Refactor TransformReductionBlock * Refactor code organization * Address comment * Use `std::make_tuple` Co-authored-by: Junru Shao <junrushao1994@gmail.com>
|
hi @MasterJH5574 , do we currently only support block reduction with warp level |
|
I've found the related implementation, thanks. |
Hi community! This PR adds cross-thread reduction support for TensorIR. After this PR, cross-thread reduction patterns in TIR can be successfully lowered.
cc @Hzfengsy @vinx13 @comaniac @junrushao1994 @jcf94 @jinhongyii @spectrometerHBH @tqchen
Co-authored-by: Wuwei Lin wuwei@apache.org
Co-authored-by: Junru Shao junrushao1994@gmail.com
Co-authored-by: Siyuan Feng Hzfengsy@sjtu.edu.cn
Co-authored-by: Hongyi Jin 3231950289@qq.com
Co-authored-by: Bohan Hou 32121147+spectrometerHBH@users.noreply.github.com