issue/843: QY机器支持scale_mm by xgqdut2016 · Pull Request #68 · qinyiqun/InfiniCore

xgqdut2016 · 2026-01-14T07:34:02Z

qinyiqun · 2026-01-14T11:01:19Z

scaled_mm未来会存放多种量化算法，所以kernel.cuh应该给一个比较明确的名字，以及需要注释

qinyiqun · 2026-01-14T11:02:11Z

这个文件应该是qy-gpu的，所以不应该用nvidia来命名

* issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

demo131 - multiple issues regarding quantization, qy, and so forth * issue/843: success per_channel_quant_int8 * issue/843: success qy quant * issue/843: modified quant * Add w8a8int8 performance tests * add infinicore op linear_w8a8i8 * w8a8 linear module functional nn * issue/843: QY-GPU Support Int8 scale_mm (#68) * issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh * fix parallel slic in w8 * w8: support multiple batch size * temp: 修改quantconfig处理 * fix format and delete redundancy code * fix format * fix format * fix format * Refactor: add new API alongside legacy interfaces with deprecation warnings * 添加w4 inifnicore相关内容，以及将Quantization config划入InfiniCore * 量化算子支持图 * solve cub version problem and fix code structure * fix format * demo131 - remove commented lines --------- Co-authored-by: xgqdut2016 <kenan_gewei@163.com> Co-authored-by: xgqdut2016 <140036308+xgqdut2016@users.noreply.github.com> Co-authored-by: wooway777 <wooway777@gmail.com>

* issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

issue/843: success qy scaled_mm

f40ba2d

qinyiqun reviewed Jan 14, 2026

View reviewed changes

issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

6ba0ffb

qinyiqun merged commit 188885a into qinyiqun:issue/843 Jan 15, 2026
2 of 5 checks passed

qinyiqun pushed a commit that referenced this pull request Jan 20, 2026

issue/843: QY-GPU Support Int8 scale_mm (#68)

f9262db

* issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

qinyiqun pushed a commit that referenced this pull request Jan 21, 2026

issue/843: QY-GPU Support Int8 scale_mm (#68)

37d5feb

* issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

qinyiqun pushed a commit that referenced this pull request Jan 27, 2026

issue/843: QY-GPU Support Int8 scale_mm (#68)

d413b3d

* issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

qinyiqun pushed a commit that referenced this pull request Feb 5, 2026

issue/843: QY-GPU Support Int8 scale_mm (#68)

f5076c3

* issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

xgqdut2016 added a commit to xgqdut2016/InfiniCore that referenced this pull request Mar 4, 2026

issue/843: QY-GPU Support Int8 scale_mm (qinyiqun#68)

04fc011

* issue/843: success qy scaled_mm * issue/843: modified kernel.cuh as per_channel_dequant_int8.cuh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

issue/843: QY机器支持scale_mm#68

issue/843: QY机器支持scale_mm#68
qinyiqun merged 2 commits intoqinyiqun:issue/843from
xgqdut2016:issue/843

xgqdut2016 commented Jan 14, 2026

Uh oh!

qinyiqun Jan 14, 2026

Uh oh!

qinyiqun Jan 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

xgqdut2016 commented Jan 14, 2026

Uh oh!

qinyiqun Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

qinyiqun Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants