Uh oh!

There was an error while loading. Please reload this page.

ROCm / vllm Public

forked from vllm-project/vllm

Notifications You must be signed in to change notification settings
Fork 49
Star 122

Code
Issues 2
Pull requests 44
Actions
Projects
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Security and quality
Insights

Pull requests: ROCm/vllm

Labels 16 Milestones 0

New pull request New

44 Open 963 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[ROCm][Kernel] W4A16 skinny GEMM: hand-asm kernel for gfx1151

#1045 opened Jul 1, 2026 by mgehre-amd • Draft

[bench] vit_attention/bench.py: add --backend, default to running all

#1044 opened Jul 1, 2026 by mgehre-amd • Draft

feat(mi450): switch to nightlies index, update version pins, bake gfx1250 env vars

#1043 opened Jun 30, 2026 by kiran-thumma Collaborator

Loading…

3 tasks

Marcusr/test staging cleanup

#1042 opened Jun 30, 2026 by marcusr-amd • Draft

5 tasks

[CI] Upload staging wheels to S3 for PR/dispatch builds

#1040 opened Jun 30, 2026 by marcusr-amd

Loading…

5 tasks

455 wip rebased

#1039 opened Jun 30, 2026 by danichan-mkm

Loading…

gpt-oss gfx1250 ATOM-parity perf patches

#1035 opened Jun 29, 2026 by dllehr-amd Collaborator

Loading…

[CI] Re-enable performance test job

#1033 opened Jun 29, 2026 by marcusr-amd • Draft

5 tasks

[ROCm] Split skinny_gemms_int8.cu into per-N translation units

#1028 opened Jun 26, 2026 by marcusr-amd • Draft

3 of 5 tasks

tune Qwen3-VL-4B prefill unified-attention on gfx1150

#1024 opened Jun 26, 2026 by qingxuamd

Loading…

Fix GLM 5.2 mxfp4 MTP loading issue

#1022 opened Jun 25, 2026 by amd-xiaoyu12

Loading…

[ROCm][MoE] W4A16 MoE routing-distribution benchmark suite for the gfx11 prefill GEMM

#1020 opened Jun 25, 2026 by roberteg16 • Draft

[ROCm][MoE] Custom W4A16 MoE prefill WMMA GEMM for gfx11 (default-on)

#1015 opened Jun 22, 2026 by roberteg16

Loading…

optimize TTFT qwen3-vl

#1006 opened Jun 15, 2026 by qingxuamd

Loading…

455 war room findings

#1001 opened Jun 12, 2026 by jpvillam-amd

Loading…

MoE: Grouped Triton GEMM for TTFT improvements

#970 opened May 26, 2026 by mgehre-amd • Draft

[ROCm][MoE] Modular MoE: alias fused_out with output to skip finalize copy

#940 opened May 19, 2026 by mgehre-amd

Loading…

2 tasks done

feat: Add NPU+GPU async pipelining for vision-language models

#936 opened May 14, 2026 by liangliangchang • Draft

4 of 5 tasks

Annotate VLM/audio tower nn.Linear calls in PyTorch profiles

#934 opened May 13, 2026 by mgehre-amd

Loading…

[bench] wvSplitK skinny GEMM: capture timed iters into a CUDA graph

#928 opened May 8, 2026 by mgehre-amd • Draft

Hybrid

#918 opened May 4, 2026 by liangliangchang • Draft

5 tasks

Auto-build flash-attn wheels on push, upload to S3

#910 opened Apr 30, 2026 by mgehre-amd • Draft

1 task

[ROCm][DSv4] Share AITER decode dequant + fp8-cast buffers across layers (rebased, stacked on #902)

#903 opened Apr 27, 2026 by ChuanLi1101 • Draft

2 of 4 tasks

[ROCm][DSv4] Make AITER sparse decode cudagraph-clean (rebased, stacked on #901)

#902 opened Apr 27, 2026 by ChuanLi1101 • Draft

2 of 5 tasks

[ROCm][DSv4] AITER-accelerated MLA decode for DeepSeek V4 on MI355X (rebased on tj/dsv4prrebase)

#901 opened Apr 27, 2026 by ChuanLi1101 • Draft

1 of 4 tasks

Previous 1 2 Next

Previous Next

ProTip! Type g p on any issue or pull request to go back to the pull request listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!