[Klaud Cold] Update dsv4-fp8-mi355x-sglang SGLang ROCm image to v0.5.12-rocm720-mi35x-20260517 by functionstackx · Pull Request #1470 · SemiAnalysisAI/InferenceX

functionstackx · 2026-05-18T02:16:03Z

Summary

Update SGLang ROCm image from custom rocm/sgl-dev:deepseek-v4-mi35x (23d old) to v0.5.12-rocm720-mi35x-20260517

Recipes touched: `dsv4-fp8-mi355x-sglang`

Test plan

full-sweep-enabled sweep passes.

🤖 Generated with Claude Code

…35x-20260517 Update SGLang ROCm image from custom rocm/sgl-dev:deepseek-v4-mi35x (23d old) to v0.5.12-rocm720-mi35x-20260517 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

github-actions · 2026-05-18T02:16:11Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-05-18T02:16:11Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your single node PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

PR authors are responsible for ensuring that after merging, all GitHub Action jobs fully pass. A lot of the time, failures are just flakes and simply re-running the failed jobs will fix it. If re-running failed jobs is attempted, PR authors are responsible for ensuring it passes. See GitHub's docs on re-running failed jobs: https://docs.github.com/en/actions/how-tos/manage-workflow-runs/re-run-workflows-and-jobs#re-running-failed-jobs-in-a-workflow

As a rule of thumb, generally, PR authors should request a review & get a PR approval from the respective companies' CODEOWNERS before requesting a review from core maintainers.

If additional help is needed, PR authors can reach out to core maintainers over Slack.

github-actions · 2026-05-18T02:16:41Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26009964668
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26009964668

claude

LGTM — single image tag bump with matching changelog entry.

Extended reasoning...

Overview

This PR updates one line in .github/configs/amd-master.yaml to bump the dsv4-fp8-mi355x-sglang recipe's image from rocm/sgl-dev:deepseek-v4-mi35x to lmsysorg/sglang-rocm:v0.5.12-rocm720-mi35x-20260517, plus a matching entry appended to perf-changelog.yaml. No other recipes are touched.

Security risks

None. Both files are config YAML consumed by the perf sweep pipeline; no code paths or auth/permission boundaries are modified. The new image moves from a custom dev tag to an official lmsysorg/sglang-rocm tag, which is the same registry/pattern used by sibling recipes in this file.

Level of scrutiny

Low. This is a mechanical image bump matching the established pattern across the file (compare qwen3.5-bf16-mi355x-sglang, glm5-fp8-mi355x-sglang, etc., which all use lmsysorg/sglang-rocm:vX.Y.Z-rocm720-mi35x-DATE style tags). The full-sweep-enabled label ensures the sweep will validate the new image before merge.

Other factors

The bug hunting system found no issues. The PR is labeled full-sweep-enabled so the sweep job will exercise the new image end-to-end. The changelog entry uses the correct PR number (1470) and config-keys matches the recipe name.

chunfangamd · 2026-05-18T09:52:18Z

@functionstackx According to your comment here: #1255 (comment), we haven't focused on the DSv4 FP8 version for a while.

If you want to add it back, some configurations from that closed PR can be reconsidered.

functionstackx · 2026-05-18T17:28:41Z

@chunfangamd this was an claude /loop, thanks for correcting my claude

The v0.5.12-rocm720-mi35x-20260517 image rejects bf16 weights in fp8 slots (ValueError: Downcasting not allowed at deepseek_v4.py:1544 → parameter.py:73 copy_with_check) — DSV4-Pro-FP8 has bf16 shared-expert/ embedding layers and the old rocm/sgl-dev:deepseek-v4-mi35x image tolerated this, the generic v0.5.12 ROCm image doesn't. Bump PR #1470 root-caused via the failing sweep; the recipe isn't viable on any tag in the current sglang ROCm MI355X lineage until upstream relaxes the downcast check or ships a converter. Closing that PR; if the recipe is needed again later it can be re-added with whichever image works at that point. Also removes the orphan launch script benchmarks/single_node/ dsv4_fp8_mi355x.sh (not referenced by any remaining recipe). Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

functionstackx · 2026-05-18T17:31:21Z

Recipe removed entirely in #1501 (admin-merged as adbaae52).

The v0.5.12-rocm720-mi35x-20260517 image rejects the DSV4-Pro-FP8 checkpoint's bf16 shared-expert / embedding layers with ValueError: Downcasting not allowed: target.dtype=torch.float8_e4m3fn, loaded_weight.dtype=torch.bfloat16 at deepseek_v4.py:1544 → parameter.py:73 copy_with_check. The old custom rocm/sgl-dev:deepseek-v4-mi35x image tolerated this; no current sglang ROCm MI355X release does, so the recipe isn't viable on any bumpable image. Re-add later if the upstream landscape changes.

github-actions · 2026-05-18T17:33:42Z

see unofficial run visualizer at https://inferencex.semianalysis.com/inference?unofficialRun=26009966705
see unofficial run visualizer at https://inferencex.semianalysis.com/evaluation?unofficialRun=26009966705

Update dsv4-fp8-mi355x-sglang SGLang ROCm image to v0.5.12-rocm720-mi…

5b8e2e7

…35x-20260517 Update SGLang ROCm image from custom rocm/sgl-dev:deepseek-v4-mi35x (23d old) to v0.5.12-rocm720-mi35x-20260517 Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

functionstackx requested a review from a team May 18, 2026 02:16

functionstackx added the full-sweep-enabled label May 18, 2026

functionstackx requested review from 1am9trash, billishyahao, chunfangamd, seungrokj and yctseng0211 as code owners May 18, 2026 02:16

github-project-automation Bot added this to InferenceMAX Board May 18, 2026

chore: fill pr-link for #1470

26a4a23

claude Bot reviewed May 18, 2026

View reviewed changes

functionstackx closed this May 18, 2026

github-project-automation Bot moved this to Done in InferenceMAX Board May 18, 2026

functionstackx mentioned this pull request May 18, 2026

[Klaud Cold] Remove dsv4-fp8-mi355x-sglang recipe #1501

Merged

2 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Klaud Cold] Update dsv4-fp8-mi355x-sglang SGLang ROCm image to v0.5.12-rocm720-mi35x-20260517#1470

[Klaud Cold] Update dsv4-fp8-mi355x-sglang SGLang ROCm image to v0.5.12-rocm720-mi35x-20260517#1470
functionstackx wants to merge 2 commits into
mainfrom
update-dsv4-fp8-mi355x-sglang-v0.5.12

functionstackx commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

claude Bot left a comment

Uh oh!

chunfangamd commented May 18, 2026

Uh oh!

functionstackx commented May 18, 2026

Uh oh!

functionstackx commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

functionstackx commented May 18, 2026

Summary

Test plan

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

Overview

Security risks

Level of scrutiny

Other factors

Uh oh!

chunfangamd commented May 18, 2026

Uh oh!

functionstackx commented May 18, 2026

Uh oh!

functionstackx commented May 18, 2026

Uh oh!

github-actions Bot commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants