[AMD] dsv4-fp4-mi355x-atom: enable DPA at high concurrency, update image to atom0.1.4 #1717
Merged
36 commits
31b4fbe [AMD] dsv4-fp4-mi355x-atom: enable DPA TBO at high concurrency, updat… (seungrokj)
c566e28 [AMD] perf-changelog: dsv4-fp4-mi355x-atom DPA TBO + image atom0.1.4 (seungrokj)
7e1aa06 [AMD] perf-changelog: add PR link #1717 (seungrokj)
65e0fa3 [AMD] dsv4_fp4_mi355x_atom.sh: disable prefix caching (seungrokj)
3f3560b [AMD] dsv4-fp4-mi355x-atom: add max-model-len, eval context, extend c… (seungrokj)
c3b3289 [AMD] dsv4-fp4-mi355x-atom: narrow eval to single conc=1024 point, di… (seungrokj)
7ffa976 [AMD] dsv4_fp4_mi355x_atom.sh: add cudagraph-capture-sizes and max-nu… (seungrokj)
f2677b2 [AMD] dsv4-fp4-mi355x-atom: bump to nightly image, expand search spac… (seungrokj)
f5f0d66 [AMD] set GPU_MAX_HW_QUEUES=5 in dsv4_fp4_mi355x_atom.sh (seungrokj)
dc5b239 [AMD] dsv4-fp4-mi355x-atom: disable TBO, add TP4 rows for isl=8192, c… (seungrokj)
1dbf259 Merge branch 'main' into amd/dsv4_atom_0612 (seungrokj)
9e18052 [AMD] dsv4_fp4_mi355x_atom.sh: quote SERVER_LOG variable (seungrokj)
c1812ed [AMD] dsv4_fp4_mi355x_atom.sh: comment out dense cudagraph sizes (seungrokj)
28bdc6a [AMD] dsv4_fp4_mi355x_atom.sh: fix --hf-overrides JSON escaping (seungrokj)
b36218e [AMD] dsv4_fp4_mi355x_atom.sh: comment out dense cudagraph sizes (seungrokj)
fa47caf [AMD] dsv4-fp4-mi355x-atom: expand search space, restore isl=1024 rows (seungrokj)
1022e0b Merge branch 'main' into amd/dsv4_atom_0612 (seungrokj)
af82c27 [AMD] perf-changelog: update dsv4-fp4-mi355x-atom image and search-sp… (seungrokj)
1300012 [AMD] dsv4_fp4_mi355x_atom.sh: restore sparse cudagraph capture sizes (seungrokj)
f56f877 [AMD] perf-changelog: revert dsv4-fp4-mi355x-atom image/search-space,… (seungrokj)
f7c9de8 Merge branch 'main' into amd/dsv4_atom_0612 (seungrokj)
a4828cb [AMD] perf-changelog: add dsv4-fp4-mi355x-sglang entry for PR #1762 (seungrokj)
19b8757 update dsv4-fp4-mi355x-atom: bump image, enable TBO conditionally, fi… (seungrokj)
03aaa6b expand dsv4-fp4-mi355x-atom search space: restore ISL1024 scenarios, … (seungrokj)
cf3962f Merge branch 'main' into amd/dsv4_atom_0612 (seungrokj)
421313c Update perf-changelog.yaml (seungrokj)
ae77233 Update perf-changelog.yaml (seungrokj)
a8f6bd0 Update perf-changelog.yaml (seungrokj)
5fbd068 Update perf-changelog.yaml (seungrokj)
d080faa update perf-changelog: move dsv4-fp4-mi355x-atom entry to end (seungrokj)
91f6277 narrow dsv4-fp4-mi355x-atom to DPA conc=256-2048 ISL8192, fix TBO bra… (seungrokj)
4364ef9 restore full dsv4-fp4-mi355x-atom search space: ISL1024 + ISL8192 TP4… (seungrokj)
6644109 Update perf-changelog.yaml (seungrokj)
471aff2 Update perf-changelog.yaml (seungrokj)
67b052f fix: resolve PR 1717 changelog conflict (Oseltamivir)
bcf0d1f Merge remote-tracking branch 'origin/main' into amd/dsv4_atom_0612 (Oseltamivir)