Pull requests: SemiAnalysisAI/InferenceX
Pull requests list
Add MiniMax-M3 NVFP4 B300 single-node vLLM benchmark (EAGLE3 spec decode)
#1929
opened Jun 25, 2026 by
Ankur-singh
Collaborator
Add MiniMax-M3 NVFP4 B300 single-node aggregated vLLM benchmark
#1928
opened Jun 25, 2026 by
Ankur-singh
Collaborator
[codex] Add golden AL distributions
#1926
opened Jun 24, 2026 by
functionstackx
Collaborator
[NV] Refresh Minimax M3 FP8 submission with new recipes for GB300
full-sweep-enabled
#1925
opened Jun 24, 2026 by
richardhuo-nv
Collaborator
[WIP] agentic: add Kimi Mooncake LMCacheMP disagg recipe
sweep-enabled
#1924
opened Jun 24, 2026 by
YukioZzz
Collaborator
[WIP][NV] dsv4-fp4-b200-sglang image to SGLang nightly 20260624
full-sweep-enabled
#1923
opened Jun 24, 2026 by
hshrivastava-droid
Collaborator
[WIP][NV]Add Qwen3.5-397B-A17B-NVFP4 GB300 disagg multinode SGLang via Dynamo
full-sweep-enabled
#1921
opened Jun 24, 2026 by
hshrivastava-droid
Collaborator
[AMD] Add MiniMax-M3-FP4 MI355X ATOM EAGLE3 / non-EAGLE3 update 0623
AMD
full-sweep-enabled
#1917
opened Jun 24, 2026 by
seungrokj
Collaborator
6 of 8 tasks
[AMD] Add MiniMax-M3-FP8 MI355X ATOM EAGLE3 / non-EAGLE3 update 0623
AMD
full-sweep-enabled
#1916
opened Jun 24, 2026 by
seungrokj
Collaborator
8 tasks done
[AMD] Add MiniMax-M3-MXFP4 MI355X vLLM disagg recipe
full-sweep-enabled
#1914
opened Jun 24, 2026 by
Duyi-Wang
Collaborator
Update B300 FP4 SGLang (non-MTP) image to latest nightly
full-sweep-enabled
#1913
opened Jun 24, 2026 by
hshrivastava-droid
Collaborator
Add GLM-5-FP8 GB300 multinode dynamo-sglang MTP benchmark
full-sweep-enabled
#1907
opened Jun 23, 2026 by
hshrivastava-droid
Collaborator
[WIP][NV] Add MiniMax-M3 MXFP8 B300 1k/1k Dynamo-vLLM sweep
full-sweep-enabled
#1906
opened Jun 23, 2026 by
RohitNagraj
Collaborator
glm5.1-fp4-mi355x-sglang: bump image to v0.5.13.post1-20260622 + enable aiter allreduce fusion
full-sweep-enabled
#1905
opened Jun 23, 2026 by
jiacao-amd
Collaborator
Bump vLLM version for DSV4 B200 disagg
full-sweep-enabled
#1899
opened Jun 23, 2026 by
RohitNagraj
Collaborator
CollectiveX: experimental cross-vendor collective/EP benchmark
#1896
opened Jun 23, 2026 by
Oseltamivir
Collaborator
Add GLM-5-FP8 GB200 dynamo-sglang multinode benchmark
full-sweep-enabled
#1895
opened Jun 23, 2026 by
hshrivastava-droid
Collaborator
[AMD] dsv4 atom-disagg eval sweep — validate reduced ATOM logging
all-evals
Expand eval selection to every fixed-sequence config
evals-only
Suppress throughput and run only eval jobs; combine with all-evals to expand selection
full-sweep-enabled
#1882
opened Jun 22, 2026 by
Oseltamivir
Collaborator
[CI] Validate aggregate benchmark results before upload
#1881
opened Jun 21, 2026 by
edwingao28
[codex] Enforce complete eval validation and quiet ATOM logs
#1878
opened Jun 21, 2026 by
Oseltamivir
Collaborator
Draft
[AMD] Add MiniMax-M3-FP4 MI355X ATOM EAGLE3 only
AMD
full-sweep-enabled
#1866
opened Jun 20, 2026 by
seungrokj
Collaborator
3 tasks
[Klaud Cold] MI300X MiniMax-M3 nightly image and FP8 KV cache
full-sweep-fail-fast
#1858
opened Jun 19, 2026 by
cquil11
Collaborator
[AMD] Add MiniMax-M3-FP4 MI355X ATOMMESH
all-evals
AMD
full-sweep-enabled
#1856
opened Jun 19, 2026 by
seungrokj
Collaborator
4 tasks
[AMD] Add DSv4-FP4-MI355X ATOMMESH MTP
AMD
#1855
opened Jun 19, 2026 by
seungrokj
Collaborator
2 tasks