Pull requests: modelscope/ms-swift

Pull requests list

feat: Add YuFeng XGuard template support for training
#8179 opened Mar 3, 2026 by ciaoyizhen
1 of 4 tasks
feat: log grpo input images to wandb
#8157 opened Mar 2, 2026 by shunk031
1 of 4 tasks
[megatron] qwen3.5 use megatron-core
#8126 opened Feb 27, 2026 by Jintao-Huang
[megatron] support GLM-5 megatron
#8085 opened Feb 24, 2026 by Jintao-Huang
[feat] support frames packing for minicpmv4_5 video processing
#8046 opened Feb 13, 2026 by fanqiNO1
2 of 4 tasks
Add QAT (Quantization-Aware Training) Support Callback
#8042 opened Feb 12, 2026 by y2logic
1 task done
[v4] refactor v4 dataset sp patch_tasks
#7878 opened Jan 23, 2026 by Jintao-Huang
fix(megatron): disable checkpointing when calculate KL
#7828 opened Jan 20, 2026 by zzc0430
1 of 4 tasks
Update moe.sh
#7375 opened Jan 13, 2026 by Itime-ren
4 tasks
[grpo] support gigpo with gym
#7364 opened Jan 12, 2026 by londa61
3 tasks
[feature] add support for EAFT loss
#7361 opened Jan 12, 2026 by ymxyll
3 tasks
feat(cli): add setproctitle support to customize process name
#7278 opened Jan 4, 2026 by ciaoyizhen
1 task done
add sglang reasoning parser
#7171 opened Dec 23, 2025 by eliasyin
1 of 4 tasks
support cce、tiledmlp、activation cpu offload
#7169 opened Dec 23, 2025 by meichangsu1
1 of 4 tasks
[infer] Support infer cache impl
#7150 opened Dec 22, 2025 by Jintao-Huang
Improve vLLM examples regarding vllm_engine_kwargs use
#7133 opened Dec 19, 2025 by 3manifold
1 task done
[megatron] support megatron fsdp
#7117 opened Dec 18, 2025 by Jintao-Huang
[template] support mimo-v2 template
#7095 opened Dec 17, 2025 by Jintao-Huang
[feat] support TiledMLP in Deepspeed and FSDP2
#7090 opened Dec 17, 2025 by kevssim
2 of 4 tasks