-
Notifications
You must be signed in to change notification settings - Fork 874
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(rollout): drain generation before offload memory release
#2015
opened Jun 4, 2026 by
EazyReal
Loading…
fix(colocate): derive num_gpus_per_node from actor_num_gpus_per_node
#2012
opened Jun 3, 2026 by
aoshen02
Contributor
Loading…
perf(ppo): reduce log-prob + entropy cross-entropy peak memory
#2011
opened Jun 2, 2026 by
Mantissagithub
Loading…
perf(megatron-loss): scale logits per-chunk to avoid OOM
#2010
opened Jun 2, 2026 by
Yangruipis
Contributor
Loading…
perf(rollout): pack loss_masks as np.int8 at the ray.put boundary
#2006
opened Jun 2, 2026 by
Chasing1020
Contributor
Loading…
fix(gpt-oss): emit fused raw expert tensors for SGLang
#2004
opened Jun 1, 2026 by
Jiang020609
Contributor
Loading…
fix: clamp max_new_tokens on retry to prevent response_length overflow
#2003
opened Jun 1, 2026 by
YaoweiFan
Loading…
fix(megatron): honor --[no-]gradient-accumulation-fusion on the megatron.bridge provider
#1999
opened Jun 1, 2026 by
aoshen02
Contributor
Loading…
fix(rollout): support non extra gpu placement when using rollout-external mode
#1997
opened May 30, 2026 by
shinytang6
Loading…
fix(logging): partition raw rewards for correct samples
#1996
opened May 30, 2026 by
Jiang020609
Contributor
Loading…
[sft] rebuild the sft loss mask generator and add ci
#1994
opened May 30, 2026 by
zhuzilin
Contributor
Loading…
Add timeout configuration for on policy distillation HTTP session.
#1970
opened May 28, 2026 by
qqwqqw689
Contributor
Loading…
fix:TorchMemorySaver observes invalid LD_PRELOAD. when add --disable-weights-backuper
#1937
opened May 22, 2026 by
zyfzjsc988
Loading…
feat: add SFT entropy logging and validation loss monitoring
#1925
opened May 19, 2026 by
none0663
Contributor
Loading…
fix(debug): auto-append rollout_id/rank in save_debug_train_data path template
#1922
opened May 19, 2026 by
wlf-darkmatter
Loading…
Fix RolloutManager reward normalization for uneven rollout groups
#1918
opened May 18, 2026 by
haoyang9804
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-06-03.