-
Notifications
You must be signed in to change notification settings - Fork 4.8k
Pull requests: deepspeedai/DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix PR-target workflow concurrency groups
#8017
opened May 20, 2026 by
tohtana
Collaborator
Loading…
Fix full CI test isolation for ZeRO chmod and NVMe quantization tests
#8008
opened May 14, 2026 by
tohtana
Collaborator
Loading…
docs: add test directory convention to AGENTS.md
#7993
opened May 6, 2026 by
delock
Collaborator
Loading…
Add engine.coalesce_grad_reduction() for ZeRO 1/2/3 multi-backward
#7992
opened May 5, 2026 by
roycho96
Contributor
Loading…
Refactor/torch autocast encapsulate global state
#7946
opened Apr 2, 2026 by
nathon-lee
Contributor
Loading…
Fix ZeRO-3 optimizer initialization validation (#7844)
#7929
opened Mar 28, 2026 by
amadhan882
Loading…
[Feature] Enable AutoEP Compatibility with ZeRO-3
#7928
opened Mar 28, 2026 by
nathon-lee
Contributor
Loading…
doc: Remove suggestion to build extensions in parallel
#7899
opened Mar 12, 2026 by
Flamefire
Contributor
Loading…
Fix Stage 0 + Ulysses crash: make bwc_tensor_model_parallel_rank() resilient to MP API absence
#7888
opened Mar 6, 2026 by
nathon-lee
Contributor
Loading…
fix(zero): Ensure full gradient reduction for Muon optimizer with reduce_scatter
#7878
opened Feb 27, 2026 by
nathon-lee
Contributor
Loading…
fix: correct DistributedAttention output shape and pad uneven sequence lengths (#7842)
#7868
opened Feb 22, 2026 by
harshang03
•
Draft
fix: keep fp32-pinned parameters out of the bf16 cast path in ZeRO-3 (#7747)
#7867
opened Feb 22, 2026 by
harshang03
•
Draft
Revert "fix: remove premature MPI environment variable check in OpenMPIRunner"
#7864
opened Feb 21, 2026 by
mikloorbi-sys
•
Draft
Fix global .cuh ignore and enforce tracked CUDA headers
#7858
opened Feb 18, 2026 by
harshang03
•
Draft
Fix ZeRO legacy grad-hook crash when next_functions is missing
#7857
opened Feb 17, 2026 by
harshang03
•
Draft
Reject non-finite fp16 loss_scale across config and ZeRO paths
#7856
opened Feb 17, 2026 by
harshang03
•
Draft
Fix zero/division safety gaps in utility and inference paths
#7855
opened Feb 17, 2026 by
harshang03
•
Draft
Fix count_used_parameters_in_backward crash on PyTorch < 2.3 (#7756)
#7849
opened Feb 12, 2026 by
harshang03
•
Draft
[BUG] Fix: Fix gradient norm calculation and dynamic shape blocking in PP+ZeRO1 collective communication
#7847
opened Feb 12, 2026 by
Thinksky5124
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-05-18.