Skip to content

Tags: msp8955/DeepSpeed

Tags

v0.6.5

Toggle v0.6.5's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fixing several bugs in the inference-api and the kernels (deepspeedai…

…#1951)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

v0.6.4

Toggle v0.6.4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Improve z3 trace management (deepspeedai#1916)

* Fix OOM and type mismatch

* Toggle prefetching

* Disable z3 prefetching for inference (temp workaround)

* Fix zero3 tracing issues

* Remove debug prints

* Enable prefetch for inference

* Code clarity

* Invalidate trace cache

* Trace cache invalidation when needed
Separate nvme prefetch from all-gather prefetch

* Track last used step id

* Use debug name in error message

* Construct param trace from module trace

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

v0.6.3

Toggle v0.6.3's commit message
bumped to v0.6.3

v0.6.2

Toggle v0.6.2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fix OOM and type mismatch (deepspeedai#1884)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

v0.6.1

Toggle v0.6.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
qkv_out can be a single tensor or a list. Handling these cases separe…

…tely. (deepspeedai#1850)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

v0.6.0

Toggle v0.6.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
force amd install via sudo (deepspeedai#1815)

v0.5.10

Toggle v0.5.10's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
[ZeRO] Default disable elastic ckpt in stage 1+2 and reduce CPU memor…

…y overhead during ckpt load (deepspeedai#1525)

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>

v0.5.9

Toggle v0.5.9's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Various small documentation text improvements (deepspeedai#1665)

Co-authored-by: Jeff Rasley <jerasley@microsoft.com>

v0.5.8

Toggle v0.5.8's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Remove unused import of ssl.OP_ENABLE_MIDDLEBOX_COMPAT (deepspeedai#1601

)

v0.5.7

Toggle v0.5.7's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Several fixes for our read-the-docs build (deepspeedai#1579)