Tags · fasterinnerlooper/DeepSpeed

v0.14.0

Update version.txt

Mar 8, 2024
ce78a63
zip
tar.gz

v0.13.5

fix fused_qkv model accuracy issue (deepspeedai#5217)

The fused_qkv model cannot correctly choose the fused_qkv type. The
module_name_matches list needs to be updated.

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>

Mar 5, 2024
bc0d246
zip
tar.gz

v0.13.4

Add script to check for `--extra-index-url` (deepspeedai#5184)

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>

Feb 26, 2024
5115df3
zip
tar.gz

v0.13.3

Switch cpu-inference workflow from --extra-index-url to --index-url (deepspeedai#5182)

This switch should have no impact on the workflow, but it ensures that we
only download this package from the correct feed rather than falling back
to the index-url
[default](https://pip.pypa.io/en/stable/cli/pip_install/#cmdoption-i)
(PyPI), where a package with a higher version, if one existed, would be
chosen instead.
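
The difference between the two flags can be illustrated with a toy model. This is only a sketch of the selection behaviour described above, not pip's resolver code; the feed names, the package name `example-pkg`, and the version numbers are all hypothetical.

```python
# Toy model of index selection; NOT pip's actual resolver.
# All feed names, package names, and versions here are hypothetical.

def parse_version(version):
    """Turn '2.1.0' into (2, 1, 0) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def pick_version(indexes, package):
    """Return (version, index_name) for the highest version found in any index."""
    candidates = [
        (parse_version(version), version, name)
        for name, packages in indexes.items()
        for version in packages.get(package, [])
    ]
    if not candidates:
        raise LookupError(f"no candidates for {package}")
    _, version, name = max(candidates)
    return version, name


private_feed = {"example-pkg": ["2.1.0"]}
pypi = {"example-pkg": ["9.9.9"]}

# With --extra-index-url, the private feed is consulted in addition to PyPI,
# so a higher version on PyPI wins.
with_extra = pick_version({"private": private_feed, "pypi": pypi}, "example-pkg")

# With --index-url, the private feed replaces PyPI as the only index.
with_index = pick_version({"private": private_feed}, "example-pkg")

print(with_extra)
print(with_index)
```

In this model the `--extra-index-url` case resolves to the PyPI copy, while the `--index-url` case can only resolve to the private feed.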

Feb 23, 2024
afdf028
zip
tar.gz

v0.13.2

Remove optimizer step on initialization (deepspeedai#5104)

All ZeRO stages (1, 2, and 3) call the optimizer's `step()` during
initialization. This increments a counter in the optimizer and produces
a different parameter update than normal PyTorch usage does.
This PR eliminates `step()` in the initialization and lazily configures
some internal states (linking *hp_params*) after the first `step()`
call.
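
Why an extra `step()` matters can be seen in a toy scalar Adam, whose bias correction depends on the step counter. This is a minimal sketch rather than DeepSpeed or PyTorch code; the `ToyAdam` class is hypothetical, and it assumes the initialization-time step runs with a zero gradient.

```python
import math


class ToyAdam:
    """Hypothetical scalar Adam, used only to show the step-counter effect."""

    def __init__(self, lr=0.1, beta1=0.9, beta2=0.999, eps=1e-8):
        self.lr, self.beta1, self.beta2, self.eps = lr, beta1, beta2, eps
        self.m = 0.0  # first-moment estimate
        self.v = 0.0  # second-moment estimate
        self.t = 0    # step counter used for bias correction

    def step(self, param, grad):
        self.t += 1
        self.m = self.beta1 * self.m + (1 - self.beta1) * grad
        self.v = self.beta2 * self.v + (1 - self.beta2) * grad * grad
        m_hat = self.m / (1 - self.beta1 ** self.t)
        v_hat = self.v / (1 - self.beta2 ** self.t)
        return param - self.lr * m_hat / (math.sqrt(v_hat) + self.eps)


# Normal usage: the first real update happens at t = 1.
plain = ToyAdam()
w_plain = plain.step(1.0, 0.5)

# Assumed initialization-time step with a zero gradient: the parameter is
# unchanged, but the counter advances, so the first real update runs at t = 2.
warmed = ToyAdam()
w_init = warmed.step(1.0, 0.0)
w_warmed = warmed.step(w_init, 0.5)

print(w_plain, w_warmed)
```

The same gradient applied to the same parameter yields two different values, because the bias-correction terms are evaluated at different step counts.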

---------

Co-authored-by: Olatunji Ruwase <olruwase@microsoft.com>

Feb 11, 2024
1817980
zip
tar.gz

v0.13.1

Refactor the Qwen positional embedding config code (deepspeedai#4955)

Follows up on PR deepspeedai#4920 for the Qwen inference code.

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>

Jan 23, 2024
1d35db7
zip
tar.gz

v0.13.0

Update release.yml

Jan 19, 2024
1c8b8f3
zip
tar.gz

v0.12.6

Mixtral FastGen Support (deepspeedai#4828)

Adds support for Mixtral with FastGen. Key features implemented:

1. Top-2 MoE support
2. Better support for RoPE thetas
3. The mistral model implementation

---------

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>

Dec 21, 2023
c00388a
zip
tar.gz

v0.12.5

Fix 4649 (deepspeedai#4650)

Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>

Dec 15, 2023
65b7727
zip
tar.gz

v0.12.4

Add safetensors support (deepspeedai#4659)

Co-authored-by: Logan Adams <114770087+loadams@users.noreply.github.com>
Co-authored-by: Jeff Rasley <jerasley@microsoft.com>
Co-authored-by: Michael Wyatt <michaelwyatt@microsoft.com>

Dec 1, 2023
7122362
zip
tar.gz
