Tags: Art39print/together-python
Tags
Hotfix: remove DPO hyperparameter validations (togethercomputer#329) * hotfix remove valitaions * bump version --------- Co-authored-by: Soroush Bassam <sbassam@users.noreply.github.com>
Fix simpo_gamma default value for CLI (togethercomputer#328) * Fix simpo default * Bump
New options for preference tuning: rpo alpha, logprobs normalization,… … reference-free, simpo gamma (togethercomputer#327) * Add dpo improvements arguments * Version bump (tmp, dev) * Implicit setting of `reference_free` in case if simpo_gamma is set * Fix unbound variable * Fix * Force normalization for simpo * Version bump * Formatting * Version fix * Remove reference-free from dpo * Review fixes * Formatting * Fixes
Add validation for lora_dropout (togethercomputer#316) * Add validation for lora_dropout * Fix * Fix eos * Replace with llama3b * Minimize the diff * Version bump
Use /checkpoints instead of events parsing (togethercomputer#312) * Use /checkpoints instead of events parsing * Fix events listing for async client * Change the parsing logic due to the api changes * Fix string formatting * Parsing updated * Formatting * Remove old implementation
Add a logic to support max_batch_size_dpo. (togethercomputer#305) * Add a logic for max_batch_size_dpo, update version * Fix tests * Use default valuat to support old API
Bump the dependencies group across 1 directory with 3 updates (togeth… …ercomputer#304) * Bump the dependencies group across 1 directory with 3 updates Bumps the dependencies group with 3 updates in the / directory: [rich](https://github.com/Textualize/rich), [ruff](https://github.com/astral-sh/ruff) and [pre-commit](https://github.com/pre-commit/pre-commit). Updates `rich` from 13.9.4 to 14.0.0 - [Release notes](https://github.com/Textualize/rich/releases) - [Changelog](https://github.com/Textualize/rich/blob/master/CHANGELOG.md) - [Commits](Textualize/rich@v13.9.4...v14.0.0) Updates `ruff` from 0.9.10 to 0.11.7 - [Release notes](https://github.com/astral-sh/ruff/releases) - [Changelog](https://github.com/astral-sh/ruff/blob/main/CHANGELOG.md) - [Commits](astral-sh/ruff@0.9.10...0.11.7) Updates `pre-commit` from 4.1.0 to 4.2.0 - [Release notes](https://github.com/pre-commit/pre-commit/releases) - [Changelog](https://github.com/pre-commit/pre-commit/blob/main/CHANGELOG.md) - [Commits](pre-commit/pre-commit@v4.1.0...v4.2.0) --- updated-dependencies: - dependency-name: rich dependency-version: 14.0.0 dependency-type: direct:production update-type: version-update:semver-major dependency-group: dependencies - dependency-name: ruff dependency-version: 0.11.7 dependency-type: direct:development update-type: version-update:semver-minor dependency-group: dependencies - dependency-name: pre-commit dependency-version: 4.2.0 dependency-type: direct:development update-type: version-update:semver-minor dependency-group: dependencies ... Signed-off-by: dependabot[bot] <support@github.com> * poetry lock --------- Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: orangetin <abhy@together.ai>
Make fine-tuning job validation messages more informative (togetherco… …mputer#292) * more verbose messages * update test
PreviousNext