Pinned Loading
Repositories
Showing 1 of 1 repositories
- LMRax Public
LMRax is a framework built on JAX to train transformers language models by reinforcement learning, along with the reward model training.
Almost-Intelligence/LMRax’s past year of commit activity
Top languages
Loading…
Most used topics
Loading…