View Vidit-Ostwal's full-sized avatar
Python 158 88 Updated Feb 20, 2026

A repo of resources for the GPU Mode talk on OpenEnv.

14 Updated Jan 14, 2026
Python 7 Updated Jan 13, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,179 505 Updated Mar 3, 2026

Learn CUDA with PyTorch

Cuda 235 32 Updated Mar 3, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 157,297 32,271 Updated Mar 3, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 86,815 13,177 Updated Mar 2, 2026

Environments by the Prime Intellect Research Team

Python 24 9 Updated Mar 3, 2026

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,280 98 Updated Mar 3, 2026

Personal AI Notebooks. Organize files & webpages and generate notes from them. Open source, local & open data, open model choice (incl. local).

TypeScript 3,250 235 Updated Feb 23, 2026

Simplifying reinforcement learning for complex game environments

C 5,133 400 Updated Mar 3, 2026

👷 Build compute kernels

Nix 217 35 Updated Jan 27, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,360 113 Updated Jan 16, 2026

Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs

Python 2,258 271 Updated Oct 5, 2025

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,652 4,950 Updated Aug 1, 2024

The ultimate training toolkit for finetuning diffusion models

Python 9,618 1,160 Updated Mar 3, 2026

Our library for RL environments + evals

Python 3,872 507 Updated Mar 3, 2026
Python 871 45 Updated Sep 15, 2025

Build, enrich, and transform datasets using AI models with no code

TypeScript 1,628 135 Updated Oct 23, 2025

Get your documents ready for gen AI

Python 54,715 3,678 Updated Mar 3, 2026

Examples and guides for using the OpenAI API

Jupyter Notebook 71,801 12,066 Updated Mar 3, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,848 2,051 Updated Jan 13, 2026

This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are o…

9,304 1,366 Updated Aug 5, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,701 728 Updated Mar 3, 2026

🤖🕰️ An MCP server that gives language models temporal awareness and time calculation abilities. Teaching AI the significance of the passage of time through collaborative tool development.

Python 713 47 Updated Jun 17, 2025

Making Flux go brrr on GPUs.

Python 163 17 Updated Jan 5, 2026

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,717 4,732 Updated Mar 2, 2026