VizuaraAI / truly-open-gpt-ossLinks
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆62Updated last month
Alternatives and similar repositories for truly-open-gpt-oss
Users that are interested in truly-open-gpt-oss are comparing it to the libraries listed below
Sorting:
- Train LLM on Hugging Face infra☆64Updated last month
- ☆116Updated 4 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆271Updated 3 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆88Updated 3 weeks ago
- ☆157Updated 6 months ago
- Enhancing LLMs with LoRA☆163Updated last month
- ☆103Updated 3 months ago
- ☆53Updated 3 months ago
- minimal GRPO implementation from scratch☆98Updated 7 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆34Updated this week
- Code examples showing how to use Gemini, Gemma, Imagen, and more.☆44Updated 6 months ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation …☆12Updated last month
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆193Updated 2 weeks ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆15Updated 6 months ago
- ☆31Updated 7 months ago
- Sparse Inferencing for transformer based LLMs☆201Updated 2 months ago
- ☆45Updated 5 months ago
- ☆62Updated 3 months ago
- qwen3 experiments☆32Updated 3 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆59Updated this week
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 5 months ago
- ☆207Updated 2 weeks ago
- Fine tune Gemma 3 on an object detection task☆86Updated 3 months ago
- Verifiers for LLM Reinforcement Learning☆75Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆459Updated last month
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆345Updated 3 months ago
- ☆46Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 9 months ago
- ☆112Updated last month
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆92Updated 4 months ago