VizuaraAI / truly-open-gpt-ossLinks
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆55Updated 3 weeks ago
Alternatives and similar repositories for truly-open-gpt-oss
Users that are interested in truly-open-gpt-oss are comparing it to the libraries listed below
Sorting:
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆36Updated 4 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆268Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆75Updated 2 weeks ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆31Updated this week
- ☆45Updated 4 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆71Updated 5 months ago
- Enhancing LLMs with LoRA☆137Updated 2 weeks ago
- ☆54Updated 2 months ago
- ☆155Updated 5 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆155Updated last month
- ☆46Updated 5 months ago
- Code examples showing how to use Gemini, Gemma, Imagen, and more.☆43Updated 5 months ago
- A Demo of Cache-Augmented Generation (CAG) in an LLM☆109Updated 3 months ago
- ☆99Updated 2 weeks ago
- ☆115Updated 3 months ago
- An Automatic Prompt Optimization Framework for Large Language Models☆119Updated last month
- One click templates for inferencing Language Models☆214Updated last month
- ☆31Updated 6 months ago
- An open-source implementation of Whisper☆434Updated this week
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 9 months ago
- 🎈 A series of lightweight GPT models featuring TinyGPT Base (~51M params) and TinyGPT-MoE (~85M params). Fast, creative text generation …☆12Updated last week
- Local Agentic RAG using Langchain and Agno☆28Updated 6 months ago
- ☆62Updated 2 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆446Updated last month
- minimal GRPO implementation from scratch☆97Updated 6 months ago
- Learn the building blocks of how to build gpt-oss from scratch☆84Updated this week
- ☆99Updated 3 months ago
- ☆75Updated 11 months ago
- Building LLaMA 4 MoE from Scratch☆64Updated 5 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated last month