VizuaraAI / truly-open-gpt-oss
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆82 Updated 4 months ago
Alternatives and similar repositories for truly-open-gpt-oss
Users who are interested in truly-open-gpt-oss are comparing it to the repositories listed below
- Learn the building blocks of how to build gpt-oss from scratch ☆108 Updated 3 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆276 Updated 5 months ago
- ☆62 Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- ☆122 Updated 6 months ago
- ☆69 Updated 5 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French ☆41 Updated 2 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion ☆169 Updated 4 months ago
- Train LLM on Hugging Face infra ☆67 Updated last month
- ☆46 Updated 9 months ago
- qwen3 experiments ☆33 Updated 6 months ago
- ☆68 Updated 7 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ☆88 Updated last week
- Simple & Scalable Pretraining for Neural Architecture Research ☆306 Updated last month
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆37 Updated 7 months ago
- ☆158 Updated 8 months ago
- ☆108 Updated 6 months ago
- Fine tune Gemma 3 on an object detection task ☆95 Updated 5 months ago
- Efficient non-uniform quantization with GPTQ for GGUF ☆58 Updated 3 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models ☆104 Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆109 Updated 10 months ago
- ☆301 Updated 5 months ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas. ☆22 Updated 4 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 4 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers ☆73 Updated 8 months ago
- Verifiers for LLM Reinforcement Learning ☆80 Updated 3 months ago
- ☆45 Updated 8 months ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov… ☆480 Updated last week
- minimal GRPO implementation from scratch ☆102 Updated 9 months ago