VizuaraAI / truly-open-gpt-ossLinks
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆85Updated 4 months ago
Alternatives and similar repositories for truly-open-gpt-oss
Users that are interested in truly-open-gpt-oss are comparing it to the libraries listed below
Sorting:
- Learn the building blocks of how to build gpt-oss from scratch☆110Updated 4 months ago
- qwen3 experiments☆34Updated 6 months ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆109Updated 8 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆41Updated 3 months ago
- ☆76Updated 6 months ago
- ☆122Updated 7 months ago
- Simple & Scalable Pretraining for Neural Architecture Research☆307Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆279Updated 6 months ago
- ☆158Updated 9 months ago
- ☆109Updated 7 months ago
- minimal GRPO implementation from scratch☆102Updated 10 months ago
- Enhancing LLMs with LoRA☆206Updated 3 months ago
- ~950 line, minimal, extensible LLM inference engine built from scratch.☆396Updated 3 weeks ago
- ☆62Updated 6 months ago
- ☆46Updated 9 months ago
- Train LLM on Hugging Face infra☆67Updated 2 months ago
- Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…☆60Updated last year
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆85Updated 5 months ago
- ☆31Updated 10 months ago
- ☆45Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆168Updated 5 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆140Updated 5 months ago
- Fine tune Gemma 3 on an object detection task☆96Updated 6 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆38Updated last month
- RLVR Testing and Training☆23Updated 5 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- Sparse Inferencing for transformer based LLMs☆218Updated 5 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Updated 3 months ago