VizuaraAI / truly-open-gpt-oss

A truly open version of gpt-oss which shows the entire pre-training from scratch

☆76

Alternatives and similar repositories for truly-open-gpt-oss

Users who are interested in truly-open-gpt-oss are comparing it to the repositories listed below.

VizuaraAI / nano-gpt-oss
Learn the building blocks of how to build gpt-oss from scratch
☆105 Updated 2 months ago
huggingface / trl-jobs
Train LLM on Hugging Face infra
☆67 Updated 2 weeks ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆302 Updated last month
ideaweaver-ai / DeepSeek-Children-Stories-15M-model
☆107 Updated 5 months ago
janhq / ReZero
☆158 Updated 7 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108 Updated 8 months ago
kurakurai / Luth
Luth is a state-of-the-art series of fine-tuned LLMs for French
☆39 Updated last month
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆36 Updated 6 months ago
QuixiAI / grokadamw
☆136 Updated last year
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆274 Updated 4 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆99 Updated 6 months ago
reka-ai / rekaquant
☆62 Updated 4 months ago
SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆166 Updated 3 months ago
tiiuae / Falcon-H1
All information and news with respect to Falcon-H1 series
☆93 Updated last month
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆99 Updated 8 months ago
MBZUAI-IFM / K2-Think-SFT
☆127 Updated 2 months ago
kmccleary3301 / nested_learning
A Reproduction of GDM's Nested Learning Paper
☆212 Updated last week
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆215 Updated 8 months ago
mkurman / grpo-llm-evaluator
Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…
☆47 Updated 6 months ago
Open-Superintelligence-Lab / blueberry-llm
☆45 Updated this week
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆72 Updated 7 months ago
eqimp / hogwild_llm
Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache
☆133 Updated 3 months ago
kabir2505 / tiny-mixtral
☆45 Updated 6 months ago
joey00072 / ohara
Collection of autoregressive model implementations
☆86 Updated 7 months ago
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆66 Updated last week
kmohan321 / Research_Papers
☆46 Updated 7 months ago
anhvth / opensloth
☆229 Updated 2 months ago
cornstarch-org / Cornstarch
☆113 Updated 2 months ago
NimbleEdge / sparse_transformers
Sparse Inferencing for transformer based LLMs
☆213 Updated 3 months ago
YuvrajSingh-mist / SmolLlama
So, I trained a Llama, a 130M architecture I coded from the ground up, to build a small instruct model from scratch. Trained on FineWeb dataset…
☆16 Updated 8 months ago