VizuaraAI / nano-gpt-ossLinks

Learn the building blocks of how to build gpt-oss from scratch

☆110

Alternatives and similar repositories for nano-gpt-oss

Users that are interested in nano-gpt-oss are comparing it to the libraries listed below

Sorting:

VizuaraAI / truly-open-gpt-oss
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆85Updated 4 months ago
Mayankpratapsingh022 / DeepSeek-from-Scratch
☆76Updated 6 months ago
huggingface / trl-jobs
Train LLM on Hugging Face infra
☆67Updated 2 months ago
ariG23498 / gemma3-object-detection
Fine tune Gemma 3 on an object detection task
☆96Updated 6 months ago
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆279Updated 6 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆57Updated last year
anyscale / e2e-llm-workflows
Fine-tune an LLM to perform batch inference and online serving.
☆117Updated 8 months ago
huggingface / gpt-oss-recipes
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
☆496Updated 5 months ago
TrelisResearch / install-guides
Various installation guides for Large Language Models
☆77Updated 9 months ago
unslothai / unsloth-zoo
Utils for Unsloth https://github.com/unslothai/unsloth
☆187Updated last week
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆103Updated last year
abhishekkrthakur / chat-ext
chrome & firefox extension to chat with webpages: local llms
☆131Updated last year
ali-bahrainian / RAG_best_practices
☆105Updated 10 months ago
githubpradeep / notebooks
☆55Updated 5 months ago
fangyuan-ksgk / Tiny-GRPO
minimal GRPO implementation from scratch
☆102Updated 10 months ago
tiiuae / onebitllms
Lightweight toolkit package to train and fine-tune 1.58bit Language models
☆109Updated 8 months ago
slashml / awesome-small-language-models
☆121Updated 3 weeks ago
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆97Updated 8 months ago
TrelisResearch / one-click-llms
One click templates for inferencing Language Models
☆227Updated 2 months ago
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆83Updated last year
FareedKhan-dev / gpt4o-from-scratch
Implementation of a GPT-4o like Multimodal from Scratch using Python
☆77Updated 9 months ago
kabir2505 / tiny-mixtral
☆45Updated 8 months ago
kurakurai / Luth
Luth is a state-of-the-art series of fine-tuned LLMs for French
☆41Updated 3 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆59Updated 3 months ago
AK391 / dailypapersHN
☆87Updated last year
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆37Updated 8 months ago
shivendrra / SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆167Updated last year
thad0ctor / unsloth-5090-multiple
unsloth-5090-multiple
☆60Updated 8 months ago
FareedKhan-dev / llm-scale-deploy-guide
An end-to-end pipeline to optimize and host LLM for 100K parallel queries
☆36Updated 6 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆125Updated 5 months ago