VizuaraAILabs/nano-gpt-oss

Learn the building blocks of how to build gpt-oss from scratch

☆120

Alternatives and similar repositories for nano-gpt-oss

Users who are interested in nano-gpt-oss are comparing it to the repositories listed below.

VizuaraAILabs / truly-open-gpt-oss
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆90 · Sep 4, 2025 · Updated 10 months ago
VizuaraAILabs / DeepSeek-From-Scratch
Learn the building blocks of how to build DeepSeek from scratch.
☆147 · May 9, 2026 · Updated 2 months ago
DmitriyKras / Small-objects-segmentation
This repository contains project about segmentation small flying objects with U-Net, PSP-Net and FCN
☆14 · Mar 26, 2023 · Updated 3 years ago
imneonizer / pytorch-triplet-loss
Birds 400-Species Image Classification using Pytorch Metric Learning (Triplet Margin Loss)
☆13 · Nov 1, 2022 · Updated 3 years ago
Mayankpratapsingh022 / DeepSeek-from-Scratch
☆118 · Jul 13, 2025 · Updated last year
tanmay-bakshi / HierarchicalReasoningModel
Implementation of "Hierarchical Reasoning Model" in MLX (Swift)
☆26 · Aug 11, 2025 · Updated 11 months ago
ideaweaver-ai / DeepSeek-Children-Stories-15M-model
☆115 · Jun 19, 2025 · Updated last year
ideaweaver-ai / qwen3-from-scratch
☆16 · Jul 4, 2026 · Updated 3 weeks ago
leockl / tool-ahead-of-time-ts
This is a TypeScript package to add tool calling capabilities to newly released LLMs on LangChain.js's ChatOpenAI and BaseChatModel class…
☆18 · Jun 4, 2025 · Updated last year
VizuaraAILabs / Tiny-Stories-Regional
☆59 · Jan 26, 2026 · Updated 5 months ago
openconstruct / libremodel
opensource LLM
☆35 · Sep 20, 2025 · Updated 10 months ago
DLUTElvis / GNN4Rec-Papers
Recent papers on Graph Neural Networks-based Recommender System.
☆12 · Aug 21, 2023 · Updated 2 years ago
JanTempus / tokenisation_lp
☆15 · May 20, 2026 · Updated 2 months ago
vukrosic / open-source-ai
Push open-source AI research to the frontier by the end of 2026 - open frontier science for everybody.
☆28 · Aug 13, 2025 · Updated 11 months ago
davidbrowne17 / Mimi-Voice
Create Unmute voice embeddings
☆26 · Nov 15, 2025 · Updated 8 months ago
necat101 / Hierarchos
A novel hybrid AI architecture leveraging Titan's-like memory and HRM-like reasoning
☆26 · Jul 17, 2026 · Updated last week
hyoseokp / PRISM
PRISM: O(1) Photonic Block Selection for Long-Context LLM Inference — eliminates the O(N) KV cache scan via photonic broadcast-and-weight…
☆28 · Apr 28, 2026 · Updated 2 months ago
imneonizer / yolo-nas-retail-training
Training a YOLO NAS Model for detecting retail product items from shelf images using SKU110K dataset.
☆10 · Aug 13, 2023 · Updated 2 years ago
remichu-ai / pai-agent
The accompany backend for PAI app
☆12 · Mar 24, 2025 · Updated last year
wolfecameron / nanoMoE
An extension of the nanoGPT repository for training small MOE models.
☆280 · Mar 9, 2025 · Updated last year
mkurman / neuroblast-v3
NeuroBLAST v3 architecture code
☆37 · Jan 6, 2026 · Updated 6 months ago
aniket-mish / cuda
everything i know about cuda and triton
☆13 · Jan 28, 2025 · Updated last year
xTimeCrystal / MiniModel
☆42 · Feb 25, 2026 · Updated 5 months ago
eric612 / Mobilenet-YOLO-Pytorch
Include mobilenet series (v1,v2,v3...) and yolo series (yolov3,yolov4,...)
☆38 · Dec 29, 2021 · Updated 4 years ago
yousef-rafat / MaximusLLM
High-throughput long-context LLMs. Scaling context via RandNLA and massive vocab capacity through MAXIS Loss and Fisher-SVD.
☆28 · May 1, 2026 · Updated 2 months ago
bhargobdeka / multi-agent-apps
This repository will contain projects on multi-agent applications using frameworks such as crewai, langchain, gradio, hugging face etc.
☆25 · Aug 17, 2024 · Updated last year
inovex / recsys-training
Hands-on Training for Recommender Systems
☆11 · Jul 27, 2021 · Updated 4 years ago
abhishekkrthakur / chat-ext
chrome & firefox extension to chat with webpages: local llms
☆130 · Dec 20, 2024 · Updated last year
Abinesh-Mathivanan / beens-minimax
world's stupidest moe llm in 103M parameters
☆20 · Jul 18, 2025 · Updated last year
YuvrajSingh-mist / NeatRL
Repository of implementations of classic and sota rl algorithms from scratch in PyTorch
☆225 · Jun 30, 2026 · Updated 3 weeks ago
5afe / safe-ai-agent-tutorial
☆29 · Feb 13, 2025 · Updated last year
tlatkowski / neural-recommender
Neural recommender system implementation in TensorFlow.
☆15 · Mar 24, 2023 · Updated 3 years ago
Paulescu / plot-generator-agent
Join 15k builders to the Real-World ML Newsletter ⬇️⬇️⬇️
☆47 · May 10, 2024 · Updated 2 years ago
Infatoshi / rl-handbook
Code companion for the RL Post-Training Handbook - training reasoning models on a single GPU
☆19 · Jan 30, 2026 · Updated 5 months ago
ArturTanona / grpo_unsloth_docker
☆56 · Feb 10, 2025 · Updated last year
litagin02 / anime_speaker_embedding
Speaker embedding for anime speech domain based on ECAPA_TDNN
☆21 · Jun 22, 2025 · Updated last year
Ancastal / AI-Recruitment-Agent
A multi-agent recruitment assistant that leverages Microsoft AutoGen framework to streamline hiring processes. The system employs special…
☆46 · Feb 12, 2025 · Updated last year
adamjen / Prompt_Maker
Makes a improved prompts from a basic prompt
☆47 · Feb 5, 2026 · Updated 5 months ago
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆54 · Oct 29, 2025 · Updated 8 months ago