Om-Alve / smolGPT

☆1,416

Alternatives and similar repositories for smolGPT

Users who are interested in smolGPT are comparing it to the repositories listed below.

therealoliver / Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
☆597 Updated 4 months ago
Exorust / TorchLeet
Leetcode for Pytorch
☆1,333 Updated this week
dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,445 Updated 3 months ago
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆620 Updated 3 months ago
yousef-rafat / miniDiffusion
A reimplementation of Stable Diffusion 3.5 in pure PyTorch
☆639 Updated last month
Pravko-Solutions / FlashLearn
Integrate LLM in any pipeline - fit/predict pattern, JSON driven flows, and built-in concurrency support.
☆601 Updated 4 months ago
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆2,909 Updated last week
tanishqkumar / beyond-nanogpt
Minimal and annotated implementations of key ideas from modern deep learning research.
☆1,069 Updated 2 weeks ago
natolambert / rlhf-book
Textbook on reinforcement learning from human feedback
☆1,097 Updated last week
vlm-run / vlmrun-hub
A hub for various industry-specific schemas to be used with VLMs.
☆525 Updated last month
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,479 Updated 3 months ago
dhealy05 / frames_of_mind
Animating R1's thoughts.
☆383 Updated 5 months ago
alessiodm / drl-zh
Deep Reinforcement Learning: Zero to Hero!
☆2,118 Updated 11 months ago
felafax / felafax
Felafax is building AI infra for non-NVIDIA GPUs
☆566 Updated 5 months ago
vlm-run / vlmrun-cookbook
Examples and guides for using the VLM Run API
☆283 Updated last week
ash80 / RLHF_in_notebooks
RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks
☆166 Updated last month
klara-research / klarity
See Through Your Models
☆398 Updated last week
mlecauchois / micrograd-cuda
☆248 Updated last year
M4THYOU / TokenDagger
High-Performance Implementation of OpenAI's TikToken.
☆435 Updated 2 weeks ago
OpenCoder-llm / OpenCoder-llm
The Open Cookbook for Top-Tier Code Large Language Model
☆1,765 Updated 7 months ago
stanford-mast / blast
Browser-LLM Auto-Scaling Technology
☆531 Updated this week
getcellm / cellm
Use LLMs in Excel formulas
☆840 Updated this week
MarioSieg / magnetron
(WIP) A small but powerful, homemade PyTorch from scratch.
☆555 Updated last week
labmlai / inspectus
LLM Analytics
☆673 Updated 9 months ago
transformerlab / transformerlab-app
Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your …
☆3,630 Updated this week
varungodbole / prompt-tuning-playbook
A playbook for effectively prompting post-trained LLMs
☆885 Updated 6 months ago
likejazz / llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
☆987 Updated 2 months ago
mirth / chonky
Fully neural approach for text chunking
☆367 Updated 2 months ago
neural-maze / agentic-patterns-course
Implementing the 4 agentic patterns from scratch
☆1,427 Updated 4 months ago
PsyChip / machina
OpenCV+YOLO+LLAVA powered video surveillance system
☆763 Updated last week