Om-Alve / smolGPT
☆1,353Updated 3 months ago
Alternatives and similar repositories for smolGPT
Users that are interested in smolGPT are comparing it to the libraries listed below
Sorting:
- Things you can do with the token embeddings of an LLM☆1,441Updated last month
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆573Updated 2 months ago
- Everything about the SmolLM2 and SmolVLM family of models☆2,361Updated last month
- Minimal LLM inference in Rust☆983Updated 6 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,328Updated 3 weeks ago
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…☆610Updated last month
- A hub for various industry-specific schemas to be used with VLMs.☆506Updated 2 weeks ago
- Run and explore Llama models locally with minimal dependencies on CPU☆189Updated 7 months ago
- Animating R1's thoughts.☆380Updated 3 months ago
- Felafax is building AI infra for non-NVIDIA GPUs☆560Updated 3 months ago
- NanoGPT (124M) in 3 minutes☆2,546Updated 3 weeks ago
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙☆755Updated 2 weeks ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆550Updated last week
- A command-line interface for LLMs written in Bash.☆437Updated 2 months ago
- See Through Your Models☆390Updated 2 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆2,436Updated this week
- A modern model graph visualizer and debugger☆1,179Updated this week
- A fast Rust based tool to serialize text-based files in a repository or directory for LLM consumption☆2,059Updated this week
- (🚧 WIP) a course of LLM inference serving on Apple Silicon for systems engineers.☆1,871Updated last week
- Integrate LLM in any pipeline - fit/predict pattern, JSON driven flows, and built in concurency support.☆591Updated 2 months ago
- The Open Cookbook for Top-Tier Code Large Language Model☆1,693Updated 5 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆981Updated 3 weeks ago
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆915Updated 3 months ago
- LLM Analytics☆659Updated 6 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,404Updated 4 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆524Updated last week
- A course on aligning smol models.☆5,822Updated 3 months ago
- Implementing the 4 agentic patterns from scratch☆1,295Updated last month
- Use LLMs in Excel formulas☆810Updated this week
- A playbook for effectively prompting post-trained LLMs☆866Updated 3 months ago