poloclub / transformer-explainerLinks
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆5,371Updated last week
Alternatives and similar repositories for transformer-explainer
Users that are interested in transformer-explainer are comparing it to the libraries listed below
Sorting:
- Video+code lecture on building nanoGPT from scratch☆4,331Updated last year
- llama3 implementation one matrix multiplication at a time☆15,132Updated last year
- ☆5,296Updated 7 months ago
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆68,653Updated last week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,894Updated last month
- SGLang is a fast serving framework for large language models and vision language models.☆17,656Updated this week
- Simple, unified interface to multiple Generative AI providers☆12,345Updated last week
- Machine Learning Journal for Intermediate to Advanced Topics.☆2,166Updated 7 months ago
- ☆5,495Updated last year
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov☆1,899Updated 3 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆11,286Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,607Updated 3 weeks ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆61,471Updated 3 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,993Updated this week
- Modeling, training, eval, and inference code for OLMo☆5,943Updated last week
- 3D Visualization of an GPT-style LLM☆4,977Updated last year
- LLM101n: Let's build a Storyteller☆34,328Updated last year
- s1: Simple test-time scaling☆6,541Updated 2 months ago
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API☆9,777Updated 2 months ago
- Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!☆8,450Updated this week
- ☆4,089Updated last year
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆27,863Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,702Updated last month
- The Open Cookbook for Top-Tier Code Large Language Model☆1,820Updated 9 months ago
- ⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …☆2,560Updated last week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,908Updated last year
- Task-Aware Agent-driven Prompt Optimization Framework☆3,571Updated last month
- Official inference framework for 1-bit LLMs☆21,185Updated 3 months ago
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,564Updated 8 months ago
- The easiest tool for fine-tuning LLM models, synthetic data generation, and collaborating on datasets.☆4,106Updated this week