poloclub / transformer-explainerLinks
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆6,501Updated 2 weeks ago
Alternatives and similar repositories for transformer-explainer
Users that are interested in transformer-explainer are comparing it to the libraries listed below
Sorting:
- 3D Visualization of an GPT-style LLM☆5,222Updated last year
- Everything about the SmolLM and SmolVLM family of models☆3,579Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,589Updated 3 months ago
- Open-source AI cookbook☆2,583Updated 2 weeks ago
- ☆4,316Updated 6 months ago
- Machine Learning Journal for Intermediate to Advanced Topics.☆2,270Updated 4 months ago
- Simple, unified interface to multiple Generative AI providers☆13,394Updated last month
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,816Updated 4 months ago
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.☆3,994Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,126Updated last week
- Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning☆4,012Updated 2 months ago
- s1: Simple test-time scaling☆6,635Updated 7 months ago
- A framework for few-shot evaluation of language models.☆11,298Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆12,646Updated 9 months ago
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆18,066Updated this week
- Video+code lecture on building nanoGPT from scratch☆4,707Updated last year
- A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.☆3,740Updated last month
- SGLang is a high-performance serving framework for large language models and multimodal models.☆22,800Updated last week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆20,062Updated last month
- llama3 implementation one matrix multiplication at a time☆15,240Updated last year
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆30,554Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆18,171Updated 2 months ago
- PyTorch native post-training library☆5,654Updated last week
- Democratizing Reinforcement Learning for LLMs☆5,060Updated this week
- A 4-hour coding workshop to understand how LLMs are implemented and used☆1,065Updated last year
- Awesome Reasoning LLM Tutorial/Survey/Guide☆2,280Updated 3 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆18,756Updated this week
- ☆2,508Updated 3 weeks ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,053Updated last week
- NanoGPT (124M) in 2 minutes☆4,515Updated this week