poloclub / transformer-explainerLinks
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆4,983Updated last month
Alternatives and similar repositories for transformer-explainer
Users that are interested in transformer-explainer are comparing it to the libraries listed below
Sorting:
- 3D Visualization of an GPT-style LLM☆4,826Updated 11 months ago
- llama3 implementation one matrix multiplication at a time☆15,053Updated last year
- Simple, unified interface to multiple Generative AI providers☆12,304Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆26,742Updated last week
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬☆11,325Updated 3 months ago
- ☆5,111Updated 6 months ago
- PyTorch native post-training library☆5,361Updated last week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,482Updated 3 weeks ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆10,511Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,640Updated last month
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov☆1,859Updated 2 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.☆42,597Updated this week
- Modeling, training, eval, and inference code for OLMo☆5,822Updated this week
- Everything about the SmolLM and SmolVLM family of models☆2,951Updated last week
- The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.☆3,398Updated this week
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,681Updated 2 weeks ago
- ⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …☆2,279Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆13,346Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,547Updated last week
- OCR & Document Extraction using vision models☆11,603Updated 2 months ago
- Gemma open-weight LLM library, from Google DeepMind☆3,550Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆16,386Updated this week
- Vision agent☆4,961Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,434Updated 2 weeks ago
- Minimal reproduction of DeepSeek R1-Zero☆12,062Updated 3 months ago
- s1: Simple test-time scaling☆6,510Updated last month
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,739Updated 5 months ago
- Fully open reproduction of DeepSeek-R1☆25,138Updated this week
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆2,555Updated last week
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,798Updated this week