poloclub / transformer-explainerLinks

Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization

☆4,983

Alternatives and similar repositories for transformer-explainer

Users that are interested in transformer-explainer are comparing it to the libraries listed below

Sorting:

bbycroft / llm-viz
3D Visualization of an GPT-style LLM
☆4,826Updated 11 months ago
naklecha / llama3-from-scratch
llama3 implementation one matrix multiplication at a time
☆15,053Updated last year
andrewyng / aisuite
Simple, unified interface to multiple Generative AI providers
☆12,304Updated last week
microsoft / graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆26,742Updated last week
SakanaAI / AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
☆11,325Updated 3 months ago
ImagineAILab / ai-by-hand-excel
☆5,111Updated 6 months ago
pytorch / torchtune
PyTorch native post-training library
☆5,361Updated last week
InternLM / MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
☆6,482Updated 3 weeks ago
QwenLM / Qwen-Agent
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
☆10,511Updated this week
adithya-s-k / omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
☆6,640Updated last month
aburkov / theLMbook
This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov
☆1,859Updated 2 months ago
unslothai / unsloth
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
☆42,597Updated this week
allenai / OLMo
Modeling, training, eval, and inference code for OLMo
☆5,822Updated this week
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆2,951Updated last week
Lightning-AI / LitServe
The easiest way to deploy agents, MCP servers, models, RAG, pipelines and more. No MLOps. No YAML.
☆3,398Updated this week
zilliztech / deep-searcher
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
☆6,681Updated 2 weeks ago
SwanHubX / SwanLab
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …
☆2,279Updated this week
allenai / olmocr
Toolkit for linearizing PDFs for LLM datasets/training
☆13,346Updated this week
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆12,547Updated last week
getomni-ai / zerox
OCR & Document Extraction using vision models
☆11,603Updated 2 months ago
google-deepmind / gemma
Gemma open-weight LLM library, from Google DeepMind
☆3,550Updated this week
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆16,386Updated this week
landing-ai / vision-agent
Vision agent
☆4,961Updated last week
microsoft / PromptWizard
Task-Aware Agent-driven Prompt Optimization Framework
☆3,434Updated 2 weeks ago
Jiayi-Pan / TinyZero
Minimal reproduction of DeepSeek R1-Zero
☆12,062Updated 3 months ago
simplescaling / s1
s1: Simple test-time scaling
☆6,510Updated last month
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆7,739Updated 5 months ago
huggingface / open-r1
Fully open reproduction of DeepSeek-R1
☆25,138Updated this week
RUC-NLPIR / FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
☆2,555Updated last week
hijkzzz / Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
☆6,798Updated this week