poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆6,621Updated 3 weeks ago
Alternatives and similar repositories for transformer-explainer
Users that are interested in transformer-explainer are comparing it to the libraries listed below
- llama3 implementation one matrix multiplication at a time☆15,241Updated last year
- 3D Visualization of a GPT-style LLM☆5,234Updated last year
- A course on aligning smol models.☆6,579Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,137Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆67,023Updated this week
- A simple screen parsing tool towards pure vision based GUI agent☆24,344Updated 4 months ago
- ☆5,725Updated last year
- Open-source AI cookbook☆2,586Updated 3 weeks ago
- 🪄 Create rich visualizations with AI☆14,801Updated last week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆51,625Updated this week
- PyTorch native post-training library☆5,669Updated this week
- About Awesome things towards foundation agents. Papers / Repos / Blogs / ...☆1,955Updated 6 months ago
- Everything about the SmolLM and SmolVLM family of models☆3,602Updated 3 weeks ago
- Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!☆8,852Updated this week
- Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning☆4,040Updated 2 months ago
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov☆2,087Updated last month
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,625Updated 3 months ago
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆74,834Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆20,239Updated last month
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆7,563Updated 2 months ago
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi is a family of open-source AI models developed by Microsoft. Phi…☆3,670Updated this week
- Machine Learning Journal for Intermediate to Advanced Topics.☆2,270Updated 5 months ago
- Understanding Deep Learning - Simon J.D. Prince☆9,051Updated 3 weeks ago
- Curated list of datasets and tools for post-training.☆4,229Updated 3 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆13,234Updated last week
- 🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽…☆4,282Updated 9 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,080Updated last year
- Vision agent☆5,217Updated last week
- The Hugging Face course on Transformers☆3,683Updated 3 weeks ago
- Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.☆17,068Updated last year