poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆5,177Updated 2 months ago
Alternatives and similar repositories for transformer-explainer
Users that are interested in transformer-explainer are comparing it to the libraries listed below
- 3D Visualization of a GPT-style LLM☆4,896Updated 11 months ago
- ☆5,176Updated 6 months ago
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆12,042Updated 3 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆27,307Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less…☆43,914Updated this week
- llama3 implementation one matrix multiplication at a time☆15,097Updated last year
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆10,976Updated 3 weeks ago
- Modeling, training, eval, and inference code for OLMo☆5,895Updated last week
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,877Updated this week
- 🪄 Create rich visualizations with AI☆12,916Updated last week
- Keep searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)☆4,752Updated this week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,478Updated 2 weeks ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,522Updated last month
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆13,833Updated 3 weeks ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,809Updated last month
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information r…☆20,103Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,604Updated last week
- 🤖 Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOp…☆4,083Updated 3 months ago
- Democratizing Reinforcement Learning for LLMs☆4,016Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆12,121Updated 3 months ago
- Video+code lecture on building nanoGPT from scratch☆4,305Updated last year
- Vision agent☆5,007Updated last week
- Curated list of datasets and tools for post-training.☆3,365Updated 3 weeks ago
- s1: Simple test-time scaling☆6,533Updated last month
- DataComp for Language Models☆1,351Updated 2 weeks ago
- Open-source AI cookbook☆2,202Updated 2 weeks ago
- Knowledge Agents and Management in the Cloud☆4,104Updated this week
- Awesome Reasoning LLM Tutorial/Survey/Guide☆1,986Updated last month
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆24,091Updated last week
- ☆5,479Updated last year