poloclub / transformer-explainerLinks
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆6,000Updated 2 weeks ago
Alternatives and similar repositories for transformer-explainer
Users that are interested in transformer-explainer are comparing it to the libraries listed below
Sorting:
- ☆5,514Updated 10 months ago
- llama3 implementation one matrix multiplication at a time☆15,189Updated last year
- Video+code lecture on building nanoGPT from scratch☆4,549Updated last year
- 3D Visualization of an GPT-style LLM☆5,128Updated last year
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,694Updated 4 months ago
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,849Updated last month
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆29,384Updated this week
- DataComp for Language Models☆1,394Updated 2 months ago
- Everything about the SmolLM and SmolVLM family of models☆3,423Updated last week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆17,990Updated 4 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,234Updated 9 months ago
- OCR & Document Extraction using vision models☆11,968Updated 6 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆48,542Updated last week
- Vision agent☆5,126Updated 2 weeks ago
- Understanding Deep Learning - Simon J.D. Prince☆8,510Updated last week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,617Updated 2 months ago
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation☆7,930Updated 6 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆3,165Updated last week
- Sky-T1: Train your own O1 preview model within $450☆3,356Updated 4 months ago
- NanoGPT (124M) in 3 minutes☆3,878Updated last week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,737Updated 5 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆30,047Updated last week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,022Updated 9 months ago
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆16,939Updated last week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆8,949Updated 10 months ago
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov☆1,987Updated 6 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,294Updated last month
- This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi…☆3,608Updated last week
- Open-source AI cookbook☆2,510Updated 3 weeks ago
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,157Updated this week