poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆6,196Updated last month
Alternatives and similar repositories for transformer-explainer
Users who are interested in transformer-explainer are comparing it to the libraries listed below.
- 3D Visualization of a GPT-style LLM☆5,162Updated last year
- llama3 implementation one matrix multiplication at a time☆15,199Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,034Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,624Updated 4 months ago
- 🪄 Create rich visualizations with AI☆14,504Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,716Updated 2 months ago
- The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices☆4,479Updated 9 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,624Updated 3 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.☆49,366Updated last week
- Simple, unified interface to multiple Generative AI providers☆13,038Updated this week
- Video+code lecture on building nanoGPT from scratch☆4,607Updated last year
- NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extra…☆2,783Updated last week
- Machine Learning Journal for Intermediate to Advanced Topics.☆2,248Updated 3 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆18,978Updated last month
- Build custom inference engines for models, agents, multi-modal systems, RAG, pipelines and more.☆3,737Updated this week
- ☆5,623Updated 10 months ago
- 100+ Fine-tuning Tutorial Notebooks on Google Colab, Kaggle and more.☆3,894Updated this week
- Understanding Deep Learning - Simon J.D. Prince☆8,588Updated 2 weeks ago
- Open-source AI cookbook☆2,545Updated last month
- OCR & Document Extraction using vision models☆11,992Updated 7 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆29,833Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,380Updated last month
- "A Guide to Building White-Box Large Models": a fully hand-built Tiny-Universe☆4,138Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆65,334Updated this week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,694Updated 2 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,490Updated 7 months ago
- A course on aligning smol models.☆6,541Updated last month
- Minimal reproduction of DeepSeek R1-Zero☆12,486Updated 7 months ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆18,509Updated 4 months ago
- This is the official repository for The Hundred-Page Language Models Book by Andriy Burkov☆2,045Updated 2 weeks ago