poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆6,336Updated 3 weeks ago
Alternatives and similar repositories for transformer-explainer
Users who are interested in transformer-explainer are comparing it to the libraries listed below.
- 3D Visualization of a GPT-style LLM☆5,183Updated last year
- llama3 implementation one matrix multiplication at a time☆15,219Updated last year
- ☆5,624Updated last year
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,721Updated 6 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆30,175Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,087Updated this week
- ☆5,679Updated 11 months ago
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬☆11,898Updated 3 weeks ago
- ☆4,110Updated last year
- Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, RAG. We als…☆18,136Updated 2 months ago
- NanoGPT (124M) in 3 minutes☆4,085Updated this week
- 🚀 Train a 26M-parameter vision multimodal VLM from scratch in just 1 hour!☆5,856Updated last week
- Convert PDF to markdown + JSON quickly with high accuracy☆30,780Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,254Updated last year
- Everything about the SmolLM and SmolVLM family of models☆3,539Updated last month
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆50,236Updated last week
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,863Updated 3 months ago
- ⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …☆3,365Updated last week
- Easily build AI systems with Evals, RAG, Agents, fine-tuning, synthetic data, and more.☆4,518Updated last week
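- 🧑‍🚀 Summary of the world's best LLM resources (multimodal generation, agents, coding assistance, AI paper review, data processing, model training, model inference, o1 models, MCP, small language models, vision-language models).☆7,178Updated last week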
- Simple, unified interface to multiple Generative AI providers☆13,315Updated 3 weeks ago
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆39,585Updated this week
- This package contains the original 2012 AlexNet code.☆2,809Updated 9 months ago
- Train transformer language models with reinforcement learning.☆16,844Updated last week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,047Updated 10 months ago
- Retrieval and Retrieval-augmented LLMs☆11,082Updated 3 weeks ago
- A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.☆3,766Updated this week
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,040Updated last week
- ☆4,262Updated 5 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.☆9,668Updated 3 months ago