poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆4,242Updated last week
Alternatives and similar repositories for transformer-explainer:
Users that are interested in transformer-explainer are comparing it to the libraries listed below
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,323Updated 3 months ago
- 3D Visualization of an GPT-style LLM☆4,630Updated 8 months ago
- llama3 implementation one matrix multiplication at a time☆14,891Updated 11 months ago
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,497Updated 2 weeks ago
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆7,431Updated 3 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆11,889Updated this week
- s1: Simple test-time scaling☆6,299Updated 3 weeks ago
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,483Updated 2 weeks ago
- In-depth tutorials on LLMs, RAGs and real-world AI agent applications.☆8,144Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,612Updated this week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆2,540Updated 2 weeks ago
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆6,681Updated this week
- Deploy high-performance AI models and inference pipelines on FastAPI with built-in batching, streaming and more.☆3,064Updated this week
- ☆5,352Updated 8 months ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆6,595Updated this week
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen2.5, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ ML…☆7,155Updated this week
- Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆37,364Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆24,664Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,004Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs☆6,909Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,461Updated 2 months ago
- 🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆3,098Updated 2 weeks ago
- OCR & Document Extraction using vision models☆11,011Updated this week
- Train transformer language models with reinforcement learning.☆13,373Updated this week
- Awesome Reasoning LLM Tutorial/Survey/Guide☆1,436Updated 2 weeks ago
- LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆1,966Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆5,997Updated 2 months ago
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆6,613Updated last week
- open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…☆3,286Updated 5 months ago
- Task-Aware Agent-driven Prompt Optimization Framework☆3,175Updated last month