poloclub / transformer-explainerLinks
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆4,726Updated 3 weeks ago
Alternatives and similar repositories for transformer-explainer
Users that are interested in transformer-explainer are comparing it to the libraries listed below
Sorting:
- verl: Volcano Engine Reinforcement Learning for LLMs☆10,204Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆26,164Updated last week
- Task-Aware Agent-driven Prompt Optimization Framework☆3,358Updated 3 weeks ago
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆7,334Updated this week
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,757Updated 2 weeks ago
- Toolkit for linearizing PDFs for LLM datasets/training☆13,063Updated this week
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬☆11,193Updated 2 months ago
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,358Updated 2 weeks ago
- 3D Visualization of an GPT-style LLM☆4,755Updated 10 months ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆9,788Updated 2 weeks ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.☆41,413Updated this week
- s1: Simple test-time scaling☆6,468Updated last week
- SGLang is a fast serving framework for large language models and vision language models.☆15,567Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆11,207Updated last month
- Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)☆4,521Updated 2 weeks ago
- Yet Another Document Translator☆4,444Updated this week
- ☆4,940Updated 5 months ago
- llama3 implementation one matrix multiplication at a time☆15,017Updated last year
- Everything about the SmolLM2 and SmolVLM family of models☆2,606Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction☆7,982Updated 6 months ago
- Video+code lecture on building nanoGPT from scratch☆4,182Updated 10 months ago
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆3,209Updated 2 weeks ago
- A simple, easy-to-hack GraphRAG implementation☆3,065Updated 2 months ago
- Democratizing Reinforcement Learning for LLMs☆3,411Updated last month
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a…☆2,511Updated last week
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆1,331Updated last week
- ☆6,557Updated last month
- Vision agent☆4,901Updated this week
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,122Updated 4 months ago
- Retrieval and Retrieval-augmented LLMs☆10,012Updated 3 weeks ago