poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆4,135 · Updated last week
Alternatives and similar repositories for transformer-explainer:
Users who are interested in transformer-explainer are comparing it to the libraries listed below:
- 3D Visualization of a GPT-style LLM ☆4,569 · Updated 7 months ago
- llama3 implementation one matrix multiplication at a time ☆14,675 · Updated 10 months ago
- An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆6,230 · Updated 2 months ago
- PyTorch native post-training library ☆5,026 · Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆12,427 · Updated this week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. ☆6,316 · Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆23,891 · Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" ☆5,813 · Updated last month
- Retrieval and Retrieval-augmented LLMs ☆9,126 · Updated last week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...) ☆4,425 · Updated this week
- Supercharge Your LLM Application Evaluations ☆8,563 · Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆11,878 · Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a… ☆6,134 · Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆9,282 · Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆7,314 · Updated last month
- Neo4j graph construction from unstructured data using LLMs ☆3,207 · Updated this week
- ☆5,307 · Updated 7 months ago
- Knowledge Agents and Management in the Cloud ☆3,814 · Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy ☆23,393 · Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆8,359 · Updated this week
- Composable building blocks to build Llama Apps ☆7,577 · Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆21,795 · Updated last month
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,525 · Updated 8 months ago
- FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource) ☆2,019 · Updated last week
- ☆4,070 · Updated 9 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ☆5,935 · Updated this week
- A language model programming library. ☆5,703 · Updated last month
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,542 · Updated last week
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model ☆4,850 · Updated 6 months ago
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery ☆10,370 · Updated this week