poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆3,954Updated last week
Alternatives and similar repositories for transformer-explainer:
Users that are interested in transformer-explainer are comparing it to the libraries listed below
- 3D Visualization of an GPT-style LLM☆4,430Updated 5 months ago
- llama3 implementation one matrix multiplication at a time☆14,139Updated 8 months ago
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆5,042Updated this week
- ☆5,154Updated 6 months ago
- SGLang is a fast serving framework for large language models and vision language models.☆10,325Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆5,287Updated this week
- Simple, unified interface to multiple Generative AI providers☆10,964Updated this week
- Composable building blocks to build Llama Apps☆7,256Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,506Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆22,523Updated this week
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.☆5,852Updated 3 weeks ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆3,387Updated this week
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.☆6,466Updated this week
- Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.☆15,511Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,030Updated last month
- Vision agent☆2,873Updated this week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆7,531Updated this week
- ☆3,268Updated 3 weeks ago
- PyTorch native post-training library☆4,856Updated this week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.☆2,843Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,185Updated 3 months ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,762Updated 4 months ago
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,533Updated this week
- 🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽�…☆3,598Updated last month
- ☆2,176Updated last week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,434Updated 3 weeks ago
- Task-Aware Agent-driven Prompt Optimization Framework☆2,817Updated last month