poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
β3,371Updated last month
Related projects β
Alternatives and complementary repositories for transformer-explainer
- π An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)β5,185Updated 2 weeks ago
- Composable building blocks to build Llama Appsβ4,594Updated this week
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.β2,489Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Modelβ6,053Updated this week
- Parse files for optimal RAGβ3,173Updated last week
- A simple screen parsing tool towards pure vision based GUI agentβ4,768Updated 2 weeks ago
- Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"β3,211Updated 6 months ago
- Run PyTorch LLMs locally on servers, desktop and mobileβ3,383Updated this week
- Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.β3,153Updated last month
- llama3 implementation one matrix multiplication at a timeβ13,741Updated 5 months ago
- High-quality datasets, tools, and concepts for LLM fine-tuning.β2,010Updated 3 weeks ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β6,985Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ6,347Updated 2 weeks ago
- Ingest, parse, and optimize any data format β‘οΈ from documents to multimedia β‘οΈ for enhanced compatibility with GenAI frameworksβ5,648Updated 2 weeks ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-oβ3,540Updated 2 weeks ago
- GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant toβ¦β1,729Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) systemβ19,247Updated this week
- β1,456Updated 3 weeks ago
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.β5,211Updated this week
- Neo4j graph construction from unstructured data using LLMsβ2,506Updated this week
- β4,035Updated 5 months ago
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Modelβ3,597Updated last month
- The easiest way to use Agentic RAG in any enterpriseβ3,866Updated this week
- Vision agentβ1,308Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.β10,734Updated last week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. ζ₯θΏGPT-4o葨η°ηεΌζΊε€ζ¨‘ζε―Ήθ―樑εβ6,055Updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.β2,710Updated this week
- A simple, easy-to-hack GraphRAG implementationβ1,534Updated this week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)β3,977Updated last week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"β2,281Updated last month