poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆3,791, updated 3 weeks ago
Alternatives and similar repositories for transformer-explainer:
Users who are interested in transformer-explainer compare it to the repositories listed below.
- The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬 ☆8,660, updated this week
- An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆5,719, updated last week
- 3D Visualization of a GPT-style LLM ☆4,276, updated 4 months ago
- llama3 implementation one matrix multiplication at a time ☆14,030, updated 7 months ago
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains ☆4,147, updated last month
- Simple, unified interface to multiple Generative AI providers ☆9,752, updated last week
- ☆5,042, updated 5 months ago
- Composable building blocks to build Llama Apps ☆6,036, updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,462, updated this week
- A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques. ☆6,233, updated this week
- Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud. ☆3,878, updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆11,197, updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆7,353, updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" ☆3,639, updated last week
- ☆4,050, updated 7 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆6,576, updated this week
- Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory ☆20,611, updated this week
- MLE-Agent: Your intelligent companion for seamless AI engineering and research. Integrate with arxiv and paper with code to provide… ☆1,192, updated last week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,399, updated this week
- Official inference framework for 1-bit LLMs ☆12,615, updated 3 weeks ago
- PDF to Markdown with vision models ☆8,298, updated last month
- Agentic components of the Llama Stack APIs ☆4,072, updated this week
- A language model programming library. ☆5,556, updated 3 weeks ago
- Train a 26M-parameter GPT from scratch in just 3 hours! ☆4,984, updated last month
- Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension. ☆5,320, updated this week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance. ☆6,793, updated 3 weeks ago
- smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents. ☆5,197, updated this week
- A PyTorch native library for large model training ☆3,091, updated this week
- ☆7,156, updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆15,910, updated this week