bbycroft / llm-vizLinks
3D Visualization of an GPT-style LLM
☆5,162Updated last year
Alternatives and similar repositories for llm-viz
Users that are interested in llm-viz are comparing it to the libraries listed below
Sorting:
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆6,196Updated last month
- llama3 implementation one matrix multiplication at a time☆15,199Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,217Updated last year
- ☆4,109Updated last year
- ☆5,609Updated last year
- Building a quick conversation-based search demo with Lepton AI.☆8,127Updated 2 weeks ago
- Modeling, training, eval, and inference code for OLMo☆6,220Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,034Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,165Updated 3 months ago
- Large World Model -- Modeling Text and Video with Millions Context☆7,383Updated last year
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,031Updated 10 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,846Updated last year
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,490Updated 7 months ago
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆4,167Updated 3 months ago
- ☆1,560Updated last month
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,800Updated last year
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,162Updated 7 months ago
- Run PyTorch LLMs locally on servers, desktop and mobile☆3,624Updated 3 months ago
- PyTorch native post-training library☆5,619Updated this week
- Understanding Deep Learning - Simon J.D. Prince☆8,588Updated 2 weeks ago
- ☆5,623Updated 10 months ago
- High-speed Large Language Model Serving for Local Deployment☆8,460Updated 4 months ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,707Updated 5 months ago
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,356Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,835Updated last year
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆1,383Updated last year
- Gemma open-weight LLM library, from Google DeepMind☆3,879Updated last month
- Question and Answer based on Anything.☆13,792Updated 8 months ago
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,753Updated 9 months ago
- DataComp for Language Models☆1,401Updated 3 months ago