bbycroft / llm-vizLinks
3D Visualization of an GPT-style LLM
☆5,128Updated last year
Alternatives and similar repositories for llm-viz
Users that are interested in llm-viz are comparing it to the libraries listed below
Sorting:
- ☆4,110Updated last year
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆6,000Updated 2 weeks ago
- llama3 implementation one matrix multiplication at a time☆15,189Updated last year
- ☆5,514Updated 10 months ago
- Understanding Deep Learning - Simon J.D. Prince☆8,510Updated last week
- Video+code lecture on building nanoGPT from scratch☆4,549Updated last year
- Modeling, training, eval, and inference code for OLMo☆6,168Updated last month
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,152Updated 2 weeks ago
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆4,115Updated 2 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,848Updated last year
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)☆10,109Updated last year
- 从零实现一个 llama3 中文版☆986Updated last year
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,183Updated last year
- Run Mixtral-8x7B models in Colab or consumer desktops☆2,324Updated last year
- High-speed Large Language Model Serving for Local Deployment☆8,409Updated 3 months ago
- 仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理☆3,743Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,975Updated this week
- The official GitHub page for the survey paper "A Survey of Large Language Models".☆11,975Updated 8 months ago
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,164Updated 6 months ago
- GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型☆6,929Updated 4 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,156Updated 3 months ago
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆4,996Updated last week
- PyTorch native post-training library☆5,604Updated this week
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,336Updated last year
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, …☆6,328Updated last week
- Open-source AI cookbook☆2,510Updated 3 weeks ago
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆12,476Updated 2 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆27,987Updated 7 months ago
- Building a quick conversation-based search demo with Lepton AI.☆8,133Updated 2 weeks ago
- Gemma open-weight LLM library, from Google DeepMind☆3,825Updated last week