bbycroft / llm-vizLinks
3D Visualization of an GPT-style LLM
☆4,852Updated 11 months ago
Alternatives and similar repositories for llm-viz
Users that are interested in llm-viz are comparing it to the libraries listed below
Sorting:
- llama3 implementation one matrix multiplication at a time☆15,053Updated last year
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆4,983Updated last month
- ☆4,087Updated last year
- ☆5,125Updated 6 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,786Updated last year
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,036Updated 3 months ago
- The official PyTorch implementation of Google's Gemma models☆5,518Updated 2 months ago
- Machine Learning Engineering Open Book☆14,527Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,667Updated last year
- Video+code lecture on building nanoGPT from scratch☆4,241Updated 11 months ago
- LLM training in simple, raw C/CUDA☆27,249Updated last month
- PyTorch native post-training library☆5,366Updated this week
- Modeling, training, eval, and inference code for OLMo☆5,822Updated last week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆12,564Updated last week
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆8,654Updated 2 weeks ago
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆1,335Updated 7 months ago
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,157Updated 2 months ago
- ☆5,466Updated 11 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,834Updated 8 months ago
- Inference Llama 2 in one file of pure C☆18,597Updated 11 months ago
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.☆18,840Updated last week
- Examples in the MLX framework☆7,675Updated last month
- The n-gram Language Model☆1,437Updated 11 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆26,952Updated 3 months ago
- DataComp for Language Models☆1,342Updated 4 months ago
- 🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆4,234Updated 3 months ago
- 从零实现一个 llama3 中文版☆921Updated last year
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆6,949Updated last year
- Generative Agents: Interactive Simulacra of Human Behavior☆19,380Updated 11 months ago
- ☆3,886Updated 3 weeks ago