bbycroft / llm-viz
3D Visualization of a GPT-style LLM
☆5,222Updated last year
Alternatives and similar repositories for llm-viz
Users who are interested in llm-viz are comparing it to the repositories listed below
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆6,501Updated 2 weeks ago
- llama3 implementation one matrix multiplication at a time☆15,240Updated last year
- ☆4,113Updated last year
- High-speed Large Language Model Serving for Local Deployment☆8,611Updated last week
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆7,149Updated 3 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,728Updated 8 months ago
- The book "Large Language Models" (《大语言模型》), by Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, and Wen Jirong☆4,235Updated 5 months ago
- Train transformer language models with reinforcement learning.☆17,206Updated this week
- PyTorch native post-training library☆5,654Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,880Updated last year
- Building a quick conversation-based search demo with Lepton AI.☆8,121Updated 2 months ago
- OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, …☆6,615Updated last week
- Implement a ChatGPT-like LLM in PyTorch from scratch, step by step☆84,086Updated this week
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)☆10,142Updated last year
- Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization☆1,389Updated last year
- Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.☆6,180Updated 5 months ago
- Modeling, training, eval, and inference code for OLMo☆6,299Updated 2 months ago
- ☆5,719Updated last year
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,067Updated this week
- Video+code lecture on building nanoGPT from scratch☆4,707Updated last year
- A unified evaluation framework for large language models☆2,773Updated last week
- Large World Model -- Modeling Text and Video with Millions Context☆7,391Updated last year
- Retrieval and Retrieval-augmented LLMs☆11,222Updated last month
- Question and Answer based on Anything.☆13,848Updated 10 months ago
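- Llama3 and Llama3.1 Chinese post-training repository: interesting fine-tuned and modified weights, plus tutorial videos and docs for training, inference, evaluation, and deployment.☆4,157Updated 3 weeks ago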
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,287Updated last year
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆13,126Updated last week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆20,543Updated this week
- AIOS: AI Agent Operating System☆4,988Updated last week
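- Build a large language model from scratch with only basic Python knowledge; build GLM4, Llama3, and RWKV6 step by step from scratch to understand in depth how large models work☆3,930Updated last year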