bbycroft / llm-vizLinks

3D Visualization of an GPT-style LLM

☆4,852

Alternatives and similar repositories for llm-viz

Users that are interested in llm-viz are comparing it to the libraries listed below

Sorting:

naklecha / llama3-from-scratch
llama3 implementation one matrix multiplication at a time
☆15,053Updated last year
poloclub / transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
☆4,983Updated last month
openai / transformer-debugger
☆4,087Updated last year
ImagineAILab / ai-by-hand-excel
☆5,125Updated 6 months ago
karpathy / minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
☆9,786Updated last year
pytorch-labs / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,036Updated 3 months ago
google / gemma_pytorch
The official PyTorch implementation of Google's Gemma models
☆5,518Updated 2 months ago
stas00 / ml-engineering
Machine Learning Engineering Open Book
☆14,527Updated last week
jzhang38 / TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆8,667Updated last year
karpathy / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆4,241Updated 11 months ago
karpathy / llm.c
LLM training in simple, raw C/CUDA
☆27,249Updated last month
pytorch / torchtune
PyTorch native post-training library
☆5,366Updated this week
allenai / OLMo
Modeling, training, eval, and inference code for OLMo
☆5,822Updated last week
Lightning-AI / litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
☆12,564Updated last week
OpenGVLab / InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
☆8,654Updated 2 weeks ago
RahulSChand / gpu_poor
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
☆1,335Updated 7 months ago
CrazyBoyM / llama3-Chinese-chat
Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。
☆4,157Updated 2 months ago
andrewyng / translation-agent
☆5,466Updated 11 months ago
01-ai / Yi
A series of large language models trained from scratch by developers @01-ai
☆7,834Updated 8 months ago
karpathy / llama2.c
Inference Llama 2 in one file of pure C
☆18,597Updated 11 months ago
QwenLM / Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
☆18,840Updated last week
ml-explore / mlx-examples
Examples in the MLX framework
☆7,675Updated last month
EurekaLabsAI / ngram
The n-gram Language Model
☆1,437Updated 11 months ago
hpcaitech / Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
☆26,952Updated 3 months ago
mlfoundations / dclm
DataComp for Language Models
☆1,342Updated 4 months ago
jingyaogong / minimind-v
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM！🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
☆4,234Updated 3 months ago
wdndev / llama3-from-scratch-zh
从零实现一个 llama3 中文版
☆921Updated last year
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆6,949Updated last year
joonspk-research / generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
☆19,380Updated 11 months ago
openai / simple-evals
☆3,886Updated 3 weeks ago