bbycroft / llm-vizLinks
3D Visualization of an GPT-style LLM
☆4,769Updated 10 months ago
Alternatives and similar repositories for llm-viz
Users that are interested in llm-viz are comparing it to the libraries listed below
Sorting:
- Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization☆4,801Updated last month
- llama3 implementation one matrix multiplication at a time☆15,040Updated last year
- ☆4,083Updated last year
- Building a quick conversation-based search demo with Lepton AI.☆8,127Updated 3 weeks ago
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,436Updated last week
- An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)☆4,636Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆9,731Updated last year
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,920Updated 9 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,011Updated 3 months ago
- ☆1,505Updated 2 weeks ago
- Modeling, training, eval, and inference code for OLMo☆5,757Updated this week
- CodeGeeX4-ALL-9B, a versatile model for all AI software development scenarios, including code completion, code interpreter, web search, f…☆2,022Updated 10 months ago
- PyTorch native post-training library☆5,306Updated this week
- ☆5,444Updated 11 months ago
- Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).☆6,973Updated 5 months ago
- The official PyTorch implementation of Google's Gemma models☆5,496Updated last month
- Retrieval and Retrieval-augmented LLMs☆10,091Updated last month
- Gemma open-weight LLM library, from Google DeepMind☆3,494Updated this week
- official repository of aiXcoder-7B Code Large Language Model☆2,268Updated 5 months ago
- The n-gram Language Model☆1,432Updated 11 months ago
- Home of StarCoder2!☆1,932Updated last year
- Llama3、Llama3.1 中文后训练版仓库 - 微调、魔改版本有趣权重 & 训练、推理、评测、部署教程视频 & 文档。☆4,150Updated 2 months ago
- Question and Answer based on Anything.☆13,370Updated 3 months ago
- The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling st…☆2,125Updated 8 months ago
- A series of large language models trained from scratch by developers @01-ai☆7,831Updated 7 months ago
- SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?☆3,147Updated last week
- DataComp for Language Models☆1,322Updated 3 months ago
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆3,754Updated 3 months ago
- A Pythonic framework to simplify AI service building☆2,767Updated this week
- LLM training in simple, raw C/CUDA☆27,075Updated 2 weeks ago