rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆51,173 · Updated this week
Alternatives and similar repositories for LLMs-from-scratch
Users who are interested in LLMs-from-scratch are comparing it to the repositories listed below.
- Awesome-LLM: a curated list of Large Language Model ☆23,819 · Updated last month
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆52,442 · Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. ☆55,680 · Updated 2 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,293 · Updated this week
- llama3 implementation one matrix multiplication at a time ☆15,001 · Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ☆17,490 · Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆49,721 · Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ☆38,731 · Updated 2 weeks ago
- LLM101n: Let's build a Storyteller ☆33,643 · Updated 10 months ago
- DSPy: The framework for programming—not prompting—language models ☆25,466 · Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" ☆10,093 · Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆41,856 · Updated 6 months ago
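- Build large language models from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to gain a deep understanding of how large models work ☆3,176 · Updated 10 months ago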
- Inference code for Llama models ☆58,399 · Updated 4 months ago
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work! ☆38,620 · Updated this week
- Train transformer language models with reinforcement learning. ☆14,193 · Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆18,774 · Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆25,911 · Updated this week
- Making large AI models cheaper, faster and more accessible ☆40,966 · Updated last week
- Fully open reproduction of DeepSeek-R1 ☆24,819 · Updated 2 weeks ago
- Code and documentation to train Stanford's Alpaca models, and generate the data. ☆30,040 · Updated 11 months ago
- Latest Advances on Multimodal Large Language Models ☆15,578 · Updated this week
- LLM inference in C/C++ ☆81,984 · Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆22,795 · Updated 10 months ago
- LLM training in simple, raw C/CUDA ☆26,906 · Updated last month
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆40,227 · Updated this week
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations. ☆21,953 · Updated this week
- Simple, unified interface to multiple Generative AI providers ☆12,161 · Updated last month
- The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud. ☆18,509 · Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs ☆10,500 · Updated last year