rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆39,992Updated this week
Alternatives and similar repositories for LLMs-from-scratch:
Users that are interested in LLMs-from-scratch are comparing it to the libraries listed below
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.☆46,564Updated 3 weeks ago
- LLM101n: Let's build a Storyteller☆31,796Updated 6 months ago
- llama3 implementation one matrix multiplication at a time☆14,139Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆38,475Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,592Updated this week
- 3D Visualization of an GPT-style LLM☆4,430Updated 5 months ago
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆16,227Updated last week
- LLM training in simple, raw C/CUDA☆25,627Updated 4 months ago
- Machine Learning Engineering Open Book☆12,773Updated 2 weeks ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆22,523Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆30,529Updated this week
- A one stop repository for generative AI research updates, interview resources, notebooks and much more!☆10,715Updated last week
- The Memory layer for AI Agents☆24,688Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆21,892Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—language models☆21,930Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆7,856Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models"☆5,042Updated this week
- MLX: An array framework for Apple silicon☆19,152Updated this week
- Awesome-LLM: a curated list of Large Language Model☆21,493Updated 2 weeks ago
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆18,709Updated 4 months ago
- 21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/☆70,895Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆16,260Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,621Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆39,644Updated this week
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv…☆14,630Updated this week
- Agno is a lightweight library for building multi-modal Agents☆19,012Updated this week
- 🦜🔗 Build context-aware reasoning applications☆100,780Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆20,826Updated this week
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆21,365Updated 6 months ago