rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆37,992 Updated this week
Alternatives and similar repositories for LLMs-from-scratch:
Users who are interested in LLMs-from-scratch are comparing it to the libraries listed below.
- llama3 implementation one matrix multiplication at a time ☆14,030 Updated 7 months ago
- Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, RAG. We als… ☆15,910 Updated this week
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. ☆41,661 Updated this week
- Finetune Llama 3.3, Mistral, Phi-4, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory ☆20,611 Updated this week
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆9,326 Updated 6 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆11,197 Updated this week
- Open-Sora: Democratizing Efficient Video Production for All ☆23,110 Updated 3 weeks ago
- LLM training in simple, raw C/CUDA ☆25,047 Updated 3 months ago
- SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensiv… ☆14,188 Updated this week
- DSPy: The framework for programming—not prompting—language models ☆21,018 Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆21,096 Updated 5 months ago
- The Memory layer for your AI apps ☆23,953 Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆21,693 Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆38,486 Updated last month
- LLM101n: Let's build a Storyteller ☆31,021 Updated 5 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆33,809 Updated this week
- The official Meta Llama 3 GitHub site ☆27,957 Updated 5 months ago
- Machine Learning Engineering Open Book ☆12,353 Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆21,705 Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆15,474 Updated this week
- LLM based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations. ☆15,736 Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut… ☆37,558 Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆38,057 Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆18,680 Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆17,712 Updated 3 months ago
- This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project. ☆11,835 Updated 2 weeks ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆24,745 Updated this week
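- Only basic Python required: build a large language model from scratch; build GLM4\Llama3\RWKV6 step by step from scratch and gain a deep understanding of how large models work ☆2,052 Updated 5 months ago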
- Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI. ☆17,869 Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆38,227 Updated this week