rasbt / LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
☆44,540 Updated this week
Alternatives and similar repositories for LLMs-from-scratch:
Users who are interested in LLMs-from-scratch are comparing it to the libraries listed below.
- Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks. ☆49,195 Updated 2 months ago
- llama3 implementation one matrix multiplication at a time ☆14,871 Updated 10 months ago
- Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥 ☆37,183 Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations. ☆23,861 Updated 2 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆20,671 Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆11,981 Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆24,532 Updated this week
- Official code repo for the O'Reilly Book - "Hands-On Large Language Models" ☆6,112 Updated this week
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024) ☆46,735 Updated this week
- Awesome-LLM: a curated list of Large Language Model ☆22,784 Updated this week
- 🦜🔗 Build context-aware reasoning applications ☆105,662 Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut… ☆43,139 Updated this week
- DSPy: The framework for programming—not prompting—language models ☆23,550 Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆41,013 Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models. ☆137,188 Updated this week
- LLM101n: Let's build a Storyteller ☆33,171 Updated 8 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆22,237 Updated 8 months ago
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management. ☆15,965 Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆30,248 Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI ☆21,336 Updated this week
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations. ☆20,923 Updated this week
- A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers) ☆9,824 Updated 10 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆89,708 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆45,116 Updated this week
- The Memory layer for AI Agents ☆27,605 Updated this week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude. ☆11,739 Updated last month
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆19,626 Updated last month
- ⏩ Create, share, and use custom AI code assistants with our open-source IDE extensions and hub of models, rules, prompts, docs, and other… ☆25,665 Updated this week
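- Build a large language model from scratch with only basic Python; build GLM4, Llama3, and RWKV6 step by step from scratch to deeply understand how large models work ☆2,666 Updated 8 months ago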
- A list of AI autonomous agents ☆17,067 Updated last month