FareedKhan-dev / train-llm-from-scratchLinks
A straightforward method for training your LLM, from downloading data to generating text.
☆425Updated 3 weeks ago
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a GPT-like LLM from scratch with PyTorch.☆281Updated 8 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆182Updated last year
- ☆634Updated this week
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆459Updated 8 months ago
- Model Activity Visualiser☆519Updated 4 months ago
- A Deep Research agent from scratch☆201Updated 3 months ago
- Building DeepSeek R1 from Scratch☆686Updated 5 months ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆178Updated last year
- Maximizing the Performance of a Simple RAG using RL☆79Updated 5 months ago
- 😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.☆588Updated last month
- A step by step implementation of a complex RAG pipeline to solve real world situations☆237Updated last month
- Educational implementation of a small GPT model from scratch in a single Jupyter Notebook☆106Updated 6 months ago
- Build datasets using natural language☆518Updated 3 months ago
- Turn topics into essays in seconds!☆187Updated last month
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆660Updated 5 months ago
- [EMNLP'25 findings] This is the official repo for the paper, HiRAG: Retrieval-Augmented Generation with Hierarchical Knowledge.☆282Updated this week
- LettuceDetect is a hallucination detection framework for RAG applications.☆478Updated this week
- A command-line interface tool for serving LLM using vLLM.☆361Updated last week
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆69Updated 4 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆490Updated 6 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆477Updated last month
- CPU inference for the DeepSeek family of large language models in C++☆310Updated 2 months ago
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe…☆254Updated this week
- Converting Unstructured Data to a Knowledge Graph: An End-to-End Pipeline☆238Updated 4 months ago
- ☆515Updated 5 months ago
- Building LLaMA 4 MoE from Scratch☆60Updated 4 months ago
- Python Implementation of MUVERA (Multi-Vector Retrieval via Fixed Dimensional Encodings)☆302Updated last month
- Inference, Fine Tuning and many more recipes with Gemma family of models☆267Updated last month
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec…☆292Updated last week
- Deep research agent to help you find the best GitHub repositories 🕵️!☆793Updated last month