FareedKhan-dev / train-llm-from-scratchLinks
A straightforward method for training your LLM, from downloading data to generating text.
☆512Updated 6 months ago
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- ☆733Updated last week
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆197Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆200Updated last year
- Model Activity Visualiser☆521Updated 10 months ago
- Building a GPT-like LLM from scratch with PyTorch.☆334Updated last year
- Building DeepSeek R1 from Scratch☆744Updated 10 months ago
- A Deep Research agent from scratch☆216Updated 8 months ago
- A step by step implementation of a complex RAG pipeline to solve real world situations☆414Updated 7 months ago
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆502Updated last year
- Building LLaMA 4 MoE from Scratch☆72Updated 9 months ago
- Build datasets using natural language☆566Updated 4 months ago
- Maximizing the Performance of a Simple RAG using RL☆90Updated 10 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆492Updated last year
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆77Updated 10 months ago
- Educational implementation of a small GPT model from scratch in a single Jupyter Notebook☆122Updated last month
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆683Updated 10 months ago
- Ollama's Interactive Prompt Engineering Tutorial☆266Updated last year
- Train a 29M parameter GPT from Scratch☆33Updated 11 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆260Updated 9 months ago
- Converting Unstructured Data to a Knowledge Graph: An End-to-End Pipeline☆292Updated 9 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆493Updated 6 months ago
- Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline☆830Updated last week
- ☆242Updated 4 months ago
- AI Engineering bootcamp☆107Updated 11 months ago
- Turn topics into essays in seconds!☆191Updated 7 months ago
- A comprehensive book on neural networks and large language models in NLP☆555Updated 2 months ago
- ☆80Updated 6 months ago
- ☆665Updated 11 months ago
- META‑AGENTIC α‑AGI 👁️✨ — Mission 🎯 End‑to‑end: Identify 🔍 → Out‑Learn 📚 → Out‑Think 🧠 → Out‑Design 🎨 → Out‑Strategise ♟️ → Out‑Exe…☆278Updated 2 weeks ago
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model☆168Updated last year