FareedKhan-dev / train-llm-from-scratchLinks
A straightforward method for training your LLM, from downloading data to generating text.
☆504Updated 5 months ago
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆196Updated last year
- ☆722Updated last week
- Building DeepSeek R1 from Scratch☆742Updated 10 months ago
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆197Updated last year
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆499Updated last year
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆76Updated 9 months ago
- A Deep Research agent from scratch☆214Updated 8 months ago
- Building a GPT-like LLM from scratch with PyTorch.☆330Updated last year
- Maximizing the Performance of a Simple RAG using RL☆90Updated 10 months ago
- Model Activity Visualiser☆520Updated 9 months ago
- Building LLaMA 4 MoE from Scratch☆72Updated 9 months ago
- Converting Unstructured Data to a Knowledge Graph: An End-to-End Pipeline☆290Updated 9 months ago
- Build datasets using natural language☆559Updated 4 months ago
- [EMNLP'25 findings] This is the official repo for the paper, HiRAG: Retrieval-Augmented Generation with Hierarchical Knowledge.☆497Updated 2 months ago
- A step by step implementation of a complex RAG pipeline to solve real world situations☆395Updated 6 months ago
- [AAAI 2026 🔥 Poster] ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning☆317Updated 4 months ago
- Completed research on semantic retrieval augmented generation through novel semantic similarity graph traversal algorithms.☆266Updated 2 months ago
- LettuceDetect is a hallucination detection framework for RAG applications.☆527Updated 4 months ago
- REFRAG-style RAG (compress → sense/select → expand) — Single-file reference implementation☆203Updated 3 weeks ago
- Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,A…☆439Updated last year
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆259Updated 8 months ago
- AI Engineering bootcamp☆106Updated 10 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆680Updated 10 months ago
- Collection of resources for finetuning Large Language Models (LLMs).☆110Updated last year
- ☆78Updated last year
- ☆659Updated 10 months ago
- Turn topics into essays in seconds!☆191Updated 6 months ago
- RAG-VectorDB-Embedings-LlamaIndex-Langchain☆277Updated 3 months ago
- ☆43Updated last week
- Make any LLM to think like OpenAI o1 and deepseek R1☆491Updated 11 months ago