FareedKhan-dev / train-llm-from-scratchLinks
A straightforward method for training your LLM, from downloading data to generating text.
☆493Updated 4 months ago
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a GPT-like LLM from scratch with PyTorch.☆324Updated last year
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆195Updated last year
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆496Updated last year
- Building DeepSeek R1 from Scratch☆732Updated 9 months ago
- ☆714Updated last week
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆195Updated last year
- Maximizing the Performance of a Simple RAG using RL☆87Updated 9 months ago
- A step by step implementation of a complex RAG pipeline to solve real world situations☆381Updated 6 months ago
- A Deep Research agent from scratch☆214Updated 7 months ago
- Building LLaMA 4 MoE from Scratch☆70Updated 8 months ago
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆69Updated 4 months ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"☆756Updated 2 months ago
- Model Activity Visualiser☆519Updated 8 months ago
- Build datasets using natural language☆556Updated 3 months ago
- Train a 29M parameter GPT from Scratch☆30Updated 9 months ago
- Educational implementation of a small GPT model from scratch in a single Jupyter Notebook☆119Updated this week
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆74Updated 8 months ago
- Completed research on semantic retrieval augmented generation through novel semantic similarity graph traversal algorithms.☆265Updated last month
- Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,A…☆433Updated last year
- This is the official companion repository for the book The Complete LangGraph Blueprint: Build 50+ AI Agents for Business Success. The re…☆155Updated 6 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆460Updated last year
- 😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.☆893Updated last month
- AI Engineering bootcamp☆106Updated 9 months ago
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆251Updated 8 months ago
- Code samples from our Python agents tutorial☆109Updated 10 months ago
- Framework for enhancing LLMs for RAG tasks using fine-tuning.☆762Updated 2 weeks ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆680Updated 9 months ago
- Deep research agent to help you find the best GitHub repositories 🕵️!☆830Updated last month
- Converting Unstructured Data to a Knowledge Graph: An End-to-End Pipeline☆286Updated 8 months ago
- Curate High Quality Datasets, Train, Evaluate and Ship! 🚀☆753Updated this week