FareedKhan-dev / train-llm-from-scratch
A straightforward method for training your LLM, from downloading data to generating text.
☆397Updated 2 months ago
Alternatives and similar repositories for train-llm-from-scratch
Users who are interested in train-llm-from-scratch are comparing it to the libraries listed below
- Building a GPT-like LLM from scratch with PyTorch.☆264Updated 6 months ago
- Building DeepSeek R1 from Scratch☆654Updated 3 months ago
- Model Activity Visualiser☆510Updated 3 months ago
- ☆605Updated this week
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆180Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆172Updated 10 months ago
- A Deep Research agent from scratch☆197Updated 2 months ago
- Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.☆642Updated 3 months ago
- Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs…☆439Updated 7 months ago
- Build datasets using natural language☆500Updated 2 months ago
- 😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.☆520Updated this week
- Educational implementation of a small GPT model from scratch in a single Jupyter Notebook☆104Updated 4 months ago
- Generate large synthetic data using an LLM☆433Updated last week
- Make any LLM think like OpenAI o1 and DeepSeek R1☆491Updated 5 months ago
- This repository provides a Python script to fetch and summarize research papers from arXiv using the free Gemini API☆236Updated 4 months ago
- Building LLaMA 4 MoE from Scratch☆56Updated 3 months ago
- Maximizing the Performance of a Simple RAG using RL☆66Updated 3 months ago
- CPU inference for the DeepSeek family of large language models in C++☆308Updated last month
- The Fastest Way to Fine-Tune LLMs Locally☆312Updated 3 months ago
- A fully functional and simple Machine Learning library made entirely from scratch with Python.☆295Updated last month
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software☆318Updated 5 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆473Updated 6 months ago
- Implementation of a GPT-4o-like multimodal model from scratch using Python☆68Updated 3 months ago
- First-principle implementations of groundbreaking AI algorithms using a wide range of deep learning frameworks, accompanied by supporting…☆174Updated last week
- Ollama's Interactive Prompt Engineering Tutorial☆251Updated 7 months ago
- A flexible, adaptive classification system for dynamic text classification☆336Updated 3 weeks ago
- Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"☆330Updated 3 months ago
- Interactive PyTorch forward pass visualization in notebooks☆398Updated last week
- A simple Python program to implement the search-extract-summarize flow.☆269Updated last month
- 🤗 Benchmark Large Language Models Reliably On Your Data☆359Updated last week