FareedKhan-dev / train-llm-from-scratch
A straightforward method for training your LLM, from downloading data to generating text.
☆316Updated last month
Alternatives and similar repositories for train-llm-from-scratch
Users who are interested in train-llm-from-scratch are comparing it to the repositories listed below
- Unsloth Fine-tuning Notebooks for Google Colab, Kaggle, Hugging Face and more.☆311Updated this week
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆453Updated 4 months ago
- ☆567Updated this week
- Building DeepSeek R1 from Scratch☆592Updated last month
- Oliva Multi-Agent Assistant☆352Updated last month
- Turn topics into essays in seconds!☆178Updated 2 weeks ago
- Maximizing the Performance of a Simple RAG using RL☆57Updated last month
- Model Activity Visualiser☆477Updated last month
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆366Updated 5 months ago
- Implementation of all RAG techniques in a simpler way☆1,755Updated this week
- Train a Language Model with GRPO to create a schedule from a list of events and priorities☆165Updated 2 weeks ago
- 😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.☆399Updated last week
- Building a GPT-like LLM from scratch with PyTorch.☆219Updated 4 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆945Updated 2 weeks ago
- A list of useful Open Source tools and scrapers to gather data for LLMs☆233Updated 2 months ago
- A fully custom chatbot built with Agentic RAG (Retrieval-Augmented Generation), combining Gemini models with a local knowledge base for a…☆138Updated 3 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆164Updated last year
- Deep research agent to help you find the best GitHub repositories 🕵️!☆709Updated last week
- A simple tool that lets you explore different possible paths that an LLM might sample.☆170Updated last week
- ☆48Updated 2 weeks ago
- From scratch implementation of a vision language model in pure PyTorch☆214Updated last year
- LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.☆161Updated 8 months ago
- A collection of notebooks/recipes showcasing use cases of open-source models with Together AI.☆897Updated last week
- A simple Python program to implement the search-extract-summarize flow.☆262Updated 3 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆293Updated last month
- recursive rag with r1 reasoning☆294Updated 2 months ago
- Educational implementation of a small GPT model from scratch in a single Jupyter Notebook☆96Updated 2 months ago
- Minimal and annotated implementations of key ideas from modern deep learning research.☆524Updated this week
- ☆138Updated last month
- 🤗 Benchmark Large Language Models Reliably On Your Data☆295Updated this week