FareedKhan-dev / train-llm-from-scratchLinks
A straightforward method for training your LLM, from downloading data to generating text.
☆376Updated last month
Alternatives and similar repositories for train-llm-from-scratch
Users that are interested in train-llm-from-scratch are comparing it to the libraries listed below
Sorting:
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆175Updated last year
- A fully functional and simple Machine Learning library made entirely from scratch with Python.☆294Updated 3 weeks ago
- Building DeepSeek R1 from Scratch☆630Updated 3 months ago
- Educational implementation of a small GPT model from scratch in a single Jupyter Notebook☆96Updated 4 months ago
- Model Activity Visualiser☆506Updated 2 months ago
- A Deep Research agent from scratch☆189Updated last month
- A Straightforward, Step-by-Step Implementation of a Video Diffusion Model☆47Updated last month
- ☆596Updated this week
- Turn topics into essays in seconds!☆184Updated 2 months ago
- Interactive Pytorch forward pass visualization in notebooks☆242Updated 2 weeks ago
- Ollama's Interactive Prompt Engineering Tutorial☆249Updated 6 months ago
- Maximizing the Performance of a Simple RAG using RL☆62Updated 3 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch.☆554Updated this week
- LLaMA 3 is one of the most promising open-source model after Mistral, we will recreate it's architecture in a simpler manner.☆169Updated 10 months ago
- A list of useful Open Source tools and scrapers to gather data for LLMs☆238Updated 4 months ago
- Building a GPT-like LLM from scratch with PyTorch.☆254Updated 6 months ago
- ☆155Updated 2 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆306Updated 3 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆64Updated 2 months ago
- A simple Python program to implement the search-extract-summarize flow.☆269Updated last week
- Contains the public resources of Hands on GenAI book☆160Updated 5 months ago
- Hands-on tutorials on fine-tuning various LLMs using different fine-tuning techniques☆236Updated last week
- Generate large synthetic data using an LLM☆428Updated this week
- CPU inference for the DeepSeek family of large language models in C++☆302Updated 3 weeks ago
- Building LLaMA 4 MoE from Scratch☆53Updated 2 months ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆490Updated 4 months ago
- A collection of notebooks/recipes showcasing usecases of open-source models with Together AI.☆954Updated last week
- 😎 Awesome list of Retrieval-Augmented Generation (RAG) applications in Generative AI.☆486Updated last month
- A category wise collection of 200+ LLM survey papers.☆156Updated 2 months ago
- A Demo of Cache-Augmented Generation (CAG) in an LLM☆101Updated 2 weeks ago