kevinpdev / gpt-from-scratchLinks
Educational implementation of a small GPT model from scratch in a single Jupyter Notebook
☆113Updated 8 months ago
Alternatives and similar repositories for gpt-from-scratch
Users that are interested in gpt-from-scratch are comparing it to the libraries listed below
Sorting:
- A fully functional and simple Machine Learning library made entirely from scratch with Python.☆318Updated this week
- ☆118Updated 4 months ago
- A straightforward method for training your LLM, from downloading data to generating text.☆459Updated 3 months ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆210Updated 4 months ago
- Building a GPT-like LLM from scratch with PyTorch.☆308Updated 10 months ago
- ☆105Updated 4 months ago
- It takes a village to raise a child: Google DeepThink 🧠 but in LangGraph and free - an original algorithm for collaborative agents using…☆129Updated last month
- ☆259Updated this week
- The Fastest Way to Fine-Tune LLMs Locally☆324Updated 7 months ago
- ☆75Updated 5 months ago
- Ollama's Interactive Prompt Engineering Tutorial☆259Updated 11 months ago
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.☆81Updated 5 months ago
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile …☆83Updated last month
- A C++ implementation of a Multilayer Perceptron (MLP) neural network using Eigen, supporting multiple activation and loss functions, mini…☆136Updated 5 months ago
- ☆57Updated 8 months ago
- Enhancing LLMs with LoRA☆173Updated 2 weeks ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆609Updated 8 months ago
- Testing the different LLM and RAG Tests while I learn along the way☆206Updated 5 months ago
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model☆164Updated last year
- A LLM trained only on data from certain time periods to reduce modern bias☆607Updated last month
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture.☆190Updated last year
- Implementation of Stable Diffusion with PyTorch☆354Updated 8 months ago
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software☆334Updated 9 months ago
- Learn to build and deploy local Visual Language Models for Edge AI☆314Updated last week
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆277Updated 2 months ago
- Autograd to GPT-2 completely from scratch☆124Updated 2 months ago
- Implementation of a GPT-4o like Multimodal from Scratch using Python☆73Updated 7 months ago
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆73Updated 2 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆254Updated 3 weeks ago
- ☆45Updated 6 months ago