kevinpdev / gpt-from-scratch
Educational implementation of a small GPT model from scratch in a single Jupyter Notebook
☆116 Updated 9 months ago
Alternatives and similar repositories for gpt-from-scratch
Users who are interested in gpt-from-scratch are comparing it to the repositories listed below
- A fully functional and simple Machine Learning library made entirely from scratch with Python. ☆323 Updated 2 weeks ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks ☆218 Updated 5 months ago
- A straightforward method for training your LLM, from downloading data to generating text. ☆476 Updated 3 months ago
- ☆120 Updated 5 months ago
- Building a GPT-like LLM from scratch with PyTorch. ☆316 Updated 11 months ago
- ☆75 Updated 6 months ago
- ☆107 Updated 5 months ago
- Implementation of a GPT-4o-like multimodal model from scratch using Python ☆73 Updated 7 months ago
- ☆261 Updated 3 weeks ago
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile … ☆83 Updated last month
- Implementations of papers that I read; you can read my breakdowns on my blog ☆88 Updated last month
- The Fastest Way to Fine-Tune LLMs Locally ☆327 Updated 8 months ago
- Train LLM Model Behavior ☆662 Updated this week
- ☆96 Updated last month
- A curated list of materials on AI efficiency ☆190 Updated 3 weeks ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. ☆612 Updated 9 months ago
- Testing the different LLM and RAG Tests while I learn along the way ☆207 Updated 5 months ago
- A C++ implementation of a Multilayer Perceptron (MLP) neural network using Eigen, supporting multiple activation and loss functions, mini… ☆137 Updated 6 months ago
- Autograd to GPT-2 completely from scratch ☆125 Updated 3 months ago
- An LLM cookbook for building your own from scratch, all the way from gathering data to training a model ☆164 Updated last year
- Minimal and annotated implementations of key ideas from modern deep learning research. ☆1,207 Updated 2 months ago
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software ☆334 Updated 9 months ago
- Building a 2.3M-parameter LLM from scratch with LLaMA 1 architecture. ☆191 Updated last year
- ☆45 Updated 6 months ago
- It takes a village to raise a child: Google DeepThink 🧠 but in LangGraph and free - an original algorithm for collaborative agents using… ☆130 Updated 2 months ago
- A Deep Research agent from scratch ☆212 Updated 6 months ago
- Coding an LLM and its building blocks from scratch. ☆101 Updated 8 months ago
- Learn to build and deploy local Visual Language Models for Edge AI ☆325 Updated last month
- ☆399 Updated 2 weeks ago
- Learn the building blocks of how to build gpt-oss from scratch ☆105 Updated 2 months ago