kevinpdev / gpt-from-scratch
Educational implementation of a small GPT model from scratch in a single Jupyter Notebook
☆111Updated 7 months ago
Alternatives and similar repositories for gpt-from-scratch
Users who are interested in gpt-from-scratch are comparing it to the repositories listed below
- ☆116Updated 4 months ago
- A straightforward method for training your LLM, from downloading data to generating text.☆450Updated 2 months ago
- ☆103Updated 3 months ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆207Updated 3 months ago
- Building a GPT-like LLM from scratch with PyTorch.☆302Updated 9 months ago
- Enhancing LLMs with LoRA☆163Updated last month
- ☆259Updated 2 months ago
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software☆331Updated 8 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆322Updated 7 months ago
- Autograd to GPT-2 completely from scratch☆125Updated 2 months ago
- It takes a village to raise a child: Google DeepThink 🧠 but in LangGraph and free - an original algorithm for collaborative agents using…☆128Updated 3 weeks ago
- Implementations of Papers that I read, you can read my breakdown in my blog☆85Updated 2 months ago
- Chrome & Firefox extension to chat with webpages: local LLMs☆127Updated 9 months ago
- Ollama's Interactive Prompt Engineering Tutorial☆257Updated 10 months ago
- Open-source CLI toolkit for low-RAM finetuning, quantization, and deployment of LLMs☆90Updated 2 months ago
- 📓 A collection of generative AI open-source repositories that are actively being developed. If you are looking to build a solid profile …☆81Updated last week
- Pixelagent — Multimodal stateful agents☆218Updated 4 months ago
- An LLM cookbook for building your own from scratch, all the way from gathering data to training a model☆162Updated last year
- An LLM trained only on data from certain time periods to reduce modern bias☆562Updated 3 weeks ago
- A C++ implementation of a Multilayer Perceptron (MLP) neural network using Eigen, supporting multiple activation and loss functions, mini…☆135Updated 4 months ago
- A curated list of materials on AI efficiency☆173Updated 2 weeks ago
- llmbasedos — Local-First OS Where Your AI Agents Wake Up and Work☆275Updated last month
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment.☆76Updated 4 months ago
- A flexible, adaptive classification system for dynamic text classification☆477Updated last week
- ☆45Updated 5 months ago
- ☆75Updated 4 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆609Updated 7 months ago
- ☆57Updated 8 months ago
- This project implements optimizers for TensorFlow and Keras, which can be used in the same way as Keras optimizers. Machine learning, Dee…☆35Updated last week
- A simple tool that lets you explore different possible paths that an LLM might sample.☆190Updated 5 months ago