kevinpdev / gpt-from-scratch
Educational implementation of a small GPT model from scratch in a single Jupyter Notebook
☆104 · Updated 4 months ago
Alternatives and similar repositories for gpt-from-scratch
Users who are interested in gpt-from-scratch are comparing it to the libraries listed below.
- A fully functional and simple Machine Learning library made entirely from scratch with Python. ☆295 · Updated last month
- A straightforward method for training your LLM, from downloading data to generating text. ☆397 · Updated 2 months ago
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks ☆150 · Updated 3 weeks ago
- ☆161 · Updated 3 weeks ago
- ☆105 · Updated last month
- ☆74 · Updated last month
- Chrome & Firefox extension to chat with webpages: local LLMs ☆119 · Updated 6 months ago
- The Fastest Way to Fine-Tune LLMs Locally ☆312 · Updated 3 months ago
- Building a GPT-like LLM from scratch with PyTorch. ☆264 · Updated 6 months ago
- Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs. ☆257 · Updated 3 weeks ago
- Pixelagent — Multimodal stateful agents ☆205 · Updated last month
- ☆42 · Updated 2 months ago
- ☆85 · Updated 3 weeks ago
- Sparse Inferencing for transformer based LLMs ☆193 · Updated last week
- An LLM cookbook for building your own from scratch, all the way from gathering data to training a model ☆147 · Updated last year
- Autograd to GPT-2 completely from scratch ☆114 · Updated 2 months ago
- Notate is a desktop chat application that takes AI conversations to the next level. It combines the simplicity of chat with advanced feat… ☆257 · Updated 4 months ago
- AI Engineering bootcamp ☆93 · Updated 4 months ago
- An interactive web-based demonstration of fundamental tabular Reinforcement Learning (RL) algorithms in a simple grid world environment. ☆67 · Updated last month
- Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software ☆318 · Updated 5 months ago
- A deep learning library built from scratch with complex neural networks examples built on top for learning purposes. ☆66 · Updated 5 months ago
- This repository contains the JFK Records dataset with ~2.2k declassified documents (~63k pages), cleaned, summarized, and stored in text … ☆83 · Updated 3 months ago
- Official Python implementation of the UTCP ☆184 · Updated this week
- Testing the different LLM and RAG Tests while I learn along the way ☆197 · Updated last month
- A C++ implementation of a Multilayer Perceptron (MLP) neural network using Eigen, supporting multiple activation and loss functions, mini… ☆138 · Updated last month
- Local & Private LLM that drafts responses LIKE you automatically ☆81 · Updated 8 months ago
- A flexible, adaptive classification system for dynamic text classification ☆336 · Updated 3 weeks ago
- ☆41 · Updated 10 months ago
- A Deep Research agent from scratch ☆197 · Updated 2 months ago
- Local LLMs generating equity research reports ☆20 · Updated 6 months ago