Reproducing GPT on the TinyStories dataset
☆19Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for gpt-tinystories
Users that are interested in gpt-tinystories are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- code to train a gpt-2 model to train it on tiny stories dataset according to the TinyStories paper☆39Nov 24, 2023Updated 2 years ago
- Final project for CS486 (AI)☆11Apr 26, 2017Updated 9 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- This repository showcases how to use the DynamixelSDK C++ and Python APIs to control an Interbotix XSeries Arm.☆18Apr 28, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multi-view Reinforcement Learning☆11Feb 9, 2020Updated 6 years ago
- Voice synthesis library for Text-to-Speech applications (Currently HTS Engine rewrite in Rust language)☆13May 18, 2026Updated last week
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Go implementation of the Gun distributed graph database☆11Feb 26, 2019Updated 7 years ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 3 months ago
- Official implementation of the Informed Dreamer algorithm, based on DreamerV3☆22Jan 29, 2026Updated 3 months ago
- LMDB Adapter for gunDB☆14Dec 8, 2022Updated 3 years ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Oct 20, 2021Updated 4 years ago
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Jan 28, 2024Updated 2 years ago
- ☆14Aug 25, 2024Updated last year
- ☆16Feb 4, 2025Updated last year
- Implementation of <Symbolic Graphics Programming with Large Language Models>☆38Sep 14, 2025Updated 8 months ago
- This is a repository for RM2021 Software tutorial☆11Nov 4, 2020Updated 5 years ago
- Official Pytorch Code of the Paper "WorldWander: Bridging Egocentric and Exocentric Worlds in Video Generation"☆33May 13, 2026Updated 2 weeks ago
- An OpenAI API compatible images server to generate or manipulate images.☆18Feb 2, 2025Updated last year
- ☆20Feb 6, 2025Updated last year
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆18Mar 19, 2019Updated 7 years ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- Computing the greatest common divisor with transformers, source code for the paper https//arxiv.org/abs/2308.15594☆14Aug 11, 2025Updated 9 months ago
- ☆16Dec 16, 2024Updated last year
- If FPGAs are universal function approximators, can they be used like neural networks?☆10Jun 4, 2021Updated 4 years ago
- Thinker project☆16Sep 4, 2024Updated last year
- GunDB HTTP/HTTPS Server and API☆19Feb 15, 2018Updated 8 years ago
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated 5 months ago
- ☆11Mar 2, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Coaching a Teachable Student☆19Aug 1, 2023Updated 2 years ago
- ☆17Oct 31, 2023Updated 2 years ago
- ☆19Nov 25, 2022Updated 3 years ago
- Implementation of stop sequencer for Huggingface Transformers☆16Jun 6, 2023Updated 2 years ago
- ☆17May 27, 2019Updated 7 years ago
- An Angular module that allows to open images in canvas, zoom, pan, crop, resize and download image☆35Apr 1, 2014Updated 12 years ago