Reproducing GPT on the TinyStories dataset
☆19Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for gpt-tinystories
Users that are interested in gpt-tinystories are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- code to train a gpt-2 model to train it on tiny stories dataset according to the TinyStories paper☆39Nov 24, 2023Updated 2 years ago
- Official implementation for GATSBI: Generative Agent-centric Spatio-temporal Object Interaction (CVPR'2021)☆12Mar 23, 2022Updated 4 years ago
- Final project for CS486 (AI)☆11Apr 26, 2017Updated 9 years ago
- ☆13Nov 3, 2016Updated 9 years ago
- DOS Disk Editor, handles all types of FAT file systems ( including exFAT )☆22Nov 1, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- ☆31Mar 17, 2026Updated 2 months ago
- Computation of binomial confidence intervals that achieve exact coverage.☆16Apr 23, 2025Updated last year
- ☆13Nov 1, 2023Updated 2 years ago
- A Hardware-Generated CPU Test Suite for the NEC V20☆23Aug 19, 2025Updated 9 months ago
- Move blog and personal web page over to github (work in progress)☆16Feb 21, 2026Updated 3 months ago
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 8 months ago
- Simple MoE - Day 17 of 365 Days of Repos☆20Jun 2, 2026Updated 2 weeks ago
- Go implementation of the Gun distributed graph database☆11Feb 26, 2019Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Curiosity in Multi-Step Motion Planning☆13Jul 15, 2020Updated 5 years ago
- a very fast headless DOS emulator for Linux☆26Mar 20, 2026Updated 2 months ago
- Implementation of Diffusion Policy☆14Dec 13, 2024Updated last year
- Asymmetric methods for partially observable reinforcement learning☆10Jun 9, 2025Updated last year
- D-Flat TUI Library☆19Dec 28, 2021Updated 4 years ago
- ☆11Jun 20, 2023Updated 2 years ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Oct 20, 2021Updated 4 years ago
- Environment codebase for ICRA 2020 paper "Towards Practical Multi-object Manipulation using Relational Reinforcement Learning"☆14Jul 22, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Jan 28, 2024Updated 2 years ago
- ☆14Aug 25, 2024Updated last year
- Lightweight 8086 simulator for running ELKS and DOS programs☆22Mar 23, 2023Updated 3 years ago
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award)☆17Feb 15, 2023Updated 3 years ago
- This is a repository for RM2021 Software tutorial☆11Nov 4, 2020Updated 5 years ago
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- Experiments for "A Closer Look at In-Context Learning under Distribution Shifts"☆18May 29, 2023Updated 3 years ago
- C++-Animation-(Standard-Template-Library)-Engine,or CASTLE for short,is a C++ plotting and animation engine created by BiliBili uploader …☆11Jan 17, 2021Updated 5 years ago
- This module offers lathe and extrude components to aframevr☆11Jul 3, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆21Feb 6, 2025Updated last year
- Android Photo/Video Recording/Capture/Effects via OpenGL☆10Feb 21, 2021Updated 5 years ago
- PAct: Part-Decomposed Single-View Articulated Object Generation☆54May 12, 2026Updated last month
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆21Jul 29, 2024Updated last year
- ☆16Dec 16, 2024Updated last year
- If FPGAs are universal function approximators, can they be used like neural networks?☆10Jun 4, 2021Updated 5 years ago
- Thinker project☆16Sep 4, 2024Updated last year