Reproducing GPT on the TinyStories dataset
☆19Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for gpt-tinystories
Users who are interested in gpt-tinystories are comparing it to the libraries listed below.
- Code to train a GPT-2 model on the TinyStories dataset, following the TinyStories paper☆40Nov 24, 2023Updated 2 years ago
- Final project for CS486 (AI)☆11Apr 26, 2017Updated 8 years ago
- An IRC bot for Common Lisp code evaluation☆25Sep 12, 2025Updated 7 months ago
- ☆13Nov 3, 2016Updated 9 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated 2 years ago
- git cvsimport'd version of the CLOCC repository on sourceforge.☆19Apr 7, 2010Updated 16 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- Computation of binomial confidence intervals that achieve exact coverage.☆14Apr 23, 2025Updated 11 months ago
- ☆16Oct 14, 2017Updated 8 years ago
- ☆13Nov 1, 2023Updated 2 years ago
- Jürgen Walther's AI Workbench for Common Lisp, restored from the CMU AI Repository☆14Nov 4, 2023Updated 2 years ago
- Tools to analyze Interlisp source code, to support VM development, and to eventually bootstrap systems☆16Jan 12, 2025Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Thread-safe queues and mailboxes☆14Mar 18, 2020Updated 6 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆12Oct 10, 2025Updated 6 months ago
- Implementation of Diffusion Policy☆13Dec 13, 2024Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Simple MoE - Day 17 of 365 Days of Repos☆18Jan 17, 2025Updated last year
- A state-of-the-art DirectX12 based pathtracer☆23Updated this week
- Go implementation of the Gun distributed graph database☆11Feb 26, 2019Updated 7 years ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- Asymmetric methods for partially observable reinforcement learning☆10Jun 9, 2025Updated 10 months ago
- Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"☆17Jul 1, 2024Updated last year
- A Jupyter-style custom node for executing Python code and plotting within ComfyUI workflows.☆37Mar 18, 2026Updated 3 weeks ago
- Tools for geospatial analysis of gridded and ungridded lightning fields☆12Mar 8, 2017Updated 9 years ago
- ARMA cell: a modular and effective approach for neural autoregressive modeling☆16May 29, 2024Updated last year
- NeurIPS22 "RankFeat: Rank-1 Feature Removal for Out-of-distribution Detection" and T-PAMI Extension☆20Feb 21, 2025Updated last year
- ☆14Aug 25, 2024Updated last year
- ☆17Feb 4, 2025Updated last year
- ☆18Oct 6, 2022Updated 3 years ago
- Implementation of <Symbolic Graphics Programming with Large Language Models>☆38Sep 14, 2025Updated 7 months ago
- ☆19Feb 6, 2025Updated last year
- Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award)☆17Feb 15, 2023Updated 3 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- An OpenAI API compatible images server to generate or manipulate images.☆17Feb 2, 2025Updated last year
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- A template project to both illustrate and serve as an example for plugin creations on top of the manim.☆20Apr 30, 2021Updated 4 years ago
- C++-Animation-(Standard-Template-Library)-Engine, or CASTLE for short, is a C++ plotting and animation engine created by BiliBili uploader …☆11Jan 17, 2021Updated 5 years ago
- Android Photo/Video Recording/Capture/Effects via OpenGL☆10Feb 21, 2021Updated 5 years ago