Code to train a GPT-2 model on the TinyStories dataset, following the TinyStories paper.
☆40 Nov 24, 2023 Updated 2 years ago
Alternatives and similar repositories for TinyStories
Users that are interested in TinyStories are comparing it to the libraries listed below.
- Reproducing GPT on the TinyStories dataset ☆19 Jan 18, 2024 Updated 2 years ago
- Exploring the minimal architecture required for coherent English language generation. ☆12 Mar 5, 2025 Updated last year
- ☆21 May 24, 2023 Updated 2 years ago
- see github.com/understanding-search/maze-transformer ☆10 Dec 8, 2023 Updated 2 years ago
- Mathematica and Matlab toolboxes for Clifford algebras of n-dimensional Euclidean vector spaces ☆11 Jan 24, 2018 Updated 8 years ago
- Example code for the book "PyTorch Deep Learning Programming, Learned Step by Step Through Hands-On Practice" ☆23 Aug 17, 2022 Updated 3 years ago
- ☆15 May 20, 2023 Updated 2 years ago
- Blue Archive hoi4 modding github ☆21 Updated this week
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆38 Jun 6, 2023 Updated 2 years ago
- Windows one-click installer script for oobabooga/text-generation-webui ☆12 Mar 8, 2023 Updated 3 years ago
- Automatic subordinate clause extractor ☆11 Jul 7, 2022 Updated 3 years ago
- Official implementation for GATSBI: Generative Agent-centric Spatio-temporal Object Interaction (CVPR'2021) ☆12 Mar 23, 2022 Updated 4 years ago
- Paper elements by Google translated to React ☆13 Nov 20, 2014 Updated 11 years ago
- Final project for CS486 (AI) ☆11 Apr 26, 2017 Updated 8 years ago
- ☆11 Sep 25, 2025 Updated 6 months ago
- ☆12 Jun 22, 2024 Updated last year
- ☆13 Nov 3, 2016 Updated 9 years ago
- Word-frequency counting based on the Nagao algorithm ☆14 Dec 13, 2016 Updated 9 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 Apr 15, 2024 Updated last year
- ☆34 May 28, 2023 Updated 2 years ago
- A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning ☆10 Aug 4, 2023 Updated 2 years ago
- A blueprint for next-gen AI. Project Infinity uses a token-efficient, Codified Agent Protocol to create specialized, secure, and imaginat… ☆26 Mar 13, 2026 Updated 2 weeks ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021) ☆12 Oct 22, 2021 Updated 4 years ago
- Computation of binomial confidence intervals that achieve exact coverage. ☆14 Apr 23, 2025 Updated 11 months ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer ☆13 Jun 21, 2023 Updated 2 years ago
- Course materials for the MVA course "algorithms for speech and language processing" ☆12 Mar 29, 2023 Updated 2 years ago
- An exploration of how to use inter-subject representational similarity analysis (is-RSA) to study individual differences in brain activit… ☆20 Apr 22, 2020 Updated 5 years ago
- Code for "Transformer-Based Deep Survival Analysis" ☆12 May 27, 2022 Updated 3 years ago
- Flat is an Open-source web-based collaborative music score editor. ☆10 Jul 30, 2013 Updated 12 years ago
- ☆13 Nov 1, 2023 Updated 2 years ago
- ☆14 Jul 24, 2024 Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆10 Dec 30, 2024 Updated last year
- Patch for MPT-7B which allows using and training a LoRA ☆58 May 20, 2023 Updated 2 years ago
- VR pool simulator written in Python (using pyopenvr) ☆13 Mar 11, 2026 Updated 2 weeks ago
- Machine learning in nim ☆12 Aug 16, 2014 Updated 11 years ago
- This repository showcases how to use the DynamixelSDK C++ and Python APIs to control an Interbotix XSeries Arm. ☆17 Apr 28, 2021 Updated 4 years ago
- Makes llama.cpp easy to use. ☆12 May 14, 2025 Updated 10 months ago
- Pytorch routines for (Ker)nel (Mac)hines ☆11 Oct 10, 2025 Updated 5 months ago
- Hercules: Attributable and Scalable Opinion Summarization (ACL 2023) ☆20 Nov 8, 2023 Updated 2 years ago