Code to train a GPT-2 model on the TinyStories dataset, following the TinyStories paper
☆40 Nov 24, 2023 Updated 2 years ago
Alternatives and similar repositories for TinyStories
Users that are interested in TinyStories are comparing it to the libraries listed below
- ☆21 May 24, 2023 Updated 2 years ago
- ☆15 May 20, 2023 Updated 2 years ago
- Example code for the book "PyTorch Deep Learning Programming: Learning Step by Step Through Hands-On Practice" ☆22 Aug 17, 2022 Updated 3 years ago
- annoy long term memory experiment for oobabooga/text-generation-webui ☆31 Jul 17, 2023 Updated 2 years ago
- ☆33 May 28, 2023 Updated 2 years ago
- Synthetic Hypertext and Homomorphic Catalogue ☆15 Dec 28, 2024 Updated last year
- ☆13 Nov 3, 2016 Updated 9 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 Apr 15, 2024 Updated last year
- attempt to perma root the NEC Terrain android phone ☆10 Jul 24, 2015 Updated 10 years ago
- ☆30 Dec 3, 2023 Updated 2 years ago
- GPU-accelerated first-order low-rank SDP solver ☆13 Mar 17, 2025 Updated 11 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :) ☆17 Updated this week
- ☆10 Nov 13, 2024 Updated last year
- ☆11 Sep 29, 2014 Updated 11 years ago
- Getting VibeVoice 7b working with 10 gb of vram. ☆14 Aug 31, 2025 Updated 6 months ago
- A complete open source e-commerce solution built with Rust (STILL IN DEVELOPMENT). ☆11 Jul 29, 2018 Updated 7 years ago
- Code release for "Transfer Adversarial Hashing for Hamming Space Retrieval" (AAAI 2018) ☆13 Jun 15, 2018 Updated 7 years ago
- A workaround way to get a DeepL translation API locally while not needing a API key at all. Emulates chromium headlessly and makes a loca… ☆13 Nov 6, 2024 Updated last year
- A pure-Python DNS server for local development. ☆10 Apr 23, 2018 Updated 7 years ago
- Recreation of Kubernetes the Hard Way in containers instead of GCP. ☆10 Jun 29, 2020 Updated 5 years ago
- This is a repository for RM2021 Software tutorial ☆11 Nov 4, 2020 Updated 5 years ago
- Simple MoE - Day 17 of 365 Days of Repos ☆17 Jan 17, 2025 Updated last year
- see github.com/understanding-search/maze-transformer ☆10 Dec 8, 2023 Updated 2 years ago
- Automated operation and maintenance platform based on SaltStack. ☆10 Apr 19, 2020 Updated 5 years ago
- Final project for CS486 (AI) ☆11 Apr 26, 2017 Updated 8 years ago
- linear algebra package. like gonum/mat, but small. lets say gonum-lite ☆12 Jul 8, 2023 Updated 2 years ago
- Feature Pyramid Networks for Object Detection on caffe ☆10 Nov 8, 2017 Updated 8 years ago
- A project designed to build and render a full Minecraft crafting tree. ☆10 Aug 10, 2021 Updated 4 years ago
- Surgically de-slop LLMs ☆14 Jun 1, 2025 Updated 9 months ago
- Kubernetes locally using kubeadm and https://github.com/coreos/coreos-vagrant ☆12 Mar 29, 2018 Updated 7 years ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs. ☆14 Aug 8, 2025 Updated 6 months ago
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs" ☆10 Dec 30, 2024 Updated last year
- ☆10 Aug 9, 2023 Updated 2 years ago
- Official implementation for GATSBI: Generative Agent-centric Spatio-temporal Object Interaction (CVPR'2021) ☆12 Mar 23, 2022 Updated 3 years ago
- The official Languini Kitchen repository ☆14 May 6, 2024 Updated last year
- A small in-memory key value database for rust ☆14 Jun 8, 2023 Updated 2 years ago
- Unsupervised Imitation Learning ☆11 Dec 3, 2017 Updated 8 years ago
- ☆10 Jun 19, 2023 Updated 2 years ago
- ☆13 May 11, 2024 Updated last year