ad8e / TinyStories-cleaner
Remove generated stories with stray unicode characters
☆13 Updated 10 months ago
Related projects
Alternatives and complementary repositories for TinyStories-cleaner
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆152 Updated this week
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆84 Updated this week
- realtime latent world model inference demo ☆35 Updated last week
- Train Llama Loras Easily ☆29 Updated last year
- Aidan Bench attempts to measure <big_model_smell> in LLMs. ☆98 Updated this week
- Latent Diffusion Language Models ☆67 Updated last year
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆113 Updated 7 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆51 Updated 3 weeks ago
- RWKV-7: Surpassing GPT ☆45 Updated this week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆104 Updated last month
- Model REVOLVER, a human in the loop model mixing system. ☆33 Updated last year
- ☆24 Updated 7 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆14 Updated 7 months ago
- σ-GPT: A New Approach to Autoregressive Models ☆59 Updated 3 months ago
- ☆49 Updated 8 months ago
- ☆18 Updated last month
- Cerule - A Tiny Mighty Vision Model ☆67 Updated 2 months ago
- ☆27 Updated last year
- Examples of apps built with Nendo, the AI Audio Tool Suite ☆56 Updated 8 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆113 Updated 3 weeks ago
- ☆13 Updated last month
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers ☆71 Updated 3 months ago
- RWKV, in easy to read code ☆55 Updated this week
- ☆27 Updated 4 months ago
- A Collection of Pydantic Models to Abstract IRL ☆15 Updated this week
- Modeling code for a BitNet b1.58 Llama-style model. ☆23 Updated 6 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models" ☆39 Updated 10 months ago
- ☆53 Updated 10 months ago
- Minetest is an open source voxel game engine with easy modding and game creation ☆63 Updated 9 months ago