ad8e / TinyStories-cleaner
Remove generated stories with stray unicode characters
☆12 Updated 2 years ago
Alternatives and similar repositories for TinyStories-cleaner
Users who are interested in TinyStories-cleaner are comparing it to the libraries listed below.
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user… ☆183 Updated 2 months ago
- llm sampler that only allows words that are in the bible ☆43 Updated last year
- A synthetic story narration dataset to study small audio LMs. ☆31 Updated last year
- ☆19 Updated last month
- research impl of Native Sparse Attention (2502.11089) ☆63 Updated 10 months ago
- ☆50 Updated last year
- Focused on fast experimentation and simplicity ☆79 Updated last year
- Modeling code for a BitNet b1.58 Llama-style model. ☆25 Updated last year
- realtime latent world model inference demo ☆48 Updated last year
- ☆23 Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆66 Updated last month
- ☆20 Updated last year
- ☆20 Updated 2 years ago
- ☆48 Updated 10 months ago
- Thispersondoesnotexist went down, so this time, while building it back up, I am going to open source all of it. ☆91 Updated 2 years ago
- Latent Diffusion Language Models ☆70 Updated 2 years ago
- smolLM with Entropix sampler on pytorch ☆149 Updated last year
- ☆12 Updated last year
- A JAX implementation of the continuous time formulation of Consistency Models ☆85 Updated 2 years ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices ☆20 Updated 2 years ago
- Make-A-Video Latent Diffusion Model ☆19 Updated 2 years ago
- ☆22 Updated last year
- supporting pytorch FSDP for optimizers ☆84 Updated last year
- Efficient optimizers ☆280 Updated 3 weeks ago
- Train vision models using JAX and 🤗 transformers ☆100 Updated 3 weeks ago
- ☆53 Updated last year
- ☆108 Updated 5 months ago
- Visualizing the internal board state of a GPT trained on chess PGN strings, and performing interventions on its internal board state and … ☆218 Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens ☆150 Updated last week
- A simple, hackable text-to-speech system in PyTorch and MLX ☆184 Updated 5 months ago