ad8e / TinyStories-cleanerLinks
Remove generated stories with stray unicode characters
☆13Updated last year
Alternatives and similar repositories for TinyStories-cleaner
Users that are interested in TinyStories-cleaner are comparing it to the libraries listed below
Sorting:
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆175Updated last month
- llm sampler that only allows words that are in the bible☆27Updated 6 months ago
- ☆49Updated last year
- realtime latent world model inference demo☆46Updated 7 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆140Updated 4 months ago
- ☆29Updated 6 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated last month
- Train Llama Loras Easily☆31Updated last year
- Genertaes control vectors for use with llama.cpp in GGUF format.☆26Updated 3 months ago
- Text-writing denoising diffusion (and much more)☆30Updated 2 years ago
- ☆20Updated last year
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 3 months ago
- Training Models Daily☆17Updated last year
- Fast, Modern, and Low Precision PyTorch Optimizers☆94Updated this week
- ☆21Updated 7 months ago
- ☆28Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- ☆20Updated 8 months ago
- Token Omission Via Attention☆128Updated 8 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 7 months ago
- The first AI artist☆32Updated 2 years ago
- this is a TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use.☆20Updated 6 months ago
- Latent Diffusion Language Models☆68Updated last year
- supporting pytorch FSDP for optimizers☆82Updated 6 months ago
- Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.☆71Updated 2 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 4 months ago
- Doohickey is a stable diffusion tool for technical artists who want to stay up-to-date with the latest developments in the field.☆40Updated 2 years ago
- Focused on fast experimentation and simplicity☆75Updated 6 months ago
- A GPT with self-similar nested properties☆20Updated last year
- Erasing concepts from neural representations with provable guarantees☆228Updated 5 months ago