jennyzzt / evolving_ideas
A simple script to see how my ideas evolve over time
☆41 Updated last month
Alternatives and similar repositories for evolving_ideas
Users that are interested in evolving_ideas are comparing it to the libraries listed below
- Partial Masking for Discrete Diffusion Models ☆14 Updated 3 weeks ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆44 Updated last month
- ☆22 Updated last month
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆140 Updated last month
- Esoteric Language Models ☆88 Updated this week
- Resa: Transparent Reasoning Models via SAEs ☆40 Updated last month
- ☆23 Updated last month
- Code accompanying the paper "Generalized Interpolating Discrete Diffusion" ☆94 Updated last month
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508) ☆35 Updated this week
- ☆67 Updated last year
- σ-GPT: A New Approach to Autoregressive Models ☆65 Updated 11 months ago
- Fast and memory efficient PyTorch implementation of the Perceiver with FlashAttention. ☆26 Updated 8 months ago
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning ☆298 Updated last week
- Focused on fast experimentation and simplicity ☆76 Updated 6 months ago
- Official repo of paper LM2 ☆41 Updated 5 months ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k ☆14 Updated 4 months ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation ☆14 Updated last month
- RS-IMLE ☆41 Updated 7 months ago
- Fork of Flame repo for training of some new stuff in development ☆14 Updated last week
- ☆14 Updated 3 weeks ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun ☆55 Updated 4 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. ☆86 Updated 2 months ago
- ☆19 Updated 4 months ago
- ☆33 Updated 6 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning ☆103 Updated 2 weeks ago
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆28 Updated 2 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆19 Updated 4 months ago
- Code for ☆27 Updated 7 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆65 Updated 2 months ago
- Repository to create traveling waves that integrate spatial information through time ☆53 Updated 4 months ago