jennyzzt / evolving_ideas
A simple script to see how my ideas evolve over time
☆42Updated 2 months ago
Alternatives and similar repositories for evolving_ideas
Users that are interested in evolving_ideas are comparing it to the libraries listed below
- ☆25Updated 3 months ago
- Official repo of paper LM2☆41Updated 6 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆143Updated 3 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆86Updated this week
- Resa: Transparent Reasoning Models via SAEs☆41Updated 3 weeks ago
- Esoteric Language Models☆94Updated last month
- ☆26Updated 2 months ago
- working implementation of deepseek MLA☆43Updated 7 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆65Updated 3 months ago
- Minimal (truly) muP implementation, consistent with TP4 and TP5 papers notation☆14Updated 3 months ago
- The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆35Updated last week
- ☆101Updated last week
- ☆83Updated 2 months ago
- SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning☆140Updated last week
- ☆69Updated last year
- σ-GPT: A New Approach to Autoregressive Models☆67Updated last year
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".☆132Updated 3 weeks ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025.☆93Updated 3 months ago
- A simple, performant and scalable JAX-based world modeling codebase☆70Updated this week
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning☆432Updated last month
- Basic world models☆23Updated last week
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem”☆30Updated 2 months ago
- open source alpha evolve☆67Updated 3 months ago
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 9 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆96Updated last month
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆20Updated 3 months ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆107Updated 3 weeks ago
- An AI benchmark for creative, human-like problem solving using Sudoku variants☆91Updated last month
- ☆174Updated 3 weeks ago
- Official PyTorch implementation of TokenSet.☆121Updated 5 months ago