Accompanying material for the sleep-time compute paper
☆128 · Apr 30, 2025 · Updated 10 months ago
Alternatives and similar repositories for sleep-time-compute
Users who are interested in sleep-time-compute are comparing it to the libraries listed below.
- Code for experiments on self-prediction as a way to measure introspection in LLMs ☆16 · Dec 10, 2024 · Updated last year
- entropix style sampling + GUI ☆27 · Oct 30, 2024 · Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni… ☆15 · Oct 31, 2025 · Updated 4 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning ☆1,025 · Mar 11, 2026 · Updated last week
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning" ☆37 · Oct 1, 2025 · Updated 5 months ago
- Reference implementation of models from Nyonic Model Factory ☆12 · May 13, 2024 · Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity". ☆30 · Nov 12, 2024 · Updated last year
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 7 months ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models" ☆14 · Mar 25, 2025 · Updated 11 months ago
- qgpt-issue-31 ☆11 · Oct 31, 2024 · Updated last year
- ☆16 · Feb 22, 2025 · Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR. ☆112 · Apr 17, 2025 · Updated 11 months ago
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput… ☆135 · Feb 27, 2026 · Updated 3 weeks ago
- Official Code Release for "Training a Generally Curious Agent" ☆45 · May 18, 2025 · Updated 10 months ago
- Paper Implementation of Self-Rewarding Language Models ☆13 · Feb 1, 2024 · Updated 2 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆16 · Feb 16, 2026 · Updated last month
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search ☆109 · Jun 3, 2025 · Updated 9 months ago
- The original Shared Recurrent Memory Transformer implementation ☆34 · Jul 11, 2025 · Updated 8 months ago
- The official implementations of Intention-conditioned Flow Occupancy Models (InFOM) ☆31 · Feb 5, 2026 · Updated last month
- Parallel Scaling Law for Language Model: Beyond Parameter and Inference Time Scaling ☆476 · May 17, 2025 · Updated 10 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs ☆1,699 · Updated this week
- Marketplace ML experiment - training without backprop ☆27 · Sep 9, 2025 · Updated 6 months ago
- Simple quiz component for solidjs and solid-start. ☆11 · Mar 13, 2026 · Updated last week
- ☆22 · Jan 29, 2026 · Updated last month
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI. ☆97 · Oct 23, 2025 · Updated 5 months ago
- ☆56 · Nov 6, 2024 · Updated last year
- ☆11 · Sep 16, 2022 · Updated 3 years ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆119 · Dec 5, 2024 · Updated last year
- An automated data pipeline scaling RL to pretraining levels ☆74 · Oct 11, 2025 · Updated 5 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆54 · Oct 1, 2024 · Updated last year
- Official Code for MIMETIC^2 ☆13 · Nov 19, 2024 · Updated last year
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025) ☆15 · Jan 6, 2026 · Updated 2 months ago
- Sounds from my whiteboard-the-web videos ☆11 · Jun 12, 2025 · Updated 9 months ago
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio… ☆15 · Mar 16, 2024 · Updated 2 years ago
- ☆29 · Jun 5, 2025 · Updated 9 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025) ☆175 · Nov 4, 2025 · Updated 4 months ago
- ☆19 · Jul 31, 2025 · Updated 7 months ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 · Jun 11, 2025 · Updated 9 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] ☆222 · Nov 27, 2025 · Updated 3 months ago