accompanying material for sleep-time compute paper
☆130Apr 30, 2025Updated last year
Alternatives and similar repositories for sleep-time-compute
Users that are interested in sleep-time-compute are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 6 months ago
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆1,058Apr 15, 2026Updated 2 weeks ago
- Reference implementation of models from Nyonic Model Factory☆12May 13, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 8 months ago
- qgpt-issue-31☆11Oct 31, 2024Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code!☆18Apr 25, 2026Updated last week
- Source code for the collaborative reasoner research project at Meta FAIR.☆113Mar 26, 2026Updated last month
- God Conjecture Paper / Presentation / Video Discussion☆38Apr 12, 2026Updated 2 weeks ago
- Official Code Release for "Training a Generally Curious Agent"☆46May 18, 2025Updated 11 months ago
- ☆43Dec 10, 2025Updated 4 months ago
- ☆273May 14, 2025Updated 11 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…☆147Feb 27, 2026Updated 2 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆479May 17, 2025Updated 11 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,790Updated this week
- Train and run transformers directly on Apple's Neural Engine in Swift bypass coreml entirely☆99Apr 18, 2026Updated 2 weeks ago
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 7 months ago
- Fast and reliable distributed systems in Python☆35Mar 25, 2026Updated last month
- ☆56Nov 6, 2024Updated last year
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI.☆97Oct 23, 2025Updated 6 months ago
- ☆17Oct 12, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An automated data pipeline scaling RL to pretraining levels☆75Oct 11, 2025Updated 6 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆54Oct 1, 2024Updated last year
- Official Code for MIMETIC^2☆13Nov 19, 2024Updated last year
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated 3 months ago
- ☆12Sep 7, 2024Updated last year
- Self Organizing Maps (SOM) ML model can be used to conduct semantic search to populate context required for Retrieval Augmented Generatio…☆15Mar 16, 2024Updated 2 years ago
- The official repo of VideoAgentTrek☆47Oct 24, 2025Updated 6 months ago
- ☆19Jul 31, 2025Updated 9 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆181Nov 4, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated 10 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- ☆10Nov 17, 2022Updated 3 years ago
- An experimental SDK for adding agentic memory and learning in a pluggable way☆43Nov 4, 2025Updated 5 months ago
- ☆15Sep 7, 2022Updated 3 years ago
- MCP Test Client is a TypeScript testing utility for Model Context Protocol (MCP) servers.☆13Jan 2, 2025Updated last year
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆140Aug 13, 2025Updated 8 months ago