google-deepmind / alphaevolve_results
☆230 Updated 3 months ago
Alternatives and similar repositories for alphaevolve_results
Users who are interested in alphaevolve_results are comparing it to the repositories listed below.
- Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation. ☆644 Updated 2 weeks ago
- ☆475 Updated 2 months ago
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution ☆519 Updated 2 weeks ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆343 Updated 3 months ago
- Evaluation of LLMs on latest math competitions ☆171 Updated 3 weeks ago
- Testing baseline LLMs performance across various models ☆313 Updated this week
- ☆88 Updated last week
- ☆209 Updated 6 months ago
- RLP: Reinforcement as a Pretraining Objective ☆155 Updated last week
- Open source interpretability artefacts for R1. ☆161 Updated 5 months ago
- Repository for Zochi's Research ☆276 Updated last month
- ☆478 Updated 4 months ago
- ☆296 Updated 3 weeks ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025] ☆547 Updated 2 months ago
- ☆270 Updated 5 months ago
- OpenAI Frontier Evals ☆903 Updated 2 weeks ago
- ☆188 Updated last month
- accompanying material for sleep-time compute paper ☆115 Updated 5 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆328 Updated 10 months ago
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆418 Updated this week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆268 Updated this week
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆188 Updated 7 months ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆342 Updated 10 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents ☆96 Updated 2 weeks ago
- Technical report of Kimina-Prover Preview. ☆335 Updated 3 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆556 Updated 2 months ago
- Code for the paper: "Learning to Reason without External Rewards" ☆360 Updated 3 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM). ☆290 Updated this week
- An open source implementation of LFMs from Liquid AI: Liquid Foundation Models ☆113 Updated last year
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents ☆108 Updated this week