google-deepmind / alphaevolve_resultsLinks
☆258Updated 3 weeks ago
Alternatives and similar repositories for alphaevolve_results
Users that are interested in alphaevolve_results are comparing it to the libraries listed below
Sorting:
- ☆483Updated 6 months ago
- Open-source release accompanying Gao et al. 2025☆498Updated last month
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆812Updated last week
- Evaluation of LLMs on latest math competitions☆214Updated last month
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆313Updated last month
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- ☆116Updated this week
- ☆615Updated 8 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆343Updated 2 months ago
- ☆225Updated 9 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆344Updated last month
- ☆85Updated this week
- ☆346Updated this week
- ☆186Updated last week
- Testing baseline LLMs performance across various models☆336Updated this week
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆226Updated this week
- ☆167Updated 5 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆288Updated 2 months ago
- ☆281Updated 9 months ago
- Technical report of Kimina-Prover Preview.☆350Updated 6 months ago
- [ICLR 2026] Learning to Reason without External Rewards☆388Updated this week
- ☆401Updated last month
- ☆214Updated 3 weeks ago
- Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆227Updated 2 months ago
- ☆227Updated 11 months ago
- Open source interpretability artefacts for R1.☆169Updated 9 months ago
- Demystifying Reinforcement Learning in Agentic Reasoning☆159Updated 3 months ago
- accompanying material for sleep-time compute paper☆119Updated 9 months ago
- Repository for Zochi's Research☆298Updated 2 months ago
- Training API and CLI☆323Updated last week