google-deepmind / alphaevolve_resultsLinks

☆213

Alternatives and similar repositories for alphaevolve_results

Users that are interested in alphaevolve_results are comparing it to the libraries listed below

Sorting:

arcprize / ARC-AGI-2
☆402Updated 2 months ago
eth-sri / matharena
Evaluation of LLMs on latest math competitions
☆155Updated 2 weeks ago
SakanaAI / RLT
Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.
☆324Updated last month
aw31 / openai-imo-2025-proofs
☆455Updated 2 weeks ago
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆157Updated 3 months ago
SakanaAI / AI-Scientist-ICLR2025-Workshop-Experiment
☆262Updated 3 months ago
ekinakyurek / marc
Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"
☆321Updated 8 months ago
IntologyAI / Zochi
Repository for Zochi's Research
☆248Updated 3 weeks ago
facebookresearch / ExploreToM
Code for ExploreTom
☆84Updated last month
iliao2345 / CompressARC
☆172Updated 3 months ago
OSU-NLP-Group / GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
☆226Updated 2 weeks ago
arcprize / arc-agi-benchmarking
Testing baseline LLMs performance across various models
☆291Updated last week
idavidrein / gpqa
GPQA: A Graduate-Level Google-Proof Q&A Benchmark
☆378Updated 10 months ago
letta-ai / sleep-time-compute
accompanying material for sleep-time compute paper
☆99Updated 3 months ago
facebookresearch / memory
Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars…
☆344Updated 7 months ago
hijohnnylin / neuronpedia
open source interpretability platform 🧠
☆311Updated this week
gkamradt / SnakeBench
☆88Updated last month
PrimeIntellect-ai / prime-rl
Decentralized RL Training at Scale
☆400Updated this week
sunblaze-ucb / Intuitor
Code for the paper: "Learning to Reason without External Rewards"
☆344Updated 3 weeks ago
Decentralised-AI / LFM-Liquid-AI-Liquid-Foundation-Models
An open source implementation of LFMs from Liquid AI: Liquid Foundation Models
☆103Updated 10 months ago
Goedel-LM / Goedel-Prover
☆194Updated 4 months ago
facebookresearch / MLGym
MLGym A New Framework and Benchmark for Advancing AI Research Agents
☆538Updated 2 weeks ago
SWE-Gym / SWE-Gym
Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym [ICML 2025]
☆516Updated last week
vsubramaniam851 / multiagent-ft
☆212Updated 5 months ago
ZihanWang314 / CoE
Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models
☆219Updated last month
kanishkg / stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
☆149Updated 6 months ago
MoonshotAI / Kimina-Prover-Preview
Technical report of Kimina-Prover Preview.
☆320Updated 3 weeks ago
SakanaAI / evo-memory
Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.
☆318Updated 9 months ago
Continual-Intelligence / SEAL
Self-Adapting Language Models
☆743Updated this week
ScalingIntelligence / Archon
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
☆175Updated 4 months ago