google-research / scoreLinks
Code associated with the paper An AI system to help scientists write expert-level empirical software
☆99Updated 4 months ago
Alternatives and similar repositories for score
Users that are interested in score are comparing it to the libraries listed below
Sorting:
- CodeScientist: An automated scientific discovery system for code-based experiments☆310Updated 2 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆358Updated 7 months ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆237Updated 11 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆224Updated 5 months ago
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆126Updated 2 weeks ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆169Updated 5 months ago
- ☆106Updated 7 months ago
- ☆281Updated 9 months ago
- Repository for Zochi's Research☆300Updated 2 months ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆245Updated 8 months ago
- ☆80Updated 4 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)☆459Updated 5 months ago
- We present the first systematic study on the scaling property of raw agents instantiated by LLMs. We find that performance scales with th…☆140Updated last year
- ☆102Updated last month
- ☆259Updated last month
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Updated 6 months ago
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement☆157Updated 4 months ago
- Framework enabling modular interchange of language agents, environments, and optimizers☆121Updated this week
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations [COLM 2025]☆248Updated 7 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆199Updated last week
- build and benchmark deep research☆229Updated last week
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆256Updated this week
- SoTA Approach for ARC-AGI 2☆158Updated 4 months ago
- ☆228Updated 11 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆313Updated this week
- ☆223Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated this week
- Latent Collaboration in Multi-Agent Systems☆746Updated 2 weeks ago
- 🧬 The Huxley-Gödel Machine☆322Updated 2 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆199Updated 10 months ago