google-research / score
Code associated with the paper "An AI system to help scientists write expert-level empirical software"
☆97Updated 3 months ago
Alternatives and similar repositories for score
Users who are interested in score are comparing it to the libraries listed below.
- CodeScientist: An automated scientific discovery system for code-based experiments☆304Updated 2 weeks ago
- Automated Hypothesis Testing with Agentic Sequential Falsifications☆234Updated 7 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆353Updated 5 months ago
- Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library☆239Updated 3 weeks ago
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 4 months ago
- 🧬 The Huxley-Gödel Machine☆307Updated 2 weeks ago
- Repository for Zochi's Research☆294Updated 3 weeks ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆120Updated last month
- Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"☆111Updated this week
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆235Updated 9 months ago
- frozen-in-time version of our Paper Finder agent for reproducing evaluation results☆207Updated 3 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆143Updated 8 months ago
- accompanying material for sleep-time compute paper☆118Updated 7 months ago
- We present the first systematic study on the scaling property of raw agents instantiated by LLMs. We find that performance scales with th…☆134Updated last year
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆100Updated 3 months ago
- ☆105Updated 5 months ago
- A language agent gym with challenging scientific tasks☆219Updated this week
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆730Updated last week
- SciAgent: A Unified Multi-Agent System for Generalistic Scientific Reasoning☆89Updated 3 weeks ago
- Open AI data scientist agent that automates complex data analysis tasks using the ReAct framework. Execute Python code locally or in the …☆182Updated 5 months ago
- ☆79Updated 2 months ago
- ☆96Updated this week
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆163Updated last month
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 5 months ago
- ☆62Updated last year
- Train your own SOTA deductive reasoning model☆107Updated 9 months ago
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement☆143Updated 2 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 6 months ago
- ☆360Updated 4 months ago
- [DAI 2025] Beyond GPT-5: Making LLMs Cheaper and Better via Performance–Efficiency Optimized Routing☆194Updated 2 weeks ago