SakanaAI / AI-Scientist-ICLR2025-Workshop-ExperimentLinks
☆273Updated 7 months ago
Alternatives and similar repositories for AI-Scientist-ICLR2025-Workshop-Experiment
Users that are interested in AI-Scientist-ICLR2025-Workshop-Experiment are comparing it to the libraries listed below
Sorting:
- CodeScientist: An automated scientific discovery system for code-based experiments☆303Updated last week
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents☆119Updated last month
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning☆235Updated 9 months ago
- ⏰ AI conference deadline countdowns☆290Updated last week
- Repository for Zochi's Research☆292Updated 2 weeks ago
- large population models☆522Updated last week
- ShinkaEvolve: Towards Open-Ended and Sample-Efficient Program Evolution☆705Updated this week
- ☆226Updated 9 months ago
- Open source interpretability artefacts for R1.☆164Updated 7 months ago
- ☆358Updated 4 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆352Updated 5 months ago
- ☆79Updated 2 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents.☆192Updated 8 months ago
- ☆105Updated 5 months ago
- The code repository of the paper: Competition and Attraction Improve Model Fusion☆167Updated 3 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆110Updated 7 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆225Updated this week
- ☆575Updated 6 months ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆576Updated 3 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆340Updated 3 weeks ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆76Updated this week
- ☆92Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆479Updated 3 months ago
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆55Updated 4 months ago
- ☆186Updated 5 months ago
- Code for LitLLMs, LLMs for Literature Review: Are we there yet? (TMLR 2025)☆45Updated 7 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆123Updated 9 months ago
- An agent benchmark with tasks in a simulated software company.☆592Updated 2 weeks ago
- ☆244Updated 5 months ago
- accompanying material for sleep-time compute paper☆118Updated 7 months ago