SakanaAI / AI-Scientist-ICLR2025-Workshop-Experiment
☆229 Updated last week
Alternatives and similar repositories for AI-Scientist-ICLR2025-Workshop-Experiment:
Users interested in AI-Scientist-ICLR2025-Workshop-Experiment are comparing it to the repositories listed below:
- CodeScientist: An automated scientific discovery system for code-based experiments ☆221 Updated 3 weeks ago
- Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning ☆207 Updated 2 months ago
- Repository for Zochi's Research ☆56 Updated 3 weeks ago
- ☆125 Updated this week
- ☆297 Updated 4 months ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆182 Updated last week
- The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search ☆658 Updated last week
- ☆194 Updated 2 months ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆484 Updated 2 weeks ago
- Large population models ☆329 Updated 2 weeks ago
- ☆64 Updated 2 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆108 Updated 2 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't" ☆204 Updated last month
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆122 Updated last month
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆304 Updated 5 months ago
- A virtual environment for developing and evaluating automated scientific discovery agents. ☆144 Updated last month
- ☆514 Updated 2 months ago
- Prompt-to-Leaderboard ☆218 Updated 2 weeks ago
- ☆165 Updated 2 months ago
- ☆85 Updated 7 months ago
- An agent benchmark with tasks in a simulated software company. ☆294 Updated 2 weeks ago
- ☆544 Updated 3 weeks ago
- ☆56 Updated last week
- AWM: Agent Workflow Memory ☆262 Updated 2 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆240 Updated last week
- ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations ☆179 Updated 2 weeks ago
- Releases from OpenAI Preparedness ☆704 Updated 2 weeks ago
- OpenResearcher, an advanced Scientific Research Assistant ☆439 Updated 6 months ago
- ☆53 Updated 2 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ☆178 Updated 3 weeks ago