WecoAI / aideml
AIDE: AI-Driven Exploration in the Space of Code. State of the Art machine Learning engineering agents that automates AI R&D.
☆803Updated 3 weeks ago
Alternatives and similar repositories for aideml:
Users that are interested in aideml are comparing it to the libraries listed below
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆645Updated 2 months ago
- [ICLR 2025] Automated Design of Agentic Systems☆1,225Updated last month
- Synthetic data curation for post-training and structured data extraction☆1,049Updated this week
- 🤠 Agent-as-a-Judge and DevAI dataset☆375Updated 2 months ago
- ☆1,011Updated 3 months ago
- ☆583Updated 2 months ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,572Updated 3 months ago
- Autonomous Agents (LLMs) research papers. Updated Daily.☆720Updated this week
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆612Updated last month
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆719Updated 2 weeks ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆871Updated 10 months ago
- ☆374Updated 2 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆338Updated 9 months ago
- AWM: Agent Workflow Memory☆252Updated last month
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆375Updated last year
- An agent benchmark with tasks in a simulated software company.☆268Updated this week
- A reading list on LLM based Synthetic Data Generation 🔥☆1,211Updated last month
- End-to-end Generative Optimization for AI Agents☆518Updated last week
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆237Updated this week
- A compilation of the best multi-agent papers☆446Updated last week
- Code and Data for Tau-Bench☆346Updated 2 months ago
- [ICLR 2025] Agent S: an open agentic framework that uses computers like a human☆1,321Updated this week
- System 2 Reasoning Link Collection☆811Updated last week
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur…☆442Updated 3 months ago
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆402Updated last week
- Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi e…☆422Updated last week