WecoAI / aidemlLinks
AIDE: AI-Driven Exploration in the Space of Code. The machine Learning engineering agent that automates AI R&D.
☆949Updated this week
Alternatives and similar repositories for aideml
Users that are interested in aideml are comparing it to the libraries listed below
Sorting:
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering☆795Updated 3 weeks ago
- [ICLR 2025] Automated Design of Agentic Systems☆1,358Updated 5 months ago
- End-to-end Generative Optimization for AI Agents☆615Updated 2 weeks ago
- ☆1,025Updated 6 months ago
- ⚖️ The First Coding Agent-as-a-Judge☆573Updated last month
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆482Updated 3 weeks ago
- Agentless🐱: an agentless approach to automatically solve software development problems☆1,781Updated 6 months ago
- ☆611Updated 5 months ago
- System 2 Reasoning Link Collection☆843Updated 3 months ago
- Code and Data for Tau-Bench☆657Updated 5 months ago
- Autonomous Agents (LLMs) research papers. Updated Daily.☆868Updated this week
- Atom of Thoughts for Markov LLM Test-Time Scaling☆577Updated 3 weeks ago
- MLGym A New Framework and Benchmark for Advancing AI Research Agents☆529Updated 2 weeks ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle☆286Updated 2 weeks ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan…☆1,289Updated last year
- OO for LLMs☆810Updated this week
- An agent benchmark with tasks in a simulated software company.☆468Updated 2 weeks ago
- Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding☆392Updated last year
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆566Updated 3 months ago
- Synthetic data curation for post-training and structured data extraction☆1,434Updated this week
- A reading list on LLM based Synthetic Data Generation 🔥☆1,338Updated last month
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆358Updated this week
- A-MEM: Agentic Memory for LLM Agents☆472Updated 2 weeks ago
- 🌎💪 BrowserGym, a Gym environment for web task automation☆806Updated last week
- An intuitive LLM prompting framework for multifunctional agents, by explicitly constructing a complex "thought process" from simple natur…☆453Updated 6 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆643Updated 2 weeks ago
- A library for advanced large language model reasoning☆2,174Updated last month
- A compilation of the best multi-agent papers☆716Updated last week
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆345Updated last year
- Lightweight and portable LLM sandbox runtime (code interpreter) Python library.☆343Updated last week