xingyaoww / code-act
Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
★956 · Updated 10 months ago
Alternatives and similar repositories for code-act:
Users who are interested in code-act are comparing it to the repositories listed below.
- Agentless: an agentless approach to automatically solve software development problems ★1,600 · Updated 3 months ago
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ★748 · Updated 3 weeks ago
- free and open OpenAI Deep Research ★491 · Updated last month
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through in… ★691 · Updated 5 months ago
- Agent-as-a-Judge and DevAI dataset ★388 · Updated 2 months ago
- [ICLR 2025] Agent S: an open agentic framework that uses computers like a human ★1,407 · Updated last week
- CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/ ★321 · Updated 4 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ★614 · Updated last week
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects. ★305 · Updated 2 weeks ago
- AIDE: AI-Driven Exploration in the Space of Code. A state-of-the-art machine learning engineering agent that automates AI R&D. ★821 · Updated this week
- Windows Agent Arena (WAA) is a scalable OS platform for testing and benchmarking of multi-modal AI agents. ★640 · Updated 3 weeks ago
- ★586 · Updated 2 months ago
- ★526 · Updated this week
- Code and Data for Tau-Bench ★367 · Updated 2 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ★477 · Updated 2 weeks ago
- An Open Large Reasoning Model for Real-World Solutions ★1,477 · Updated 3 weeks ago
- Agent driven automation starting with the web. Try it: https://www.emergence.ai/web-automation-api ★1,073 · Updated 2 months ago
- RAGChecker: A Fine-grained Framework For Diagnosing RAG ★815 · Updated 3 months ago
- Synthetic data curation for post-training and structured data extraction ★1,097 · Updated last week
- [ICLR 2025] Automated Design of Agentic Systems ★1,241 · Updated 2 months ago
- Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents" ★937 · Updated last month
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ★656 · Updated 2 months ago
- CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ★484 · Updated last month
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a… ★2,130 · Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ★1,466 · Updated this week
- A compilation of the best multi-agent papers ★471 · Updated last week
- ★913 · Updated 2 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ★338 · Updated 9 months ago
- ★846 · Updated 6 months ago
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ★293 · Updated 5 months ago