chengyou-jia / AgentStore
☆34 Updated 2 months ago
Alternatives and similar repositories for AgentStore:
Users who are interested in AgentStore are comparing it to the libraries listed below.
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆92 Updated 2 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆73 Updated last month
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis ☆103 Updated 3 weeks ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆82 Updated 4 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆125 Updated 2 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆47 Updated this week
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆116 Updated 3 months ago
- ☆54 Updated 5 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆51 Updated 8 months ago
- Reformatted Alignment ☆114 Updated 4 months ago
- ☆81 Updated 3 months ago
- ☆120 Updated 8 months ago
- (ICLR 2025) The Official Code Repository for GUI-World. ☆47 Updated 2 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆74 Updated last week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆31 Updated last year
- ☆42 Updated 2 months ago
- ☆92 Updated last month
- ☆53 Updated 2 months ago
- ☆98 Updated 2 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting ☆27 Updated 11 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆44 Updated last month
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs ☆96 Updated last month
- The official repo for "TheoremQA: A Theorem-driven Question Answering dataset" (EMNLP 2023) ☆27 Updated 9 months ago
- Official implementation of the paper "Process Reward Model with Q-value Rankings" ☆48 Updated 2 weeks ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆52 Updated 4 months ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks ☆61 Updated last month