chengyou-jia / AgentStore
☆35Updated 2 months ago
Alternatives and similar repositories for AgentStore:
Users that are interested in AgentStore are comparing it to the libraries listed below
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆84Updated 4 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆96Updated last week
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆58Updated 3 weeks ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆51Updated 9 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆77Updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆76Updated last week
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆111Updated last week
- Reformatted Alignment☆114Updated 5 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆116Updated 2 months ago
- ☆56Updated 3 months ago
- ☆49Updated 5 months ago
- ☆42Updated 2 months ago
- ☆82Updated 4 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆129Updated 3 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆22Updated last week
- Large Language Models Can Self-Improve in Long-context Reasoning☆62Updated 3 months ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆49Updated last month
- ☆28Updated 3 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆99Updated 4 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆49Updated 4 months ago
- ☆101Updated 3 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆32Updated last year
- FuseAI Project☆83Updated last month
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆66Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆48Updated 3 months ago