snap-research / locomoLinks
☆206Updated last year
Alternatives and similar repositories for locomo
Users that are interested in locomo are comparing it to the libraries listed below
Sorting:
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆244Updated last year
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆239Updated 3 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆125Updated 5 months ago
- ☆312Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆119Updated 6 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆230Updated 7 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆242Updated 2 weeks ago
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆139Updated 9 months ago
- ☆191Updated 2 weeks ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆337Updated 3 weeks ago
- ☆96Updated 8 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆258Updated last month
- ☆388Updated 3 weeks ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆156Updated 2 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆165Updated last year
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆148Updated 8 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆151Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆176Updated 4 months ago
- Source code and demo for memory bank and SiliconFriend☆318Updated 2 years ago
- Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)☆184Updated 4 months ago
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆344Updated last year
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆83Updated last month
- augmented LLM with self reflection☆130Updated last year
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆58Updated 3 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆243Updated 2 weeks ago
- FireAct: Toward Language Agent Fine-tuning☆282Updated last year
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆404Updated 2 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆98Updated 2 months ago
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23☆226Updated last year
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆156Updated 5 months ago