snap-research / locomoLinks
☆162Updated 11 months ago
Alternatives and similar repositories for locomo
Users that are interested in locomo are comparing it to the libraries listed below
Sorting:
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆245Updated last week
- [EMNLP 2024 (Oral)] Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA☆139Updated 9 months ago
- ☆286Updated 2 weeks ago
- ☆131Updated 4 months ago
- Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)☆167Updated 3 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆229Updated 6 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆122Updated 4 months ago
- ☆174Updated 3 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆114Updated 6 months ago
- ☆310Updated 2 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆117Updated 5 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆237Updated 2 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆234Updated this week
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆166Updated 3 months ago
- A banchmark list for evaluation of large language models.☆134Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆169Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆233Updated 3 months ago
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆91Updated 2 months ago
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆158Updated last year
- augmented LLM with self reflection☆128Updated last year
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆243Updated last year
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆156Updated 2 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆148Updated last year
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆93Updated 2 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆145Updated 7 months ago
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆394Updated last month
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆250Updated 2 months ago
- ☆189Updated 2 months ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆127Updated 4 months ago
- A version of verl to support tool use☆315Updated this week