Zhudongsheng75 / Divide-Then-AggregateLinks
(ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation
☆10Updated 2 months ago
Alternatives and similar repositories for Divide-Then-Aggregate
Users that are interested in Divide-Then-Aggregate are comparing it to the libraries listed below
Sorting:
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"☆184Updated 9 months ago
- ☆31Updated 2 months ago
- 🏆 ICML 2025 Spotlight☆302Updated 3 weeks ago
- The awesome agents in the era of large language models☆68Updated last year
- The OlymMATH dataset☆19Updated 2 months ago
- Official code for AAAI2023 paper`Confucius: Iterative Tool Learning from Introspection Feedback by Easy-to-Difficult Curriculum`☆17Updated 6 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆185Updated last week
- ☆21Updated last year
- Contrastive Learning Reduces Hallucination in Conversations☆22Updated last year
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆320Updated last year
- Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"☆133Updated 3 weeks ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆218Updated this week
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆13Updated 9 months ago
- A curated list of awesome works in Routing LLMs paradigm (👉 Welcome to submit your contributions to this code repository)☆50Updated 3 weeks ago
- All-in-one Web Agent framework for post-training. Start building with a few clicks!☆266Updated last month
- ☆263Updated 2 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆169Updated 3 months ago
- The related works and background techniques about Openai o1☆224Updated 7 months ago
- Enhances Overleaf by allowing article searches and BibTeX retrieval from DBLP and Google Scholar | 通过允许从 DBLP 和 Google Scholar 进行文章搜索和获取 …☆103Updated 3 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆115Updated 4 months ago
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆457Updated 6 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆85Updated 3 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆271Updated 3 weeks ago
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆243Updated this week
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆185Updated 3 months ago
- [ACL' 25] The official code repository for PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models.☆78Updated 5 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆39Updated 3 weeks ago
- This my attempt to create Self-Correcting-LLM based on the paper Training Language Models to Self-Correct via Reinforcement Learning by g…☆35Updated last month
- A Comprehensive Survey on Long Context Language Modeling☆170Updated last month
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆355Updated 5 months ago