JianGuanTHU / AMOR_Agent
Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"
☆11Updated last month
Alternatives and similar repositories for AMOR_Agent:
Users that are interested in AMOR_Agent are comparing it to the libraries listed below
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆128Updated last week
- ☆57Updated 2 months ago
- ☆69Updated last week
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆41Updated 5 months ago
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆54Updated last month
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆21Updated last month
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆43Updated last month
- Reformatted Alignment☆113Updated 2 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆118Updated 5 months ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆37Updated last month
- ☆91Updated 8 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 2 months ago
- ☆51Updated 3 months ago
- ☆36Updated last month
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆35Updated 4 months ago
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent☆73Updated 2 weeks ago
- [NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.☆73Updated 2 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆64Updated 2 weeks ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆27Updated 3 months ago
- ☆320Updated 6 months ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆33Updated 2 weeks ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆74Updated this week
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆51Updated 2 months ago
- ☆36Updated 3 weeks ago
- Implementations of online merging optimizers proposed by Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment☆65Updated 5 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆71Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆40Updated 9 months ago
- ☆35Updated 3 months ago
- ☆141Updated 7 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆47Updated 9 months ago