AgentForceTeamOfficial / UA2-AgentLinks
Official Implementation of UA²-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environment"
☆17Updated 7 months ago
Alternatives and similar repositories for UA2-Agent
Users who are interested in UA2-Agent are comparing it to the repositories listed below.
- Official Implementation of the Baby-AIGS system☆23Updated 7 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆23Updated last year
- ☆46Updated 4 months ago
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 7 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆40Updated 2 weeks ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆43Updated 2 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆50Updated 3 weeks ago
- Evaluate the Quality of Critique☆35Updated last year
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆25Updated 3 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆22Updated 2 months ago
- [ACL 2024] The project of Symbol-LLM☆55Updated 11 months ago
- Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆83Updated 2 weeks ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆65Updated 2 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆18Updated this week
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Updated 8 months ago
- AbstainQA, ACL 2024☆26Updated 8 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆34Updated last year
- ☆22Updated 11 months ago
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆27Updated last year
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆21Updated 11 months ago
- ☆19Updated 2 weeks ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 9 months ago
- ☆35Updated 3 months ago
- ☆25Updated last year
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆16Updated 6 months ago
- ☆22Updated 6 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆38Updated 4 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 5 months ago