AgentForceTeamOfficial / UA2-AgentLinks
Official Implementation of UA^{2}-Agent and other baseline algorithms of "Towards Unified Alignment Between Agents, Humans, and Environment"
☆19Updated last year
Alternatives and similar repositories for UA2-Agent
Users that are interested in UA2-Agent are comparing it to the libraries listed below
Sorting:
- Official Implementation of the Baby-AIGS system☆23Updated last year
- implementation of dualformer☆24Updated 8 months ago
- ☆51Updated 9 months ago
- ☆49Updated 7 months ago
- exploring whether LLMs perform case-based or rule-based reasoning☆30Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆37Updated last year
- ☆48Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆86Updated 6 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆31Updated 3 months ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆13Updated 10 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Updated last year
- ☆35Updated 6 months ago
- ☆23Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆28Updated last month
- [NeurIPS 2025 Spotlight] ReasonFlux-Coder: Open-Source LLM Coders with Co-Evolving Reinforcement Learning☆133Updated 2 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆40Updated 2 years ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆44Updated 9 months ago
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆16Updated last year
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆31Updated last year
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆28Updated last year
- [NeurIPS 2024] HonestLLM: Toward an Honest and Helpful Large Language Model☆29Updated 5 months ago
- ☆17Updated 3 months ago
- ☆17Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆21Updated 4 months ago
- ☆24Updated last year
- ☆28Updated 2 weeks ago
- ☆18Updated 10 months ago
- ☆27Updated 2 weeks ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆123Updated last year