ulab-uiuc / ToMAPLinks
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
☆12Updated last month
Alternatives and similar repositories for ToMAP
Users that are interested in ToMAP are comparing it to the libraries listed below
Sorting:
- ☆66Updated 3 months ago
- ☆40Updated last week
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆21Updated 5 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆45Updated last week
- ☆47Updated 9 months ago
- Designing Multi-Agent Systems with Zero Supervision☆83Updated this week
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆56Updated 7 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- Official Repository for Paper "BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Compet…☆18Updated 10 months ago
- RuleRAG: Rule-guided Retrieval-Augmented Generation with Language Models for Question Answering☆22Updated 7 months ago
- Codebase for Instruction Following without Instruction Tuning☆35Updated 9 months ago
- ☆30Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆22Updated last year
- ☆16Updated 11 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆26Updated 7 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆25Updated 3 months ago
- ☆20Updated 3 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆24Updated last month
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆27Updated 3 weeks ago
- ☆47Updated last month
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆73Updated 3 weeks ago
- ☆27Updated last week
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆39Updated 8 months ago
- ☆21Updated this week
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated last week
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆41Updated 8 months ago