ulab-uiuc / ToMAPLinks
Official code repository for the paper "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"
☆21Updated 2 months ago
Alternatives and similar repositories for ToMAP
Users that are interested in ToMAP are comparing it to the libraries listed below
Sorting:
- ☆67Updated 8 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆30Updated 11 months ago
- [NAACL 2025] Representing Rule-based Chatbots with Transformers☆23Updated 9 months ago
- ☆46Updated 5 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- [ACL 2025] Agentic Knowledgeable Self-awareness☆91Updated 5 months ago
- ☆62Updated last year
- ☆24Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆136Updated last year
- ☆81Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Designing Multi-Agent Systems with Zero Supervision☆104Updated 5 months ago
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- Codebase for Instruction Following without Instruction Tuning☆36Updated last year
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated last month
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆60Updated last year
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆45Updated 3 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆115Updated 5 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆36Updated last year
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆103Updated last month
- ☆16Updated last year
- Sotopia-RL: Reward Design for Social Intelligence☆44Updated 3 months ago
- SSRL: Self-Search Reinforcement Learning☆157Updated 3 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 11 months ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆41Updated 5 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33Updated last year
- ☆23Updated 3 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago