xuyuzhuang11 / WerewolfLinks
☆56Updated last year
Alternatives and similar repositories for Werewolf
Users that are interested in Werewolf are comparing it to the libraries listed below
Sorting:
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆87Updated 4 months ago
- This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.☆321Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆184Updated 7 months ago
- ☆160Updated 8 months ago
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆156Updated last year
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆147Updated 10 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆267Updated 11 months ago
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving(EMNLP2024 Findings)"☆34Updated 11 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆178Updated 4 months ago
- [ACL 2025] A Neural-Symbolic Self-Training Framework☆112Updated 3 months ago
- A research repo for experiments about Reinforcement Finetuning☆52Updated 4 months ago
- ☆95Updated last week
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆163Updated last year
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆107Updated last month
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆147Updated 6 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆189Updated 4 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆60Updated 10 months ago
- Paper List for In-context Learning 🌷☆183Updated last year
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆263Updated last year
- ☆262Updated last month
- ☆159Updated 7 months ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)☆192Updated last year
- Data and Code for Program of Thoughts [TMLR 2023]☆285Updated last year
- On Memorization of Large Language Models in Logical Reasoning☆71Updated 5 months ago
- AI Alignment: A Comprehensive Survey☆135Updated last year
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆56Updated last year
- [ACL 2024] A Survey of Chain of Thought Reasoning: Advances, Frontiers and Future☆460Updated 7 months ago
- ☆204Updated 5 months ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆63Updated last month
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆80Updated last year