vinid / NegotiationArena
☆67Updated 11 months ago
Alternatives and similar repositories for NegotiationArena:
Users that are interested in NegotiationArena are comparing it to the libraries listed below
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆111Updated 9 months ago
- A mechanistic approach for understanding and detecting factual errors of large language models.☆41Updated 8 months ago
- The Prism Alignment Project☆68Updated 10 months ago
- Data and code for the Corr2Cause paper (ICLR 2024)☆94Updated 10 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆72Updated last year
- Functional Benchmarks and the Reasoning Gap☆84Updated 5 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆125Updated 11 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 3 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆66Updated 8 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Repository for the paper Stream of Search: Learning to Search in Language☆139Updated last month
- ☆103Updated 10 months ago
- Governance of the Commons Simulation (GovSim)☆41Updated last month
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆101Updated 11 months ago
- ☆130Updated 4 months ago
- ☆88Updated last month
- ☆119Updated 5 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆90Updated 2 weeks ago
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆94Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆112Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆35Updated 2 months ago
- ☆20Updated 9 months ago
- Discovering Data-driven Hypotheses in the Wild☆63Updated 3 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆166Updated last month
- This is the official repository for HypoGeniC (Hypothesis Generation in Context) and HypoRefine, which are automated, data-driven tools t…☆57Updated this week
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆100Updated 5 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap…☆155Updated 3 months ago
- Code/data for MARG (multi-agent review generation)☆41Updated 3 months ago
- Learning to route instances for Human vs AI Feedback☆20Updated last month
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 10 months ago