aypan17 / machiavelliLinks
☆139Updated 3 months ago
Alternatives and similar repositories for machiavelli
Users that are interested in machiavelli are comparing it to the libraries listed below
Sorting:
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆120Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆94Updated 2 years ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…