lechmazur / deception

Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claude, GPT-4, Gemini, Llama, etc.) with standardized evaluation metrics.
15 · Updated this week

Alternatives and similar repositories for deception:

Users who are interested in deception are comparing it to the repositories listed below.