redwoodresearch / Text-Steganography-Benchmark

Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.
13Updated 9 months ago

Related projects

Alternatives and complementary repositories for Text-Steganography-Benchmark