allenai / wildguard

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
29Updated 2 months ago

Related projects: