CS-EVAL / CS-Eval
CS-Eval is a comprehensive evaluation suite for assessing the cybersecurity ability of cybersecurity foundation models and large language models.
☆58 Updated last year
Alternatives and similar repositories for CS-Eval
Users who are interested in CS-Eval are comparing it to the repositories listed below.
- CyberMetric dataset ☆112 Updated last year
- ☆55 Updated last year
- CVE-Bench: A Benchmark for AI Agents’ Ability to Exploit Real-World Web Application Vulnerabilities ☆136 Updated last week
- [USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise a… ☆111 Updated last year