ZiyueWang25 / llm-security-challenge

Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the OverTheWire wargames environment, showing the models' surprising ability to do action-oriented cyberexploits in shell environments
12Updated last year

Alternatives and similar repositories for llm-security-challenge:

Users that are interested in llm-security-challenge are comparing it to the libraries listed below