PurCL / ASTRALinks
π₯ Amazon Nova AI Challenge Winner - ASTRA emerged victorious as the top attacking team in Amazon's global AI safety competition, defeating elite defending teams from universities worldwide in live adversarial evaluation.
β60Updated last month
Alternatives and similar repositories for ASTRA
Users that are interested in ASTRA are comparing it to the libraries listed below
Sorting:
- CyberGym is a large-scale, high-quality cybersecurity evaluation framework designed to rigorously assess the capabilities of AI agents onβ¦β70Updated last week
- An autonomous LLM-agent for large-scale, repository-level code auditingβ234Updated this week
- π₯π₯π₯ Detecting hidden backdoors in Large Language Models with only black-box accessβ44Updated 4 months ago
- β16Updated last year
- Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking"β16Updated 6 months ago
- SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and reaβ¦β57Updated 5 months ago
- CVE-Bench: A Benchmark for AI Agentsβ Ability to Exploit Real-World Web Application Vulnerabilitiesβ102Updated last month
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)β87Updated 8 months ago
- β122Updated last year
- β32Updated last year
- [CCS'24] An LLM-based, fully automated fuzzing tool for option combination testing.β89Updated 5 months ago
- Consuming Resrouce via Auto-generation for LLM-DoS Attack under Black-box Settingsβ16Updated last month
- β50Updated last year
- β25Updated last year
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agaiβ¦β52Updated 6 months ago
- [USENIX Security'24] Official repository of "Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise aβ¦β108Updated 11 months ago
- [USENIX Security 25] PatchAgent is a LLM-based practical program repair agent that mimics human expertise.β91Updated last week
- Source code for LLMxCPG paperβ53Updated 3 weeks ago
- β33Updated 3 months ago
- Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Modelsβ20Updated last year
- β80Updated last year
- TensorFlow API analysis tool and malicious model detection toolβ34Updated 4 months ago
- Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publisβ¦β78Updated last year
- A collection of security papers on top-tier publicationsβ55Updated last week
- β31Updated last year
- The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Benchβ96Updated 2 months ago
- β15Updated last year
- Parsing-based Analyzerβ51Updated 3 months ago
- The automated prompt injection framework for LLM-integrated applications.β230Updated last year
- Code for Book "AI for Cybersecurity: A Handbook of Use Case"β21Updated 2 years ago