Tencent / AICGSecEvalLinks
A repository-level AI-generated code security evaluation benchmark
☆331Updated 2 months ago
Alternatives and similar repositories for AICGSecEval
Users that are interested in AICGSecEval are comparing it to the libraries listed below
Sorting:
- Remote IDA Call, a python package that allows you to call IDA functions from a remote process.☆118Updated last year
- demo PsExec☆127Updated 3 years ago
- Software Security Vulnerability Hub☆129Updated 2 months ago
- A reading list for MLSecOps!☆141Updated 6 months ago
- 💯 Perfecting AI workflows with human intelligence☆98Updated 3 weeks ago
- Focus on Linux C2. The open source part is reverse shell management.☆664Updated last month
- LLM-FuzzX is a user-friendly fuzz testing tool for Large Language Models (e.g., GPT, Claude, LLaMA), featuring advanced task-aware mutati…☆114Updated 4 months ago
- Repo for paper *Measuring and Augmenting Large Language Models for Solving Capture-the-Flag Challenges*☆255Updated 2 months ago
- ☆32Updated last year
- Modern patch, written in Python. 现代化的 Patch 工具。☆104Updated 4 months ago
- The white paper which discusses the security and privacy problems of large models.☆95Updated 2 years ago
- ☆178Updated last month
- MATEval is the first multi-agent framework simulating human collaborative discussion for open-ended text evaluation.☆28Updated 3 months ago
- ☆478Updated this week
- RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of aut…☆360Updated last week
- LLMs for autonomous reasoning and analysis of firmware☆31Updated 5 months ago
- ☆190Updated last year
- ☆283Updated 2 months ago
- 一款专注于python object的调试器☆53Updated 4 months ago
- ☆131Updated 2 months ago
- Code Efficiency Benchmark☆84Updated 4 months ago
- ☆93Updated 3 months ago
- Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with c…☆201Updated last week
- RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation☆55Updated 4 months ago
- ☆47Updated 11 months ago
- DeepClaude Rust的升级版本☆208Updated 5 months ago
- A powerful multi-format file parsing, data cleaning, and AI annotation toolkit.☆140Updated this week
- A timestamp for Code LLMs☆71Updated 3 weeks ago
- Selective Prompt Anchoring☆87Updated last week
- PhishIntention: Phishing detection through webpage intention☆248Updated 3 weeks ago