maveryn/cti-bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/maveryn/cti-bench)

maveryn / cti-bench

[NeurIPS'24, Spotlight] CTIBench: A Benchmark for Evaluating LLMs in Cyber Threat Intelligence

☆84

Alternatives and similar repositories for cti-bench

Users that are interested in cti-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

boschresearch / anno-ctr-lrec-coling-2024
View on GitHub
AnnoCTR corpus for detection and linking of entities in cyber threat reports
☆29Apr 12, 2024Updated 2 years ago
cybermetric / CyberMetric
View on GitHub
CyberMetric dataset
☆121Jan 1, 2025Updated last year
Mhackiori / STIXnet
View on GitHub
A Novel and Modular Solution for Extracting All STIX Objects in CTI Reports
☆32Aug 21, 2023Updated 2 years ago
nansunsun / CWE-Knowledge-Graph-Based-Twitter-Data-Analysis-for-Cybersecurity
View on GitHub
☆10Jan 21, 2019Updated 7 years ago
CSJianYang / SEevenLLM
View on GitHub
☆42Feb 18, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
aiforsec / MALOnt
View on GitHub
MALOnt - an ontology for Malware Threat Intelligence.
☆13Jul 8, 2021Updated 4 years ago
OWASP / OdTM
View on GitHub
OWASP Ontology-driven Threat Modelling framework
☆42Jul 11, 2023Updated 2 years ago
eclecticiq / stix-icons
View on GitHub
stix-icons is a collection of colourful and clean icons for use in software, training and marketing material to visualize cyber threats a…
☆38Dec 15, 2022Updated 3 years ago
aiforsec / LADDER
View on GitHub
☆36Jan 27, 2026Updated 4 months ago
CrowdStrike / CyberSOCEval_data
View on GitHub
Data for CyberSOCEval, an LLM benchmark by Meta & CrowdStrike
☆22Sep 22, 2025Updated 8 months ago
cinnqi / VulKG
View on GitHub
Vulnerability knowledge graph construction
☆30Dec 24, 2022Updated 3 years ago
for-just-we / CppCodeAnalyzer
View on GitHub
A tool based on python to parse C/C++ code into code property graph
☆18Nov 4, 2022Updated 3 years ago
BushidoUK / Cybercrime-Police-Raids
View on GitHub
Collection of videos of Raids on Cybercriminals
☆22Mar 19, 2025Updated last year
muchdogesec / txt2stix
View on GitHub
Extracts IoCs, TTPs and the relationships between them. Outputs a STIX 2.1 bundle.
☆81May 19, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
XuanwuAI / SecEval
View on GitHub
☆118Apr 3, 2024Updated 2 years ago
tmylla / Awesome-LLM4Cybersecurity
View on GitHub
An overview of LLMs for cybersecurity.
☆1,483May 6, 2026Updated 3 weeks ago
Cyb3rWard0g / IntelRAGU
View on GitHub
Intel Retrieval Augmented Generation (RAG) Utilities
☆90Jan 29, 2024Updated 2 years ago
elsheppo / unicode-thinking-claude
View on GitHub
This unique variation on Thinking Claude maps Claude's thought process steps to unicode and forces Claude to think in unicode, potentiall…
☆17Feb 24, 2025Updated last year
dessertlab / cti-to-mitre-with-nlp
View on GitHub
Replication package for the paper "Automatic Mapping of Unstructured Cyber Threat Intelligence: An Experimental Study" published at the I…
☆60Aug 29, 2022Updated 3 years ago
ai4cloudops / SecLLMHolmes
View on GitHub
SecLLMHolmes is a generalized, fully automated, and scalable framework to systematically evaluate the performance (i.e., accuracy and rea…
☆65May 4, 2025Updated last year
asappresearch / kbc-pomr
View on GitHub
Code for the paper "Knowledge Base Completion for Constructing Problem-Oriented Medical Records" at MLHC 2020
☆11Jun 8, 2021Updated 4 years ago
andyzorigin / cybench
View on GitHub
☆255Apr 22, 2026Updated last month
iamywang / bp-security-framework
View on GitHub
Research Artifact for HPCA'24 Paper: *Modeling, Derivation, and Automated Analysis of Branch Predictor Security Vulnerabilities*.
☆11Oct 30, 2025Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CS-EVAL / CS-Eval
View on GitHub
CS-Eval is a comprehensive evaluation suite for fundamental cybersecurity models or large language models' cybersecurity ability.
☆62Nov 27, 2024Updated last year
muchdogesec / sigma2stix
View on GitHub
⚠️ ARCHIVED**: This repository is no longer actively maintained. All Sigma rules are now managed and available in SIEM Rules
☆13Mar 19, 2026Updated 2 months ago
NCATS-Gamma / robokop
View on GitHub
Master UI for ROBOKOP
☆15Mar 31, 2023Updated 3 years ago
muchdogesec / file2txt
View on GitHub
Turn a supported list of filetypes (e.g. .docx) into a markdown structured text file. Also optionally defangs indicators and extract text…
☆12May 19, 2026Updated last week
anshumanbh / cyber-safari
View on GitHub
A fun POC that is built to understand AI security agents.
☆36Oct 30, 2025Updated 6 months ago
wtwofire / A-systematic-review-of-fuzzing-based-on-machine-learning-techniques
View on GitHub
☆10Jul 9, 2020Updated 5 years ago
datasec-lab / CodeBreaker
View on GitHub
[USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai…
☆59Mar 22, 2025Updated last year
nhshackday / nhshackday.github.io
View on GitHub
NHS Hack Day website
☆13Apr 25, 2026Updated last month
RobokopU24 / ORION
View on GitHub
ORION is a tool that ingests datasets from diverse knowledge bases and transforms them into modular, interoperable knowledge graphs.
☆17May 19, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jallen89 / theia-cdm-samples
View on GitHub
☆11May 3, 2019Updated 7 years ago
delftdata / wdm-project-benchmark
View on GitHub
Benchmarking suite for the Web-Scale Data Management course using Locust
☆14Aug 9, 2024Updated last year
moyix / elmfuzz
View on GitHub
Evolving fuzzers with large language models
☆17Dec 14, 2023Updated 2 years ago
PritomDas / Cyber-Attack-Attribution-with-Machine-Learning
View on GitHub
Cyber attack attribution is the process of attempting to trace back a piece of code or malware to a perpetrator of a cyberattack. As cybe…
☆15Jan 15, 2021Updated 5 years ago
nlpai-lab / CTI-reports-dataset
View on GitHub
☆43Apr 29, 2020Updated 6 years ago
NYU-LLM-CTF / nyuctf_agents
View on GitHub
The D-CIPHER and NYU CTF baseline LLM Agents built for NYU CTF Bench
☆146Oct 25, 2025Updated 7 months ago
eastmountyxz / Datasets-Security
View on GitHub
该资源为安全相关的数据集，包括恶意URL、恶意流量、图像分类、恶意软件等，希望对您有所帮助~
☆10Apr 21, 2021Updated 5 years ago