☆15Apr 27, 2024Updated last year
Alternatives and similar repositories for CASPER
Users that are interested in CASPER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…☆24Jul 26, 2024Updated last year
- ☆10Mar 14, 2021Updated 5 years ago
- 面向人脸视频防伪鉴别的大规模中文数据评测基准(Large-Scale Chinese Data Benchmark for Face Video Anti-Forgery Identification)☆13Feb 26, 2025Updated last year
- CovRL-Fuzz: Fuzzing JavaScript Interpreters with Coverage-Guided Reinforcement Learning for LLM-Based Mutation☆41Nov 10, 2024Updated last year
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for "Astraea: Grammar-based Fairness Testing"☆10Jan 7, 2022Updated 4 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks☆13Feb 26, 2023Updated 3 years ago
- AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security.☆14Jul 28, 2019Updated 6 years ago
- DSN jailbreak Attack & Evaluation Ensemble☆17Feb 7, 2026Updated 2 months ago
- Intersectional bias in hate speech and abusive language datasets☆15Jan 25, 2024Updated 2 years ago
- ☆18Nov 28, 2023Updated 2 years ago
- [NDSS'25] The official implementation of safety misalignment.☆18Jan 8, 2025Updated last year
- ☆14Mar 9, 2025Updated last year
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆18Sep 23, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Mandoline is an accurate, low-overhead dynamic slicer for Android applicaions.☆11Dec 24, 2025Updated 3 months ago
- Official implementation of "Poisoning Attacks on Cyber Attack Detectors for Industrial Control Systems" by Moshe Kravchik, Battista Biggi…☆14Mar 22, 2021Updated 5 years ago
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking☆12Aug 22, 2025Updated 7 months ago
- Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"☆10Dec 20, 2023Updated 2 years ago
- [Neurips 2025]StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models☆29Dec 4, 2025Updated 4 months ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 9 months ago
- CCS 2023 | Explainable malware and vulnerability detection with XAI in paper "FINER: Enhancing State-of-the-art Classifiers with Feature …☆11Aug 20, 2024Updated last year
- ☆11Sep 10, 2024Updated last year
- Interpretable unified language safety checking with large language models☆32Apr 15, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- LLM-Powered Data Discovery System for Tabular Data☆26Apr 7, 2026Updated last week
- ☆26Nov 7, 2022Updated 3 years ago
- Code of the paper Unsupervised Domain Adaptation through Shape Modeling for Medical Image Segmentation.☆16Oct 16, 2022Updated 3 years ago
- ☆25Feb 6, 2022Updated 4 years ago
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …☆15Sep 12, 2025Updated 7 months ago
- Research Artifact of USENIX Security 2023 Paper: Precise and Generalized Robustness Certification for Neural Networks☆13Jun 20, 2023Updated 2 years ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- Adversarial Driving v.s. Autonomous Driving.☆23Jul 29, 2023Updated 2 years ago
- ☆10Jun 13, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- AutoEncoders for Event Detection (AEED): a Keras-based class for anomaly detection in water sensor networks.☆12Sep 17, 2018Updated 7 years ago
- [COLM 2025] JailDAM: Jailbreak Detection with Adaptive Memory for Vision-Language Model☆27Nov 25, 2025Updated 4 months ago
- 【Java-Mirai Console】口袋48机器人☆27Oct 5, 2023Updated 2 years ago
- [NDSS'25] "Provably Unlearnable Data Examples"☆22Dec 10, 2025Updated 4 months ago
- 恋爱记事本,一款轻便记录情侣日常生活的小程序。☆18Dec 28, 2023Updated 2 years ago
- KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality☆40Dec 1, 2025Updated 4 months ago
- Repository for the ACL 2024 conference website☆18Feb 3, 2025Updated last year