☆16Apr 27, 2024Updated 2 years ago
Alternatives and similar repositories for CASPER
Users that are interested in CASPER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prolog specification of TensorFlow layers☆13Jun 12, 2023Updated 2 years ago
- ☆27Feb 1, 2023Updated 3 years ago
- Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…☆24Jul 26, 2024Updated last year
- ☆10Mar 14, 2021Updated 5 years ago
- ☆33Jun 24, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A series of BERT and Albert model checkpoints trained to reduce gendered correlations in pre-training☆11Oct 22, 2020Updated 5 years ago
- 面向人脸视频防伪鉴别的大规模中文数据评测基准(Large-Scale Chinese Data Benchmark for Face Video Anti-Forgery Identification)☆13Feb 26, 2025Updated last year
- BrainWash: A Poisoning Attack to Forget in Continual Learning☆12Apr 15, 2024Updated 2 years ago
- CovRL-Fuzz: Fuzzing JavaScript Interpreters with Coverage-Guided Reinforcement Learning for LLM-Based Mutation☆41Nov 10, 2024Updated last year
- ☆15Aug 7, 2025Updated 9 months ago
- Adversarial Attack for Pre-trained Code Models☆10Jul 19, 2022Updated 3 years ago
- ☆21May 31, 2024Updated last year
- Llama中文社区,最好的中文Llama大模型,完全开源可商用☆12Aug 5, 2023Updated 2 years ago
- [Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks☆13Feb 26, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- AndroidSlicer is a dynamic slicing tool, useful for a variety of tasks, from testing to debugging to security.☆14Jul 28, 2019Updated 6 years ago
- DSN jailbreak Attack & Evaluation Ensemble☆17Feb 7, 2026Updated 3 months ago
- ☆104Mar 24, 2022Updated 4 years ago
- Android Benchmark Reproduction Framework☆13Nov 30, 2021Updated 4 years ago
- ☆19Nov 28, 2023Updated 2 years ago
- 1.0☆15Jun 7, 2025Updated 11 months ago
- Can We Trust Large Language Models?: A Benchmark for Responsible Large Language Models via Toxicity, Bias, and Value-alignment Evaluation☆26Oct 12, 2023Updated 2 years ago
- ☆18May 18, 2021Updated 5 years ago
- [NDSS'25] The official implementation of safety misalignment.☆19Jan 8, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…☆11Aug 24, 2022Updated 3 years ago
- ☆15Mar 9, 2025Updated last year
- [CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment☆28Jun 11, 2025Updated 11 months ago
- Data set for LREC 2020 paper "I Feel Offended, Don't Be Abusive!"☆19Sep 23, 2023Updated 2 years ago
- Mandoline is an accurate, low-overhead dynamic slicer for Android applicaions.☆12Apr 24, 2026Updated last month
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking☆12Aug 22, 2025Updated 9 months ago
- [Neurips 2025]StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models☆31Dec 4, 2025Updated 5 months ago
- 【2024 ECAI】First Creating Backgrounds Then Rendering Texts: A New Paradigm for Visual Text Blending☆14Jun 16, 2025Updated 11 months ago
- CCS 2023 | Explainable malware and vulnerability detection with XAI in paper "FINER: Enhancing State-of-the-art Classifiers with Feature …☆12Aug 20, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆11Sep 10, 2024Updated last year
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 11 months ago
- ☆26Nov 7, 2022Updated 3 years ago
- Code of the paper Unsupervised Domain Adaptation through Shape Modeling for Medical Image Segmentation.☆16Oct 16, 2022Updated 3 years ago
- ☆25Feb 6, 2022Updated 4 years ago
- Structured Domain Adaptation with Online Relation Regularization for Unsupervised Person Re-ID☆18Jun 9, 2020Updated 5 years ago
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …☆15Sep 12, 2025Updated 8 months ago