TrustAIRLab / JailbreakRadarLinks
☆79Updated last month
Alternatives and similar repositories for JailbreakRadar
Users that are interested in JailbreakRadar are comparing it to the libraries listed below
Sorting:
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆22Updated 7 months ago
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆40Updated 5 months ago
- ☆36Updated last year
- ☆65Updated 9 months ago
- [ICLR 2025] Official implementation of paper "Improving Data Efficiency via Curating LLM-Driven Rating Systems"☆97Updated 4 months ago
- The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.☆50Updated last month
- GLT has presented the first attempt to accelerate GNN inference. Though promising, GLT encounters robustness and generalization issues wh…☆28Updated last year
- ☆78Updated 7 months ago
- [ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization☆17Updated last year
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆43Updated 2 months ago
- ☆130Updated last month
- This project help you understand the concepts of histogram equalization and histogram specification in image processing learning from a p…☆9Updated last year
- Official Code of Logits-Based-Finetuning☆87Updated last month
- 通过撤销数据对联邦学习模型的训练更新,解决了联邦学习中的数据隐私安全问题。☆25Updated 2 weeks ago
- ☆51Updated last year
- Please visit our demonstration website for interactive demonstrations☆31Updated 10 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆31Updated last year
- [ICML 2025] Official implementation of paper "Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning"☆49Updated last month
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆34Updated last year
- alsap_frontend☆63Updated 5 months ago
- MGCF-Net for Phishing URLs Detection☆51Updated 2 months ago
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆31Updated last year
- ACL 2024