TrustAIRLab / JailbreakRadar
☆79Updated 3 months ago
Alternatives and similar repositories for JailbreakRadar
Users who are interested in JailbreakRadar are comparing it to the repositories listed below.
- StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving☆21Updated 9 months ago
- [ICLR 2025] Official implementation of paper "Improving Data Efficiency via Curating LLM-Driven Rating Systems"☆96Updated 5 months ago
- [EMNLP 2024 Findings] Official PyTorch Implementation of "Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Ge…☆39Updated 7 months ago
- ☆36Updated last year
- Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?☆32Updated last year
- The official code repo for "Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets" in ICML 2025.☆52Updated 2 months ago
- An open-source highly heterogeneous entity alignment (HHEA) toolkit.☆32Updated last year
- ☆76Updated 8 months ago
- Code of Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Ne…☆26Updated last year
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆34Updated last year
- ☆62Updated 10 months ago
- GLT has presented the first attempt to accelerate GNN inference. Though promising, GLT encounters robustness and generalization issues wh…☆28Updated last year
- alsap_frontend☆63Updated 7 months ago
- ☆72Updated last year
- [ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization☆17Updated 2 years ago
- Neobanker FinTalk-AI: A Grounded Orchestration Framework for Multi-Agent Collaboration on Financial Tasks Leveraging the OSWorld Environm…☆39Updated last month
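- Addresses data privacy and security issues in federated learning by revoking the training updates that the data contributed to the federated learning model.☆25Updated last month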
- Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question Answering☆42Updated 4 months ago
- Concise Evaluation Benchmark for Large Language Models☆25Updated last month
- Multi-Attentional Deepfake Detection☆22Updated 10 months ago
- Low-code core component: implementation of the data model☆56Updated last year
- This is a free and open-source demographic dictionary search system based on Traditional Chinese.☆39Updated 6 months ago
- Please visit our demonstration website for interactive demonstrations☆30Updated 11 months ago
- ☆46Updated 3 weeks ago
- mobile predict☆25Updated 9 months ago
- ☆49Updated last year
- ACL 2024☆34Updated last month
- Store and download PseudoMeta R Package☆28Updated 2 months ago
- This search engine leverages the Boost library for efficient document search, featuring data preprocessing, index creation, and advanced …☆57Updated last year
- LoRA fine-tuning Mistral-7b-v2 on PR Task☆19Updated last year