☆26Mar 17, 2025Updated last year
Alternatives and similar repositories for RACE
Users that are interested in RACE are comparing it to the libraries listed below
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …☆15Sep 12, 2025Updated 6 months ago
- ☆22Oct 25, 2024Updated last year
- Red Queen Dataset and data generation template☆27Dec 26, 2025Updated 2 months ago
- ☆124Feb 3, 2025Updated last year
- ☆25Mar 16, 2025Updated last year
- [NDSS'25] The official implementation of safety misalignment.☆17Jan 8, 2025Updated last year
- ☆25Jun 17, 2025Updated 9 months ago
- ☆16Sep 1, 2025Updated 6 months ago
- ☆122Dec 3, 2025Updated 3 months ago
- Prompt Generator model for Stable Diffusion Models☆12Jun 20, 2023Updated 2 years ago
- Papers by Professor Zhijin Qin (秦志金)☆11Sep 14, 2021Updated 4 years ago
- Learning RAG by using RAG☆11Sep 6, 2024Updated last year
- ☆24May 23, 2025Updated 9 months ago
- ☆14Oct 19, 2025Updated 5 months ago
- Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings☆18Sep 1, 2025Updated 6 months ago
- ☆11Apr 12, 2024Updated last year
- Responsible Robotic Manipulation☆16Aug 31, 2025Updated 6 months ago
- A list of research towards security&privacy in AI-Generated Content☆16Jan 10, 2025Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning☆22Jul 8, 2024Updated last year
- Code for Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks (TIFS2024)☆13Mar 29, 2024Updated last year
- ☆21Jan 16, 2025Updated last year
- Code for the paper: Fast and Private Inference of Deep Neural Networks by Co-designing Activation Functions☆11Mar 13, 2024Updated 2 years ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆23May 27, 2025Updated 9 months ago
- The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Wor…☆22Mar 19, 2024Updated 2 years ago
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆22Aug 13, 2024Updated last year
- [ICCV 2025] Official implementation of "Holistic Unlearning Benchmark: A Multi-Faceted Evaluation for Text-to-Image Diffusion Model Unlea…☆22Jan 2, 2026Updated 2 months ago
- The repo of "Coral: Maliciously Secure Computation Framework for Packed and Mixed Circuits" (CCS 2024)☆12Sep 6, 2024Updated last year
- ☆10Aug 22, 2017Updated 8 years ago
- The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning☆22Sep 27, 2025Updated 5 months ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- Sentence perplexity calculation based on a Chinese GPT2 pre-trained model☆15Apr 20, 2023Updated 2 years ago
- [ArXiv 2025] Denial-of-Service Poisoning Attacks on Large Language Models☆23Oct 22, 2024Updated last year
- Improved Secure 3-Party Neural Network Inference with Reducing Online Communication Costs☆11Jan 27, 2023Updated 3 years ago
- ☆42Feb 12, 2026Updated last month
- ☆14Apr 6, 2025Updated 11 months ago
- Code of paper: "xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"☆18Feb 17, 2026Updated last month
- Official implementation of “Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models” (AAAI 2026).☆33Dec 17, 2025Updated 3 months ago
- ☆18Oct 20, 2024Updated last year