ICL backdoor attack
☆17Nov 4, 2024Updated last year
Alternatives and similar repositories for ICLAttack
Users that are interested in ICLAttack are comparing it to the libraries listed below
Sorting:
- Taskflow: Share system resources without breaking a sweat☆19May 24, 2022Updated 3 years ago
- Composite Backdoor Attacks Against Large Language Models☆22Apr 12, 2024Updated last year
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆26Jul 6, 2024Updated last year
- A lightweight library for large laguage model (LLM) jailbreaking defense.☆61Sep 11, 2025Updated 5 months ago
- 专用于搭建MT4或MT5交易跟单平台☆26Updated this week
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- ☆14Aug 28, 2024Updated last year
- Official Code Repository for the paper "Generating Realistic Images from In-the-wild Sounds", ICCV 2023☆12Aug 24, 2025Updated 6 months ago
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- Fine-tuning Llama2-7b and other llms for categorising emails for Deutsche Bahn (German National Railways)☆13Oct 9, 2023Updated 2 years ago
- Target Agnostic Attack on Deep Models: Exploiting Security Vulnerabilities of Transfer Learning☆10Jul 2, 2019Updated 6 years ago
- About The corresponding code from our paper " Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning…☆13Jan 14, 2026Updated last month
- ☆14Sep 17, 2024Updated last year
- 关于behance爬虫项目☆10May 16, 2019Updated 6 years ago
- awesome-audio-visual-robustness☆11Jan 27, 2024Updated 2 years ago
- ☆16Nov 18, 2024Updated last year
- My first IEEE conference paper on Computer Vision.☆11Jul 15, 2016Updated 9 years ago
- recommendation system building with online article data and tutorial☆12Nov 14, 2019Updated 6 years ago
- ☆13Apr 13, 2025Updated 10 months ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆16Feb 15, 2025Updated last year
- ☆14Jun 4, 2025Updated 9 months ago
- State-Relabeling Adversarial Active Learning☆14Aug 17, 2021Updated 4 years ago
- Code and datasets for the salesforce AI research paper on prompt leakage and multi-turn threats against LLMs☆20Nov 10, 2025Updated 3 months ago
- [TOIS'24] "RecRanker: Instruction Tuning Large Language Model as Ranker for Top-k Recommendation"☆16Dec 1, 2024Updated last year
- [ACL 2024] Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications☆14May 24, 2024Updated last year
- Code for the paper "Unbiased Supervised Contrastive Learning" | ICLR 2023 https://openreview.net/forum?id=Ph5cJSfD2XN☆13Sep 22, 2023Updated 2 years ago
- deep learning, malware detection, predictive uncertainty, dataset shift, calibration, uncertainty quantification, android malware☆16Nov 30, 2021Updated 4 years ago
- Operating Systems Laboratory - Minix☆10Oct 12, 2015Updated 10 years ago
- 管理博客园博客的Emacs扩展☆12Jun 1, 2023Updated 2 years ago
- ☆58May 30, 2024Updated last year
- Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"☆66Apr 24, 2024Updated last year
- [NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes☆12Jun 12, 2023Updated 2 years ago
- Automatic Metric for Evaluating Generated Videos☆33Dec 8, 2025Updated 3 months ago
- ☆13Nov 11, 2022Updated 3 years ago
- Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion (ACL 2024 Findin…☆17Aug 23, 2024Updated last year
- Working Memory Attack on LLMs☆17May 27, 2025Updated 9 months ago
- I fine-tuned (p-tuning) Tsinghua’s open-source large language model, ChatGLM2-6B, using several years of my WeChat chat history. Inspired…☆12Mar 6, 2024Updated 2 years ago
- ☆16Feb 8, 2024Updated 2 years ago
- MultiWOZ2.1-Parser for Dialogue State Tracking☆13Aug 3, 2021Updated 4 years ago