Kobe-Zed / Awesome-Continual-Learning-For-LLMs
☆11Updated last year
Alternatives and similar repositories for Awesome-Continual-Learning-For-LLMs
Users that are interested in Awesome-Continual-Learning-For-LLMs are comparing it to the libraries listed below
Sorting:
- ☆18Updated 3 years ago
- code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification☆27Updated 3 years ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆33Updated 3 years ago
- Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)☆24Updated 3 years ago
- This repo is for the paper: On the Safety of Conversational Models: Taxonomy, Dataset, and Benchmark☆25Updated 2 years ago
- Generating Natural Language Adversarial Examples through Probability Weighted Word Saliency☆69Updated 2 years ago
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13Updated 2 years ago
- Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…☆21Updated last year
- Code for paper "Defending aginast LLM Jailbreaking via Backtranslation"☆29Updated 9 months ago
- ☆27Updated last year
- Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)☆24Updated 3 years ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆16Updated 7 months ago
- ACL'2023: Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning☆41Updated 2 years ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆63Updated last year
- ☆33Updated 2 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆18Updated last year
- Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models☆28Updated last year
- codes for "Searching for an Effective Defender:Benchmarking Defense against Adversarial Word Substitution"☆31Updated last year
- ☆26Updated 7 months ago
- [ICLR 2024]Data for "Multilingual Jailbreak Challenges in Large Language Models"☆73Updated last year
- 🩺 A collection of ChatGPT evaluation reports on various bechmarks.☆49Updated 2 years ago
- ☆36Updated last year
- Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021☆43Updated 3 years ago
- ☆25Updated last year
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆43Updated 2 years ago
- Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…☆40Updated 3 years ago
- Natural Universal Trigger Search (NUTS)☆21Updated 4 years ago
- Code for our EMNLP-2023 paper: "Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks"☆24Updated last year
- ☆9Updated 3 years ago
- Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning☆44Updated last year