RobustNLP / TestNER
A toolkit for testing and improving named entity recognition [ESEC/FSE'23]
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for TestNER
- ☆11Updated last year
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.☆52Updated last month
- A toolkit for testing machine translation [ICSE'20, '21, ESEC/FSE'20]☆33Updated 3 years ago
- ☆30Updated 5 months ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆99Updated 4 months ago
- Collections of research, benchmarks and tools towards more robust and reliable language models for code; LM4Code; LM4SE; reliable LLM; L…☆22Updated 11 months ago
- ☆15Updated 8 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆43Updated last year
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models☆25Updated last year
- ☆33Updated last year
- Multilingual safety benchmark for Large Language Models☆23Updated 2 months ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆14Updated last year
- Code for paper "Defending aginast LLM Jailbreaking via Backtranslation"☆24Updated 3 months ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆61Updated last month
- 【ACL 2024】 SALAD benchmark & MD-Judge☆106Updated last month
- A toolkit to assess data privacy in LLMs (under development)☆41Updated 2 months ago
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆19Updated last month
- Mostly recording papers about models' trustworthy applications. Intending to include topics like model evaluation & analysis, security, c…☆20Updated last year
- The course website for Large Language Models Methods and Applications☆28Updated 6 months ago
- Weak-to-Strong Jailbreaking on Large Language Models☆67Updated 9 months ago
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆14Updated 4 months ago
- ☆14Updated 8 months ago
- A lightweight tool for detecting bugs on Graph Database Management Systems☆14Updated 10 months ago
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆73Updated 2 months ago
- ☆153Updated 11 months ago
- Training and Benchmarking LLMs for Code Preference.☆25Updated last week
- [NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"☆61Updated 3 months ago
- Repo for the research paper "Aligning LLMs to Be Robust Against Prompt Injection"☆19Updated 3 weeks ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆29Updated 3 years ago
- Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers☆36Updated 2 months ago