uiuc-arc / llm-code-watermarkLinks
LLM Program Watermarking
☆17Updated last year
Alternatives and similar repositories for llm-code-watermark
Users that are interested in llm-code-watermark are comparing it to the libraries listed below
Sorting:
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆58Updated 2 weeks ago
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives☆69Updated last year
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.☆63Updated last month
- Official code for the paper "CodeChain: Towards Modular Code Generation Through Chain of Self-revisions with Representative Sub-modules"☆45Updated 6 months ago
- EvoEval: Evolving Coding Benchmarks via LLM☆75Updated last year
- Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]☆69Updated 5 months ago
- ☆44Updated 4 months ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆137Updated last year
- Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"☆48Updated last year
- Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs☆84Updated 7 months ago
- CRUXEval: Code Reasoning, Understanding, and Execution Evaluation☆149Updated 9 months ago
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts☆33Updated last year
- ☆31Updated 11 months ago
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)☆55Updated last month
- RepoQA: Evaluating Long-Context Code Understanding☆111Updated 8 months ago
- Code for watermarking language models☆79Updated 10 months ago
- Dataset for the Tensor Trust project☆43Updated last year
- CoCoMIC: Code Completion By Jointly Modeling In-file and Cross-file Context☆17Updated 8 months ago
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)☆62Updated 6 months ago
- Code to break Llama Guard☆31Updated last year
- Implementation of 'A Watermark for Large Language Models' paper by Kirchenbauer & Geiping et. al.☆24Updated 2 years ago
- Python package for measuring memorization in LLMs.☆160Updated this week
- Knowledge transfer from high-resource to low-resource programming languages for Code LLMs☆14Updated 10 months ago
- Training and Benchmarking LLMs for Code Preference.☆33Updated 8 months ago
- WMDP is a LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning m…☆128Updated last month
- [NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"☆130Updated 3 months ago
- ☆36Updated 2 years ago
- Jailbreak artifacts for JailbreakBench☆60Updated 8 months ago
- A prompt injection game to collect data for robust ML research☆62Updated 5 months ago
- ☆175Updated last year