[TACL] Code for "Red Teaming Language Model Detectors with Language Models"
☆24Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for LLM-Detector-Robustness
Users that are interested in LLM-Detector-Robustness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022☆34Oct 9, 2023Updated 2 years ago
- ☆15May 5, 2026Updated last month
- Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection☆41Dec 21, 2023Updated 2 years ago
- ☆11Jul 6, 2023Updated 2 years ago
- Fragment Linker Prediction Using Deep Encoder-Decoder Network for PROTAC Drug Design☆13Oct 2, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Repository of reference Gabriel graph, Internet Topology Zoo, SNDlib, CAIDA and synthetic backbone topologies for networking research☆15Sep 30, 2025Updated 8 months ago
- ☆13Nov 7, 2023Updated 2 years ago
- ☆11Apr 11, 2023Updated 3 years ago
- ☆15Aug 26, 2024Updated last year
- The official implementation of the paper "MotifRetro: Exploring the Combinability-Consistency Trade-offs in retrosynthesis via Dynamic Mo…☆11Jun 25, 2023Updated 2 years ago
- This repository gathers the SchNet4AIM code along with some instructions and readme files.☆15Mar 13, 2024Updated 2 years ago
- Official implementation of DrugGEN in PyTorch☆12Oct 27, 2023Updated 2 years ago
- MolKGNN is a deep learning model for predicting biological activity or molecular properties. It features in 1. SE(3)-invariance 2. confor…☆14Jan 22, 2024Updated 2 years ago
- Physico-chemical and biological property prediction for small molecules☆12May 3, 2022Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)☆13Nov 17, 2016Updated 9 years ago
- [NDSS'25] The official implementation of safety misalignment.☆19Jan 8, 2025Updated last year
- ☆18Apr 2, 2021Updated 5 years ago
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing☆14Jun 25, 2023Updated 2 years ago
- Code for paper "Defending aginast LLM Jailbreaking via Backtranslation"☆35Aug 16, 2024Updated last year
- Experimental code for the paper 'Finding Convincing Arguments Using Scalable Bayesian Preference Learning'☆13Dec 8, 2022Updated 3 years ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆98May 23, 2024Updated 2 years ago
- [ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models☆90May 2, 2025Updated last year
- Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with L…☆41Sep 11, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- PWA application to easily calculate cost of Sushiro meal simply by counting☆13Apr 29, 2026Updated last month
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆21Jun 12, 2023Updated 3 years ago
- 2110101 COMP PROG 2020-2☆13Jul 16, 2021Updated 4 years ago
- The source code of the paper "SHGNN: Structure-Aware Heterogeneous Graph Neural Network"☆14Dec 14, 2021Updated 4 years ago
- ☆22May 9, 2025Updated last year
- [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models☆51Jan 11, 2025Updated last year
- scripts for personal reference☆18Dec 26, 2022Updated 3 years ago
- TCP-like Go-back-n protocol using UDP socket API☆14May 30, 2017Updated 9 years ago
- ☆10Jan 10, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai…☆59Mar 22, 2025Updated last year
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆153Jul 19, 2024Updated last year
- [KDD 2023] Multi-Grained Multimodal Interaction Network for Entity Linking☆29Sep 17, 2023Updated 2 years ago
- ☆27May 19, 2022Updated 4 years ago
- FS-GNNTR: Few-shot Learning with Transformers via Graph Embeddings for Molecular Property Prediction☆19Mar 21, 2025Updated last year
- ☆164Jan 24, 2025Updated last year
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year