[TACL] Code for "Red Teaming Language Model Detectors with Language Models"
☆24Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for LLM-Detector-Robustness
Users that are interested in LLM-Detector-Robustness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022☆34Oct 9, 2023Updated 2 years ago
- ☆14May 8, 2024Updated last year
- Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection☆41Dec 21, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆11Jan 10, 2024Updated 2 years ago
- Hackathon-104☆10Jul 19, 2023Updated 2 years ago
- Fragment Linker Prediction Using Deep Encoder-Decoder Network for PROTAC Drug Design☆12Oct 2, 2023Updated 2 years ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- Repository of reference Gabriel graph, Internet Topology Zoo, SNDlib, CAIDA and synthetic backbone topologies for networking research☆12Sep 30, 2025Updated 5 months ago
- A python implementation of discrete optimal transport with a Tsallis entropy regularization.☆14Oct 23, 2023Updated 2 years ago
- ☆13Nov 7, 2023Updated 2 years ago
- ☆11Apr 11, 2023Updated 2 years ago
- ☆15Aug 26, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- The official implementation of the paper "MotifRetro: Exploring the Combinability-Consistency Trade-offs in retrosynthesis via Dynamic Mo…☆11Jun 25, 2023Updated 2 years ago
- This repository gathers the SchNet4AIM code along with some instructions and readme files.☆15Mar 13, 2024Updated 2 years ago
- MolKGNN is a deep learning model for predicting biological activity or molecular properties. It features in 1. SE(3)-invariance 2. confor…☆14Jan 22, 2024Updated 2 years ago
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- [Paper][SIGIR 2024] NativE: Multi-modal Knowledge Graph Completion in the Wild☆52Aug 12, 2024Updated last year
- Physico-chemical and biological property prediction for small molecules☆12May 3, 2022Updated 3 years ago
- [NDSS'25] The official implementation of safety misalignment.☆17Jan 8, 2025Updated last year
- ☆11Nov 28, 2025Updated 3 months ago
- Imbalanced Gradients: A New Cause of Overestimated Adversarial Robustness. (MD attacks)☆11Aug 29, 2020Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Pocket-Oriented Elaboration of Molecules: application to CDK8 inhibition☆14Dec 30, 2022Updated 3 years ago
- ☆16Sep 1, 2025Updated 6 months ago
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing☆14Jun 25, 2023Updated 2 years ago
- ☆18Jan 17, 2024Updated 2 years ago
- ☆10Mar 7, 2024Updated 2 years ago
- A PyTorch implementation of "Meta-Amortized Variational Inference and Learning" (https://arxiv.org/abs/1902.01950)☆14Mar 31, 2020Updated 5 years ago
- Code to analyze the data from DNA-encoded libraries (DELs)☆11Jul 12, 2023Updated 2 years ago
- Experimental code for the paper 'Finding Convincing Arguments Using Scalable Bayesian Preference Learning'☆12Dec 8, 2022Updated 3 years ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆98May 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models☆92May 2, 2025Updated 10 months ago
- Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"☆11May 27, 2025Updated 9 months ago
- Framework for controlling demographic biases in NLG (using adversarial prompts)☆21Jun 12, 2023Updated 2 years ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆58Oct 30, 2025Updated 4 months ago
- The source code of the paper "SHGNN: Structure-Aware Heterogeneous Graph Neural Network"☆14Dec 14, 2021Updated 4 years ago
- This repo is for the Mis2-KDD 2021 under review paper: Dataset of Propaganda Techniques of the State-Sponsored Information Operation of t…☆19Feb 5, 2022Updated 4 years ago
- [USENIX'24] Prompt Stealing Attacks Against Text-to-Image Generation Models☆51Jan 11, 2025Updated last year