shizhouxing/LLM-Detector-Robustness

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shizhouxing/LLM-Detector-Robustness)

shizhouxing / LLM-Detector-Robustness

[TACL] Code for "Red Teaming Language Model Detectors with Language Models"

☆24

Alternatives and similar repositories for LLM-Detector-Robustness

Users that are interested in LLM-Detector-Robustness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shreyansh26 / Red-Teaming-Language-Models-with-Language-Models
View on GitHub
A re-implementation of the "Red Teaming Language Models with Language Models" paper by Perez et al., 2022
☆35Oct 9, 2023Updated 2 years ago
AmritaBh / ConDA-gen-text-detection
View on GitHub
Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
☆42Dec 21, 2023Updated 2 years ago
guardrails-ai / validator-template
View on GitHub
A test validator repo that includes just the regex validator
☆15Mar 3, 2026Updated 4 months ago
DanielSc4 / Dynamic-Activation-Composition
View on GitHub
Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"
☆14Nov 22, 2024Updated last year
LanceKnight / MolKGNN
View on GitHub
MolKGNN is a deep learning model for predicting biological activity or molecular properties. It features in 1. SE(3)-invariance 2. confor…
☆14Jan 22, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AnHorn / AIMLinker
View on GitHub
Fragment Linker Prediction Using Deep Encoder-Decoder Network for PROTAC Drug Design
☆13Oct 2, 2023Updated 2 years ago
ruisizhang123 / REMARK-LLM
View on GitHub
[USENIX Security'24] REMARK-LLM: A robust and efficient watermarking framework for generative large language models
☆28Oct 23, 2024Updated last year
llan-ml / MetaTNE
View on GitHub
Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"
☆10Nov 17, 2020Updated 5 years ago
mtenenholtz / lmsys-chatbot-arena-solution
View on GitHub
☆15Aug 26, 2024Updated last year
A4Bio / MotifRetro
View on GitHub
The official implementation of the paper "MotifRetro: Exploring the Combinability-Consistency Trade-offs in retrosynthesis via Dynamic Mo…
☆11Jun 25, 2023Updated 3 years ago
joshterrell805 / OpenIntro_Statistics_Labs
View on GitHub
R labs for the book OpenIntro Statistics (https://www.openintro.org/stat/)
☆13Nov 17, 2016Updated 9 years ago
m-gallegos / SchNet4AIM
View on GitHub
This repository gathers the SchNet4AIM code along with some instructions and readme files.
☆15Mar 13, 2024Updated 2 years ago
asarigun / DrugGEN
View on GitHub
Official implementation of DrugGEN in PyTorch
☆12Oct 27, 2023Updated 2 years ago
vicgalle / refined-dpo
View on GitHub
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
☆13Feb 13, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
yizt / keras-imprinting
View on GitHub
论文Low-Shot Learning with Imprinted Weights 的keras 版简要实现;
☆15Dec 28, 2018Updated 7 years ago
narze / awesome-salim
View on GitHub
รวบรวมทุกแอคเค้าท์ Twitter, Facebook และ Instagram ของสลิ่มที่น่าติดตาม คุยด้วยเหตุและผล
☆12Nov 5, 2020Updated 5 years ago
deeplearning-wisc / LUMINA
View on GitHub
Official implementation of ICLR 2026 paper "LUMINA: Detecting Hallucinations in RAG System with Context–Knowledge Signals"
☆18Jan 31, 2026Updated 5 months ago
website-of-tinarskii / MukPakPak
View on GitHub
มุขแป๊ก ๆ ที่เพื่อนคุณยังจะเอามาเล่น...
☆12Sep 18, 2025Updated 10 months ago
kimeguida / POEM
View on GitHub
Pocket-Oriented Elaboration of Molecules: application to CDK8 inhibition
☆14Dec 30, 2022Updated 3 years ago
TSKumarage / Stylo-Det-AI-Gen-Twitter-Timelines
View on GitHub
☆10Mar 7, 2024Updated 2 years ago
lrank / Robust_and_Privacy_preserving_Text_Representations
View on GitHub
☆18Apr 2, 2021Updated 5 years ago
huashen218 / convxai
View on GitHub
CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing
☆15Jun 25, 2023Updated 3 years ago
mhw32 / meta-inference-public
View on GitHub
A PyTorch implementation of "Meta-Amortized Variational Inference and Learning" (https://arxiv.org/abs/1902.01950)
☆14Mar 31, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
narze / emkay
View on GitHub
Share your member card without hassle
☆11Updated this week
ChanghwaPark / DANN-tf2
View on GitHub
Tensorflow 2.0 implementation of Domain Adversarial Neural Networks (DANN)
☆12Dec 3, 2019Updated 6 years ago
saran9991 / llm-data-annotation
View on GitHub
Use Large Language Models like OpenAI's GPT-3.5 for data annotation and model enhancement. This framework combines human expertise with L…
☆41Sep 11, 2023Updated 2 years ago
SafeAILab / RAIN
View on GitHub
[ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning
☆99May 23, 2024Updated 2 years ago
deeplearning-wisc / mllmshift-emi
View on GitHub
Official implementation of ICML 2025 paper "Understanding Multimodal LLMs Under Distribution Shifts: An Information-Theoretic Approach"
☆12May 27, 2025Updated last year
amckenna41 / protPy
View on GitHub
Calculating a range of protein descriptors using their physicochemical, biological and structural properties 🔬.
☆16Jul 1, 2026Updated 3 weeks ago
ewsheng / controllable-nlg-biases
View on GitHub
Framework for controlling demographic biases in NLG (using adversarial prompts)
☆21Jun 12, 2023Updated 3 years ago
dmamur / elembert
View on GitHub
☆22May 9, 2025Updated last year
herumi / bls-go-binary
View on GitHub
☆22Oct 7, 2025Updated 9 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mw866 / go-back-n
View on GitHub
TCP-like Go-back-n protocol using UDP socket API
☆14May 30, 2017Updated 9 years ago
piotrjurkiewicz / topohub
View on GitHub
Repository of reference Gabriel graph, Internet Topology Zoo, SNDlib, CAIDA and synthetic backbone topologies for networking research
☆15Sep 30, 2025Updated 9 months ago
imethanlee / KnowPhish
View on GitHub
[USENIX Security 2024] Official Repository of 'KnowPhish: Large Language Models Meet Multimodal Knowledge Graphs for Enhancing Reference-…
☆17Aug 6, 2025Updated 11 months ago
apple / ml-aura
View on GitHub
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
☆26Jul 7, 2024Updated 2 years ago
ohuelab / MMGX
View on GitHub
MMGX: Multiple Molecular Graph eXplainable Discovery
☆22Apr 28, 2026Updated 2 months ago
WhatTheFar / awesome-whatthefar
View on GitHub
😎 Curated list of awesome WhatTheFar
☆14Mar 3, 2025Updated last year
creatorsgarten / abitofcoloring
View on GitHub
a bit
☆10Apr 7, 2024Updated 2 years ago