junchaoIU / DetectRLLinks
[NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
☆14Updated last year
Alternatives and similar repositories for DetectRL
Users that are interested in DetectRL are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆43Updated last year
- Source code of paper "An Unforgeable Publicly Verifiable Watermark for Large Language Models" accepted by ICLR 2024☆34Updated last year
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥☆370Updated last year
- The lastest paper about detection of LLM-generated text and code☆281Updated 6 months ago
- multi-bit language model watermarking (NAACL 24)☆17Updated last year
- The official GitHub page for the survey paper of AIGTD entitled "The Imitation Game Revisited: A Comprehensive Survey on Recent Advances …☆53Updated 10 months ago
- Watermarking LLM papers up-to-date☆13Updated 2 years ago
- Watermarking Text Generated by Black-Box Language Models☆40Updated 2 years ago
- Robust natural language watermarking using invariant features☆28Updated 2 years ago
- This is the code repository for "Uncovering Safety Risks of Large Language Models through Concept Activation Vector"☆47Updated 2 months ago
- A collection of resources on attacks and defenses targeting text-to-image diffusion models☆87Updated last week
- ☆32Updated last month
- (AAAI 24) Step Vulnerability Guided Mean Fluctuation Adversarial Attack against Conditional Diffusion Models☆11Updated last year
- Accepted by ECCV 2024☆179Updated last year
- ☆40Updated last year
- This is an official repository of ``VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models'' (NeurIPS 2…☆63Updated 9 months ago
- [AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts☆184Updated 6 months ago
- Repository for Towards Codable Watermarking for Large Language Models☆38Updated 2 years ago
- Repo for SemStamp (NAACL2024) and k-SemStamp (ACL2024)☆27Updated last year
- A Survey on Jailbreak Attacks and Defenses against Multimodal Generative Models☆292Updated last month
- Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"☆58Updated 11 months ago
- ☆30Updated last year
- Accepted by IJCAI-24 Survey Track☆225Updated last year
- MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks…☆29Updated last year
- Safety at Scale: A Comprehensive Survey of Large Model Safety☆216Updated last month
- ☆22Updated last year
- [ICCV-2025] Universal Adversarial Attack, Multimodal Adversarial Attacks, VLP models, Contrastive Learning, Cross-modal Perturbation Gene…☆31Updated 5 months ago
- ☆33Updated last year
- [NeurIPS 2024] DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning☆51Updated last month
- ☆37Updated 7 months ago