M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
☆45Apr 9, 2024Updated last year
Alternatives and similar repositories for M4
Users that are interested in M4 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Jul 12, 2024Updated last year
- [AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially …☆51Nov 5, 2025Updated 4 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆82Nov 19, 2024Updated last year
- ☆13Nov 7, 2023Updated 2 years ago
- Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models"☆14May 27, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text☆34Jul 26, 2023Updated 2 years ago
- ☆21Sep 25, 2023Updated 2 years ago
- Can AI-Generated Text be Reliably Detected?☆88Nov 16, 2023Updated 2 years ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆100Oct 16, 2023Updated 2 years ago
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)☆179May 27, 2024Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆244Dec 30, 2024Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Apr 5, 2022Updated 3 years ago
- This project aims to build upon existing MGTBench project, extending its functionalities with the option to import and evaluate the bench…☆21Nov 5, 2024Updated last year
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆22Aug 15, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Code base for ICLR 2024 "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".☆386Feb 7, 2026Updated last month
- Official code repository for article Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts☆35Mar 15, 2025Updated last year
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆18Oct 17, 2025Updated 5 months ago
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)☆171Updated this week
- NLPCC-2025 Shared-Task 1: LLM-Generated Text Detection☆15May 19, 2025Updated 10 months ago
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆46Dec 10, 2024Updated last year
- [EMNLP 2023] Release repo for our work "Token Prediction as Implicit Classification to Identify LLM-Generated Text"☆25Jan 7, 2024Updated 2 years ago
- Offiical codes for DNA-GPT (ICLR 2024)☆56Apr 15, 2024Updated last year
- A summarizer for Japanese articles (but ChatGPT is better)☆10Aug 1, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- LLMDet is a text detection tool that can identify which generated sources the text came from (e.g. large language model or human-write).☆85May 27, 2024Updated last year
- Wikipedia article dataset☆12May 10, 2019Updated 6 years ago
- Python package to deal with PAN corpora and extract stylometric features from text documents.☆15Nov 11, 2022Updated 3 years ago
- ☆44Mar 22, 2026Updated last week
- ☆12May 12, 2020Updated 5 years ago
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"☆15Dec 16, 2025Updated 3 months ago
- Three modules of extractive text summarization, including implementation of Kmeans clustering using BERT sentence embedding☆13Dec 9, 2019Updated 6 years ago
- A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for …☆20Mar 7, 2026Updated 3 weeks ago
- ☆10Mar 7, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆12Aug 20, 2023Updated 2 years ago
- ☆39Feb 25, 2024Updated 2 years ago
- Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models☆13Dec 23, 2024Updated last year
- ☆10Aug 6, 2022Updated 3 years ago
- Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥☆1,345Dec 1, 2023Updated 2 years ago
- Implementation for Machine-Generated Text Localization (ACL 2024 Findings)☆15Jun 17, 2024Updated last year
- A repository for ACL 2022 paper "How do we answer complex questions: Discourse structure of long form answers"☆19May 31, 2025Updated 9 months ago