M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
☆45Apr 9, 2024Updated 2 years ago
Alternatives and similar repositories for M4
Users that are interested in M4 are comparing it to the libraries listed below.
- SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection☆81Apr 22, 2024Updated 2 years ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆82Nov 19, 2024Updated last year
- ☆13Nov 7, 2023Updated 2 years ago
- Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models"☆14May 27, 2024Updated last year
- ☆17Apr 7, 2024Updated 2 years ago
- ☆22Sep 25, 2023Updated 2 years ago
- SeqXGPT: An advanced method for sentence-level AI-generated text detection.☆100Oct 16, 2023Updated 2 years ago
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)☆180May 27, 2024Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆244Dec 30, 2024Updated last year
- Code and Data for the ACL 2022 paper "Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling"☆11Apr 5, 2022Updated 4 years ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆197Nov 9, 2023Updated 2 years ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆23Aug 15, 2024Updated last year
- The latest papers on detection of LLM-generated text and code☆287Jun 19, 2025Updated 10 months ago
- Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection☆41Dec 21, 2023Updated 2 years ago
- Official code repository for article Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts☆35Mar 15, 2025Updated last year
- (NAACL 2024) Official code repository for Mixset.☆26Dec 4, 2024Updated last year
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆18Oct 17, 2025Updated 6 months ago
- NLPCC-2025 Shared-Task 1: LLM-Generated Text Detection☆16Apr 6, 2026Updated last month
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)☆184Apr 24, 2026Updated 2 weeks ago
- Crosslingual Reasoning through Test-Time Scaling☆19May 13, 2025Updated 11 months ago
- Official code for DNA-GPT (ICLR 2024)☆56Apr 15, 2024Updated 2 years ago
- ☆15Sep 21, 2021Updated 4 years ago
- ☆42Sep 14, 2020Updated 5 years ago
- Repository of data and code to use the models described in the paper "Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia…☆11Nov 21, 2022Updated 3 years ago
- Wikipedia article dataset☆12May 10, 2019Updated 6 years ago
- ☆162Jan 24, 2025Updated last year
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆15Nov 19, 2024Updated last year
- ☆14Oct 21, 2024Updated last year
- 1st Place Solution for LLM - Detect AI Generated Text Kaggle Competition☆209May 20, 2024Updated last year
- GPU-accelerated algorithm for subsampling datasets while preserving diversity☆27Jan 12, 2024Updated 2 years ago
- ☆23Nov 18, 2022Updated 3 years ago
- ☆53Apr 30, 2026Updated last week
- Three modules of extractive text summarization, including implementation of Kmeans clustering using BERT sentence embedding☆13Dec 9, 2019Updated 6 years ago
- https://sites.google.com/site/multidimensionaltagger☆38Dec 6, 2023Updated 2 years ago
- A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for …☆21Mar 7, 2026Updated 2 months ago
- ☆10Mar 7, 2024Updated 2 years ago
- ☆13Feb 21, 2025Updated last year
- ☆12Aug 20, 2023Updated 2 years ago
- LaTeX Proposal Template for the University of Chinese Academy of Sciences☆18Oct 14, 2023Updated 2 years ago