Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models"
☆14May 27, 2024Updated last year
Alternatives and similar repositories for ghostbuster-data
Users that are interested in ghostbuster-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)☆179May 27, 2024Updated last year
- Implementation for Machine-Generated Text Localization (ACL 2024 Findings)☆15Jun 17, 2024Updated last year
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆46Dec 10, 2024Updated last year
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆14Nov 19, 2024Updated last year
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆19Oct 17, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13Feb 21, 2025Updated last year
- Post hoc inference via multiple testing☆12Nov 2, 2025Updated 4 months ago
- Official repository of the paper: Detecting LLM-Generated Korean Text through Linguistic Feature Analysis (ACL 2025 Main)☆19Oct 17, 2025Updated 5 months ago
- A knowledge base backend system for LLMs with full-text search, semantic retrieval, and knowledge graph querying. Ready-to-use modules fo…☆28Apr 13, 2025Updated 11 months ago
- ☆18Jan 20, 2026Updated 2 months ago
- Tool for the automatic assessment of lexical diversity☆14Sep 6, 2025Updated 6 months ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆22Aug 15, 2024Updated last year
- Benchmark denoising strategies available from fmriprep.☆16Dec 18, 2025Updated 3 months ago
- Neural ngram language model in PyTorch.☆10Sep 27, 2018Updated 7 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Official implementation of Inconsistency Masks. A robust semi-supervised segmentation framework that reframes model disagreement as a…☆19Jan 23, 2026Updated 2 months ago
- (NAACL 2024) Official code repository for Mixset.☆26Dec 4, 2024Updated last year
- ☆18Feb 25, 2023Updated 3 years ago
- [not maintained anymore] [for study purpose] A simple PyTorch implementation for "Global Vectors for Word Representation".☆17Nov 7, 2019Updated 6 years ago
- OneStop: A 360-Participant Eye Tracking Dataset with Different Reading Regimes☆17Dec 5, 2025Updated 3 months ago
- ☆15Oct 24, 2023Updated 2 years ago
- Code snippets and reproductions from JustAByte☆25Jan 25, 2026Updated 2 months ago
- Benchmark of common hash functions☆10Sep 15, 2019Updated 6 years ago
- Code base for "Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood"☆15Aug 10, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A module to compute textual lexical richness (aka lexical diversity).☆112Aug 27, 2023Updated 2 years ago
- ☆14Nov 19, 2024Updated last year
- ☆10Feb 2, 2023Updated 3 years ago
- Cellular Automata - Pokemon Type Battle Simulation☆10Oct 26, 2024Updated last year
- Transformer language model (GPT-2) with sentencepiece tokenizer☆10Oct 15, 2019Updated 6 years ago
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for …☆20Mar 7, 2026Updated 2 weeks ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆100Oct 16, 2023Updated 2 years ago
- User-friendly viewer for Parquet files☆11Mar 7, 2026Updated 2 weeks ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A simple tutorial script on Streamlit using the Iris Dataset☆13Sep 13, 2023Updated 2 years ago
- micro-gpt in ASM on the Super Nintendo☆51Feb 12, 2026Updated last month
- tgrep2 Searching for NLTK Trees☆15Oct 28, 2016Updated 9 years ago
- This repository contains all code for the BLMM toolbox.☆20Jan 31, 2024Updated 2 years ago
- [ICML 2025] "From Passive to Active Reasoning: Can Large Language Models Ask the Right Questions under Incomplete Information?"☆49Oct 8, 2025Updated 5 months ago
- Online Chinese Stroke Order Lookup☆20Oct 25, 2016Updated 9 years ago
- ☆15Mar 2, 2026Updated 3 weeks ago