Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models"
☆14May 27, 2024Updated 2 years ago
Alternatives and similar repositories for ghostbuster-data
Users that are interested in ghostbuster-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation for Machine-Generated Text Localization (ACL 2024 Findings)☆14Jun 17, 2024Updated last year
- Python package to deal with PAN corpora and extract stylometric features from text documents.☆15Nov 11, 2022Updated 3 years ago
- Three modules of extractive text summarization, including implementation of Kmeans clustering using BERT sentence embedding☆13Dec 9, 2019Updated 6 years ago
- Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM | EMNLP 2025 Findings☆18Oct 17, 2025Updated 7 months ago
- A set of general tools.☆17May 27, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- MATLAB toolbox implementing template independent component analysis☆12Sep 14, 2020Updated 5 years ago
- NLPCC-2025 Shared-Task 1: LLM-Generated Text Detection☆16Apr 6, 2026Updated 2 months ago
- A knowledge base backend system for LLMs with full-text search, semantic retrieval, and knowledge graph querying. Ready-to-use modules fo…☆28Apr 13, 2025Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- A repository for the EMNLP 2021 paper "Is Information Density Uniform in Task-Oriented Dialogues?" and for the CoNLL 2021 paper "Analysin…☆10Jun 17, 2024Updated last year
- A repository for notebooks associated with FMRIPREP☆17Oct 25, 2018Updated 7 years ago
- Official Repository for "Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Samplin…☆23Aug 15, 2024Updated last year
- Neural ngram language model in PyTorch.☆10Sep 27, 2018Updated 7 years ago
- Training-free LLM-generated Text Detection by Mining Token Probability Sequences (ICLR 2025)☆36Apr 25, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Notebooks and other course materials for Emory QTM 340 (Fall 2022)☆12Dec 13, 2022Updated 3 years ago
- (NAACL 2024) Official code repository for Mixset.☆26Dec 4, 2024Updated last year
- ☆18Feb 25, 2023Updated 3 years ago
- [not maintained anymore] [for study purpose] A simple PyTorch implementation for "Global Vectors for Word Representation".☆17Nov 7, 2019Updated 6 years ago
- OneStop: A 360-Participant Eye Tracking Dataset with Different Reading Regimes☆19Apr 18, 2026Updated last month
- ☆15Oct 24, 2023Updated 2 years ago
- Offiical codes for DNA-GPT (ICLR 2024)☆56Apr 15, 2024Updated 2 years ago
- A module to compute textual lexical richness (aka lexical diversity).☆113May 2, 2026Updated last month
- Code base for "Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood"☆15Aug 10, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 基于 Streamlit、LangChain 与 Chroma 的轻量级 RAG 学习项目,支持本地知识库上传、检索增强问答与聊天式交互。☆166Apr 3, 2026Updated 2 months ago
- ☆10Feb 2, 2023Updated 3 years ago
- Cellular Automata - Pokemon Type Battle Simulation☆11Oct 26, 2024Updated last year
- ANE accelerated embedding models!☆19Dec 11, 2024Updated last year
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆100Oct 16, 2023Updated 2 years ago
- A modular and extensible Python framework, designed to aid in the creation of high-quality, unbiased datasets to build robust models for …☆21Mar 7, 2026Updated 3 months ago
- micro-gpt in ASM on the Super Nintendo☆67Feb 12, 2026Updated 4 months ago
- tgrep2 Searching for NLTK Trees☆15Oct 28, 2016Updated 9 years ago
- This repository contains all code for the BLMM toolbox.☆21Jan 31, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆385May 14, 2024Updated 2 years ago
- Online Chinese Stroke Order Lookup☆20Oct 25, 2016Updated 9 years ago
- A MATLAB package for multivariate permutation testing and effect size measurement☆25Aug 29, 2024Updated last year
- ☆16Mar 2, 2026Updated 3 months ago
- Customized foundational image segmentation models for picking protein particles in cryo-EM images☆22Jan 7, 2026Updated 5 months ago
- Dataset and codes for identifying sentence-level discourse elements in Chinese argumentative student essays.☆14Nov 16, 2022Updated 3 years ago
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text☆34Jul 26, 2023Updated 2 years ago