vivek3141 / ghostbuster-dataLinks
Data from the paper "Ghostbuster: Detecting Text Ghostwritten by Large Language Models"
β14Updated last year
Alternatives and similar repositories for ghostbuster-data
Users that are interested in ghostbuster-data are comparing it to the libraries listed below
Sorting:
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"β18Updated 6 months ago
- πΎ Universal, customizable and deployable fine-grained evaluation for text generation.β24Updated 2 years ago
- β102Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech.β93Updated 4 months ago
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study thβ¦β22Updated last year
- β17Updated 2 years ago
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)β168Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"β27Updated 4 years ago
- Public repository for SemEval 2023 - Task 10 - Explainable Detection of Online Sexism (EDOS)β25Updated 2 years ago
- Repro is a library for easily running code from published papers via Docker.β41Updated 2 years ago
- A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.β105Updated 2 years ago
- β36Updated 4 months ago
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"β16Updated 4 years ago
- Data for evaluating gender bias in coreference resolution systems.β81Updated 6 years ago
- Multidocument Summarization for Literature Review Shared Task 2022β30Updated 3 years ago
- Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"β12Updated 4 years ago
- PathPiece tokenizerβ13Updated last year
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).β20Updated 3 years ago
- Semantically Structured Sentence Embeddingsβ69Updated last year
- β56Updated 6 months ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.β54Updated 2 years ago
- [COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Modelsβ14Updated 2 years ago
- To analyze and remove gender bias in coreference resolution systemsβ78Updated 6 months ago
- Code for SaGe subword tokenizer (EACL 2023)β27Updated last year
- A corpus and code for understanding norms and subjectivity. π€β52Updated last year
- A Multi-subject High School Examinations Dataset for Cross-lingual and Multilingual Question Answeringβ46Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (httpsβ¦β44Updated last year
- β37Updated last month
- β40Updated last week
- Apps built using Inspired Cognition's Critique.β57Updated 2 years ago