vivek3141 / ghostbusterLinks
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
☆172Updated last year
Alternatives and similar repositories for ghostbuster
Users that are interested in ghostbuster are comparing it to the libraries listed below
Sorting:
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆215Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆135Updated last year
- ☆298Updated 2 years ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆108Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆112Updated 6 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆224Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆79Updated last year
- ☆116Updated last year
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models☆584Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆164Updated 5 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆156Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 5 months ago
- ☆159Updated last year
- Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs☆298Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆236Updated 11 months ago
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.☆344Updated last year
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)☆102Updated last week
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆193Updated 10 months ago
- ☆226Updated 4 years ago
- A Survey of Attributions for Large Language Models☆220Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆305Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆71Updated 2 years ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024]☆193Updated 3 months ago
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper…☆128Updated last year
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Updated last year
- ☆53Updated last year
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆181Updated 2 years ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆166Updated last year
- Resources for cultural NLP research☆110Updated 2 months ago