vivek3141 / ghostbuster
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
☆162 Updated last year
Alternatives and similar repositories for ghostbuster
Users who are interested in ghostbuster are comparing it to the libraries listed below.
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆133 Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting". ☆111 Updated 3 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation. ☆105 Updated last year
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations". ☆84 Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆153 Updated last year
- ☆295 Updated last year
- ☆52 Updated last year
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ☆197 Updated 9 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆161 Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ☆224 Updated 10 months ago
- Code for the paper "Fishing for Magikarp" ☆165 Updated 4 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current… ☆76 Updated 10 months ago
- Code/data for MARG (multi-agent review generation) ☆49 Updated 10 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆70 Updated 2 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆70 Updated last year
- ☆114 Updated last year
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text ☆306 Updated last year
- ☆154 Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆86 Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆96 Updated 9 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆186 Updated 2 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆107 Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples ☆215 Updated last year
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆46 Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆116 Updated 2 years ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning". ☆163 Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆143 Updated 10 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆72 Updated last year