vivek3141 / ghostbusterLinks
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
☆162Updated last year
Alternatives and similar repositories for ghostbuster
Users that are interested in ghostbuster are comparing it to the libraries listed below
Sorting:
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆133Updated last year
- ☆295Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆224Updated 10 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆110Updated 4 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆105Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆153Updated last year
- ☆155Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆116Updated 2 years ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆203Updated 10 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆71Updated 2 years ago
- ☆52Updated last year
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data…☆101Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆77Updated 10 months ago
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆310Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆187Updated 3 months ago
- Evaluating LLMs with fewer examples☆161Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆161Updated last year
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆175Updated last year
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆204Updated 9 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆165Updated 2 years ago
- Finetune mistral-7b-instruct for sentence embeddings☆86Updated last year
- ☆98Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆163Updated last year
- ☆220Updated 4 years ago
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 7 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated 2 years ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆143Updated 11 months ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆215Updated last year