vivek3141 / ghostbuster
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
☆142Updated 9 months ago
Alternatives and similar repositories for ghostbuster:
Users that are interested in ghostbuster are comparing it to the libraries listed below
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆68Updated 3 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆125Updated 11 months ago
- Multilingual Large Language Models Evaluation Benchmark☆118Updated 6 months ago
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)☆55Updated 3 weeks ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 5 months ago
- Evaluating LLMs with fewer examples☆147Updated 11 months ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆159Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.☆161Updated last year
- ☆119Updated 5 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆175Updated 2 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆119Updated 2 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆156Updated 3 months ago
- Can AI-Generated Text be Reliably Detected?☆72Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 10 months ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆81Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆218Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆22Updated 11 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆211Updated 3 months ago
- Code/data for MARG (multi-agent review generation)☆41Updated 3 months ago
- ☆142Updated 10 months ago
- Evaluating LLMs with CommonGen-Lite☆89Updated 11 months ago
- Steering vectors for transformer language models in Pytorch / Huggingface☆90Updated 2 weeks ago
- This project aims to build upon existing MGTBench project, extending its functionalities with the option to import and evaluate the bench…☆13Updated 4 months ago
- ☆91Updated 9 months ago
- awesome synthetic (text) datasets☆264Updated 4 months ago
- This is work done by the Oxen.ai Community, trying to reproduce the Self-Rewarding Language Model paper from MetaAI.☆122Updated 3 months ago
- ☆153Updated this week
- Finetune mistral-7b-instruct for sentence embeddings☆79Updated 10 months ago
- Reverse Instructions to generate instruction tuning data with corpus examples☆208Updated last year