vivek3141 / ghostbuster
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
☆151Updated 10 months ago
Alternatives and similar repositories for ghostbuster:
Users that are interested in ghostbuster are comparing it to the libraries listed below
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆166Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆72Updated 5 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆180Updated 3 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆104Updated 6 months ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆87Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated 2 years ago
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)☆59Updated 3 weeks ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆24Updated last year
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆95Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆150Updated last year
- Reverse Instructions to generate instruction tuning data with corpus examples☆209Updated last year
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text☆29Updated last year
- Code/data for MARG (multi-agent review generation)☆42Updated 5 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- Multilingual Large Language Models Evaluation Benchmark☆123Updated 8 months ago
- ☆106Updated 11 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆218Updated 5 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆115Updated last year
- Offiical codes for DNA-GPT (ICLR 2024)☆50Updated last year
- Can AI-Generated Text be Reliably Detected?☆76Updated last year
- Wrapper to easily generate the chat template for Llama2☆64Updated last year
- Calculate perplexity on a text with pre-trained language models. Support MLM (eg. DeBERTa), recurrent LM (eg. GPT3), and encoder-decoder …☆155Updated 6 months ago
- Improving Alignment and Robustness with Circuit Breakers☆197Updated 7 months ago
- Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.☆219Updated last week
- [ICML 2024] Binoculars: Zero-Shot Detection of LLM-Generated Text☆267Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆74Updated 6 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆36Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆110Updated last year
- Finetune mistral-7b-instruct for sentence embeddings☆81Updated 11 months ago