vivek3141 / ghostbuster
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
☆139Updated 8 months ago
Alternatives and similar repositories for ghostbuster:
Users that are interested in ghostbuster are comparing it to the libraries listed below
- Code accompanying "How I learned to start worrying about prompt formatting".☆102Updated 4 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆125Updated 11 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated last year
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆86Updated last year
- ☆104Updated 9 months ago
- Finetune mistral-7b-instruct for sentence embeddings☆78Updated 9 months ago
- RARR: Researching and Revising What Language Models Say, Using Language Models☆45Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆67Updated 3 months ago
- Multilingual Large Language Models Evaluation Benchmark☆117Updated 6 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆112Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆206Updated 3 months ago
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆157Updated 9 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use?☆146Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 6 months ago
- Evaluating LLMs with fewer examples☆145Updated 10 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆71Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 5 months ago
- Lightweight tool to identify Data Contamination in LLMs evaluation☆46Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆252Updated 7 months ago
- Token-level Reference-free Hallucination Detection☆94Updated last year
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆93Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆108Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆85Updated 2 months ago
- Notebooks for training universal 0-shot classifiers on many different tasks☆119Updated last month
- Reverse Instructions to generate instruction tuning data with corpus examples☆208Updated 11 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆65Updated 11 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning☆45Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆135Updated 3 months ago