allenai / real-toxicity-promptsLinks

☆220

Alternatives and similar repositories for real-toxicity-prompts

Users that are interested in real-toxicity-prompts are comparing it to the libraries listed below

Sorting:

microsoft / TOXIGEN
This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.
☆331Updated last year
amazon-science / bold
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper
☆80Updated 4 years ago
nyu-mll / BBQ
Repository for the Bias Benchmark for QA dataset.
☆128Updated last year
nyu-mll / crows-pairs
This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…
☆124Updated last year
shmsw25 / FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆386Updated 5 months ago
openai / moderation-api-release
☆142Updated 3 years ago
facebookresearch / ResponsibleNLP
Repository for research in the field of Responsible NLP at Meta.
☆202Updated 4 months ago
declare-lab / red-instruct
Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment
☆105Updated last year
AlexTMallen / adaptive-retrieval
☆189Updated 3 months ago
swj0419 / detect-pretrain-code
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆233Updated last year
microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆96Updated 2 years ago
mkshing / Prompt-Tuning
Implementation of "The Power of Scale for Parameter-Efficient Prompt Tuning"
☆166Updated 4 years ago
anthropics / ConstitutionalHarmlessnessPaper
☆241Updated 2 years ago
yizhongw / Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
☆181Updated 2 years ago
realtimeqa / realtimeqa_public
☆78Updated last year
tatsu-lab / opinions_qa
☆115Updated last year
RUCAIBox / HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆514Updated last year
martiansideofthemoon / ai-detection-paraphrases
Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…
☆175Updated last year
yuh-zha / AlignScore
ACL2023 - AlignScore, a metric for factual consistency evaluation.
☆138Updated last year
allenai / wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
☆224Updated 10 months ago
yuxiaw / Factcheck-GPT
Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.
☆105Updated last year
google-research / true
Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".
☆81Updated 2 months ago
GXimingLu / Quark
☆75Updated last year
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆116Updated 2 years ago
PKU-Alignment / beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
☆158Updated last year
veronica320 / Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆163Updated last year
salesforce / factualNLG
Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"
☆59Updated 8 months ago
jinlanfu / GPTScore
Source Code of Paper "GPTScore: Evaluate as You Desire"
☆257Updated 2 years ago
bigscience-workshop / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆104Updated 2 years ago
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆59Updated last year