for-ai / goodtriever

Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"

☆22

Related projects ⓘ

Alternatives and complementary repositories for goodtriever

wyu97 / RACo
Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.
☆20Updated 2 years ago
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆54Updated 11 months ago
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆26Updated 7 months ago
yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆24Updated 8 months ago
wzhouad / context-faithful-llm
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆39Updated last year
princeton-nlp / MABEL
EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975
☆37Updated 11 months ago
Zce1112zslx / IKE
☆40Updated 11 months ago
allenai / natural-instructions-v1
Benchmarking Generalization to New Tasks from Natural Language Instructions
☆25Updated 3 years ago
jkallini / mission-impossible-language-models
Code repository for the paper "Mission: Impossible Language Models."
☆39Updated 10 months ago
yanaiela / pararel
☆42Updated 10 months ago
GAIR-NLP / MetaCritique
Evaluate the Quality of Critique
☆35Updated 5 months ago
xu1998hz / InstructScore_SEScore3
First explanation metric (diagnostic report) for text generation evaluation
☆61Updated 4 months ago
microsoft / HaDes
Token-level Reference-free Hallucination Detection
☆93Updated last year
Nanami18 / Snowballed_Hallucination
☆44Updated 2 months ago
ellaneeman / disent_qa
This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.
☆18Updated last year
ntunlp / LLMSanitize
An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).
☆43Updated 3 months ago
xlang-ai / BRIGHT
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆57Updated last month
balevinstein / Probes
☆39Updated last year
vr25 / hallucination-foundation-model-survey
A Survey of Hallucination in Large Foundation Models
☆50Updated 10 months ago
swj0419 / in-context-pretraining
☆39Updated 7 months ago
KwanWaiChung / M4LE
Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models
☆22Updated 3 months ago
wenhuchen / Time-Sensitive-QA
Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"
☆64Updated 2 years ago
swj0419 / kNN_prompt
TBC
☆26Updated 2 years ago
AI21Labs / factor
Code and data for the FACTOR paper
☆39Updated last year
SALT-NLP / mic
Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"
☆18Updated last year
google-research / true
Code and data accompanying the paper "TRUE: Re-evaluating Factual Consistency Evaluation".
☆71Updated last week
krystalan / chatgpt_as_nlg_evaluator
Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study
☆42Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆20Updated last month
peterwestuw / surface-form-competition
☆58Updated 2 years ago
epfl-dlab / SynthIE
The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen…
☆58Updated last year