for-ai / goodtriever
Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"
☆22 Updated 3 months ago
Related projects:
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs). ☆40 Updated last month
- Restore safety in fine-tuned language models through task arithmetic ☆25 Updated 5 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆37 Updated 2 months ago
- Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models ☆42 Updated last week
- ☆44 Updated 2 weeks ago
- TBC ☆26 Updated last year
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA! ☆14 Updated 10 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆41 Updated last month
- Code and data for the FACTOR paper ☆36 Updated 10 months ago
- Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models ☆22 Updated last month
- Retrieval as Attention ☆77 Updated last year
- Evaluate the Quality of Critique ☆35 Updated 3 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 Updated 9 months ago
- The data and the PyTorch implementation for the models and experiments in the paper "Exploiting Asymmetry for Synthetic Training Data Gen… ☆56 Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆51 Updated last year
- Data and info for the paper "ParaDetox: Text Detoxification with Parallel Data" ☆27 Updated last year
- GPT as Human ☆17 Updated 8 months ago
- ☆17 Updated 8 months ago
- ☆15 Updated last year
- ☆39 Updated 9 months ago
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction" ☆17 Updated last year
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023) ☆13 Updated last year
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆63 Updated 2 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" ☆54 Updated 8 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆28 Updated 6 months ago
- ☆32 Updated 5 months ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆38 Updated 9 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper ☆63 Updated 3 years ago
- Token-level Reference-free Hallucination Detection ☆92 Updated last year
- Methods and evaluation for aligning language models temporally ☆24 Updated 6 months ago