Code & Data for the paper "RedditBias: A Real-World Resource for Bias Evaluation and Debiasing of Conversational Language Models"
☆32May 31, 2021Updated 4 years ago
Alternatives and similar repositories for RedditBias
Users that are interested in RedditBias are comparing it to the libraries listed below
Sorting:
- ☆25Feb 6, 2022Updated 4 years ago
- ☆25Oct 6, 2023Updated 2 years ago
- ☆11Jun 7, 2023Updated 2 years ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 4 months ago
- This repository contains code for the paper RMM: A Recursive Mental Model for Dialog Navigation☆10Nov 22, 2022Updated 3 years ago
- [ICML 2022 Spotlight] Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks☆11May 21, 2023Updated 2 years ago
- Open source web application implementing MIST Misinformation Susceptibility Test☆14Nov 19, 2025Updated 3 months ago
- [AAAI 2024] DataElixir: Purifying Poisoned Dataset to Mitigate Backdoor Attacks via Diffusion Models☆12Dec 5, 2024Updated last year
- ☆11Jul 12, 2021Updated 4 years ago
- ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization"☆10Nov 21, 2024Updated last year
- A pytorch implementation of focal loss☆10Jan 9, 2020Updated 6 years ago
- python project template for personal projects! 🙋♀️☆11Nov 28, 2020Updated 5 years ago
- PyTorch implementation of quantization-aware matrix factorization (QMF) for data compression☆15Jul 14, 2025Updated 7 months ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆43May 20, 2025Updated 9 months ago
- ☆16Jul 17, 2025Updated 7 months ago
- MDRDC dataset and used baselines☆11Feb 20, 2023Updated 3 years ago
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs☆44Oct 14, 2025Updated 4 months ago
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆15Dec 13, 2024Updated last year
- Code for "RADCoT: Retrieval-Augmented Distillation to Specialization Models for Generating Chain-of-Thoughts in Query Expansion", LREC-CO…☆11May 25, 2024Updated last year
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- [TMI' 23] FedDM: Federated Weakly Supervised Segmentation via Annotation Calibration and Gradient De-conflicting☆14Mar 11, 2023Updated 2 years ago
- Official implementation of BPA (CVPR 2022)☆13Jun 17, 2022Updated 3 years ago
- The official source code for TaleBrush (CHI 2022)☆15Jul 13, 2022Updated 3 years ago
- [NeurIPS 2024] Fight Back Against Jailbreaking via Prompt Adversarial Tuning☆11Oct 29, 2024Updated last year
- Scalable framework for comparing metric measure spaces with up to 1M points.☆16Apr 6, 2021Updated 4 years ago
- ☆12Apr 26, 2025Updated 10 months ago
- PyTorch implementation of the RCSLS cross-lingual word embedding alignment method☆12May 1, 2019Updated 6 years ago
- Tools for optimizing steering vectors in LLMs.☆20Apr 10, 2025Updated 10 months ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- Accepted to ICLR 2025. MetaMetrics is a calibrated meta-metric designed to evaluate generation tasks across different modalities aligned …☆14Dec 30, 2024Updated last year
- CUDA implementation of Multidimensional Scaling☆15May 8, 2021Updated 4 years ago
- ☆13Nov 30, 2022Updated 3 years ago
- Explaining Generative Diffusion Models via Visual Analysis for Interpretable Decision-Making Process☆12May 13, 2024Updated last year
- Central Alaskan Yup'ik FST morphological analyzer/generator☆13Feb 4, 2026Updated last month
- Official Code for ICLR 2023 Paper: A Message Passing Perspective on Learning Dynamics of Contrastive Learning☆11Mar 9, 2023Updated 2 years ago
- 实现Android手机下 类似今日头条 视频播放列表☆12Apr 23, 2019Updated 6 years ago
- 🐥 Code and Dataset for our EMNLP 2022 paper - "ProsocialDialog: A Prosocial Backbone for Conversational Agents"☆65Aug 2, 2023Updated 2 years ago