A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
☆21Apr 8, 2025Updated 11 months ago
Alternatives and similar repositories for honest
Users that are interested in honest are comparing it to the libraries listed below.
- Natural language understanding benchmarks for Norwegian☆14Aug 29, 2025Updated 6 months ago
- An NLP framework to find hate speech comments in a comments corpus.☆11Dec 8, 2022Updated 3 years ago
- CMU Linguistic Annotation Backend☆15Sep 22, 2025Updated 6 months ago
- Dataset + classifier tools to study social perception biases in natural language generation☆71Jun 12, 2023Updated 2 years ago
- ☆10Nov 8, 2022Updated 3 years ago
- Tokun to can tokens☆18Jun 19, 2025Updated 9 months ago
- Generalized Sentiment Classifier finetuned by KoELECTRA☆11Nov 28, 2024Updated last year
- A benchmark of programming tasks for LLMs that supports almost any programming language.☆13Jun 30, 2025Updated 8 months ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆12Dec 15, 2021Updated 4 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆200Dec 8, 2022Updated 3 years ago
- ☆37Nov 14, 2025Updated 4 months ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆61Nov 2, 2022Updated 3 years ago
- Automatically modelling and distilling knowledge within AI. In other words, summarising the AI research firehose.☆24Mar 15, 2019Updated 7 years ago
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms☆16Dec 6, 2023Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆52Sep 26, 2024Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆68Feb 8, 2023Updated 3 years ago
- ☆17Mar 6, 2025Updated last year
- Trained Neural Networks (LSTM, HybridCNN/LSTM, PyramidCNN, Transformers, etc.) & comparison for the task of Hate Speech Detection on the …☆21Dec 14, 2021Updated 4 years ago
- Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement"☆23Jun 25, 2022Updated 3 years ago
- ☆10Dec 12, 2023Updated 2 years ago
- This repository keeps my research materials about Named Entity Recognition using Transfer Learning☆10Oct 15, 2020Updated 5 years ago
- Visual Clustering: Clustering Plotted Data by Image Segmentation☆25Feb 25, 2025Updated last year
- A pre-commit hook for Pyrefly.☆23Mar 19, 2026Updated last week
- small C coroutine library based on pypy's stacklet and boost context☆12Jan 28, 2018Updated 8 years ago
- Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control☆76Nov 13, 2022Updated 3 years ago
- Code to compute topic coherence for several topic cardinalities and aggregate scores across them☆21Sep 10, 2025Updated 6 months ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.☆11Aug 20, 2024Updated last year
- code for our paper "Understanding by Understanding Not: Modeling Negation in Language Models"☆16Aug 15, 2022Updated 3 years ago
- Repository for the paper "Thou shalt not hate: Countering Online Hate Speech" accepted at ICWSM 2019.☆32Mar 25, 2023Updated 3 years ago
- Turkish and English Dataset from "Large-Scale Hate Speech Detection with Cross-Domain Transfer"☆29Oct 11, 2023Updated 2 years ago
- A multilingual lexicon of words to hurt.☆95Oct 10, 2025Updated 5 months ago
- R code to get co-citation networks on social networks in the social sciences vs physics and computer science using Web of Science data.☆22Jan 28, 2015Updated 11 years ago
- Benchmarking library for image manipulation detection.☆15Mar 29, 2023Updated 2 years ago
- A Python wrapper for the bioRxiv API.☆10Aug 18, 2021Updated 4 years ago
- Lightweight piece tokenization library☆12Apr 15, 2024Updated last year
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆87Mar 2, 2021Updated 5 years ago
- The page with all info about SOTA text detoxification models and datasets.☆13Apr 2, 2025Updated 11 months ago
- Implementation of some algorithms for text clustering☆14Sep 5, 2018Updated 7 years ago
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆89Aug 20, 2021Updated 4 years ago