See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse
☆151 · Aug 3, 2020 · Updated 5 years ago
Alternatives and similar repositories for wiki-detox
Users who are interested in wiki-detox are comparing it to the repositories listed below.
- ☆332 · Feb 25, 2026 · Updated last week
- Statistics on multilingual datasets ☆17 · Jul 12, 2022 · Updated 3 years ago
- ☆234 · Dec 27, 2016 · Updated 9 years ago
- ☆68 · Oct 28, 2021 · Updated 4 years ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017 ☆836 · Jun 12, 2023 · Updated 2 years ago
- Question Dependent Recurrent Entity Network ☆13 · Sep 21, 2017 · Updated 8 years ago
- Data and code for APPDIA: A Discourse-aware Transformer-based Style Transfer Model for Offensive Social Media Conversations (COLING 2022)… ☆13 · Sep 8, 2022 · Updated 3 years ago
- ☆14 · Aug 26, 2016 · Updated 9 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation" ☆10 · Aug 11, 2023 · Updated 2 years ago
- ☆21 · Jul 15, 2024 · Updated last year
- ☆12 · Sep 26, 2019 · Updated 6 years ago
- Course materials for Sta 104 - Summer 2015 semester at Duke University ☆22 · Jun 18, 2015 · Updated 10 years ago
- Document exploration tool ☆12 · Sep 6, 2016 · Updated 9 years ago
- ☆55 · Mar 24, 2022 · Updated 3 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys… ☆12 · Jan 26, 2022 · Updated 4 years ago
- ☆15 · Feb 25, 2021 · Updated 5 years ago
- [0.9.9 Released] A high performance non-SPARQL based RDF data cube validator ☆16 · Mar 11, 2016 · Updated 9 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018 ☆15 · Nov 17, 2019 · Updated 6 years ago
- Codebase for HYPHEN, accepted at ACL 2022 (main) ☆11 · May 17, 2022 · Updated 3 years ago
- Distributed implementation of Robust PLSA using Spark ☆12 · Apr 29, 2021 · Updated 4 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing ☆14 · Feb 10, 2023 · Updated 3 years ago
- A Wikipedia-based summarization dataset ☆14 · Mar 27, 2023 · Updated 2 years ago
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models" ☆17 · May 23, 2022 · Updated 3 years ago
- Distributed storage with REST API & dispatcher for backend services ☆14 · Sep 20, 2017 · Updated 8 years ago
- generative-camouflaged-spam-detector ☆11 · Aug 20, 2020 · Updated 5 years ago
- Bag of, not words, but tricks! ☆68 · Oct 31, 2023 · Updated 2 years ago
- Source code for paper "Generative Flow Network for Listwise Recommendation" ☆17 · Nov 8, 2024 · Updated last year
- Content-based Recommendation Generator ☆13 · Jan 21, 2015 · Updated 11 years ago
- Supplementary material for MSR2017 paper Structure and Evolution of Package Dependency Networks ☆18 · Oct 8, 2018 · Updated 7 years ago
- Source code accompanying the ICLR2020 publication 'Massively Multilingual Sparse Word Representations' https://openreview.net/forum?id=Hy… ☆12 · Aug 15, 2023 · Updated 2 years ago
- Includes chainer code used to get 1.24 bpc on hutter prize ☆15 · Oct 12, 2017 · Updated 8 years ago
- Scripts for WASSA-2017 Shared Task on Emotion Intensity ☆14 · Oct 4, 2017 · Updated 8 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer … ☆57 · Jan 1, 2021 · Updated 5 years ago
- SFU Opinion and Comments Corpus ☆92 · Jun 19, 2025 · Updated 8 months ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python ☆112 · Dec 16, 2025 · Updated 2 months ago
- PyTorch Implementation of Hierarchical Multiscale Recurrent Neural Networks ☆15 · Nov 13, 2018 · Updated 7 years ago
- Implementation of provably Rawlsian fair ML algorithms for contextual bandits. ☆14 · May 10, 2017 · Updated 8 years ago
- a single interface around speech-to-speech foundation models ☆27 · Jun 27, 2025 · Updated 8 months ago
- The code of SKS ☆15 · Mar 22, 2022 · Updated 3 years ago