ewulczyn / wiki-detoxView external linksLinks
See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse
☆150Aug 3, 2020Updated 5 years ago
Alternatives and similar repositories for wiki-detox
Users that are interested in wiki-detox are comparing it to the libraries listed below
Sorting:
- ☆332Updated this week
- This repository contains all new resources that were created for the NAACL-2018 paper "Inducing a Lexicon of Abusive Words -- A Feature-B…☆29Mar 14, 2019Updated 6 years ago
- Statistics on multilingual datasets☆17Jul 12, 2022Updated 3 years ago
- A toolkit for generating paraphrase vector representations for words in context☆23May 19, 2015Updated 10 years ago
- ☆234Dec 27, 2016Updated 9 years ago
- ☆68Oct 28, 2021Updated 4 years ago
- Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017☆834Jun 12, 2023Updated 2 years ago
- This repository contains material of a teaching innovation project in Universitat de Barcelona: "Intelligent Support System for Tutor of …☆10Jun 30, 2020Updated 5 years ago
- On-the-fly Table Generation - SIGIR'18☆10Feb 1, 2020Updated 6 years ago
- HPYLMのC++実装☆11May 2, 2017Updated 8 years ago
- tensorflow2 implementation of SnapMix as described in SnapMix: Semantically Proportional Mixing for Augmenting Fine-grained Data☆11Feb 4, 2021Updated 5 years ago
- Question Dependent Recurrent Entity Network☆13Sep 21, 2017Updated 8 years ago
- Playing in Kaggle Playground☆11Oct 3, 2023Updated 2 years ago
- ☆12Sep 26, 2019Updated 6 years ago
- Speaker Role Contextual Model for Dialogues☆15Sep 30, 2017Updated 8 years ago
- ☆13Aug 28, 2018Updated 7 years ago
- ☆21Jul 15, 2024Updated last year
- ☆14Aug 26, 2016Updated 9 years ago
- Source code repo for paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"☆10Aug 11, 2023Updated 2 years ago
- ☆24Mar 23, 2018Updated 7 years ago
- Course materials for Sta 104 - Summer 2015 semester at Duke University☆22Jun 18, 2015Updated 10 years ago
- Code for the Kaggle insult competition☆30Apr 25, 2015Updated 10 years ago
- Document exploration tool☆12Sep 6, 2016Updated 9 years ago
- ☆13Sep 18, 2019Updated 6 years ago
- ☆15Feb 25, 2021Updated 4 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018☆15Nov 17, 2019Updated 6 years ago
- ☆55Mar 24, 2022Updated 3 years ago
- Toxicity Detection in Context: Assuming that the comment exists in a thread and that the parent comment or/and the discussion topic are e…☆29Jul 21, 2023Updated 2 years ago
- Repo for the EACL2017 tutorial on imitation learning☆28Apr 3, 2017Updated 8 years ago
- Distributed storage with REST API & dispatcher for backend services☆14Sep 20, 2017Updated 8 years ago
- generative-camouflaged-spam-detector☆11Aug 20, 2020Updated 5 years ago
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 4 years ago
- A Wikipedia-based summarization dataset☆14Mar 27, 2023Updated 2 years ago
- Röttger et al. (WOAH at NAACL 2022): "Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models"☆17May 23, 2022Updated 3 years ago
- Codebase for HYPHEN, accepted at ACL 2022 (main)☆11May 17, 2022Updated 3 years ago
- Bag of, not words, but tricks!☆68Oct 31, 2023Updated 2 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆19Jan 8, 2026Updated last month
- Includes chainer code used to get 1.24 bpc on hutter prize☆15Oct 12, 2017Updated 8 years ago