ewulczyn / wiki-detox
See https://meta.wikimedia.org/wiki/Research:Modeling_Talk_Page_Abuse
☆152Updated 4 years ago
Alternatives and similar repositories for wiki-detox:
Users that are interested in wiki-detox are comparing it to the libraries listed below
- Python port of the Twokenize class of ark-tweet-nlp☆141Updated 6 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆138Updated 2 years ago
- Code and data for inducing domain-specific sentiment lexicons.☆195Updated 8 months ago
- ☆325Updated 3 weeks ago
- A baseline implementation for FNC-1☆138Updated 3 years ago
- Deep Learning models to detect hate speech in tweets☆217Updated 7 years ago
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- Making sense embedding out of word embeddings using graph-based word sense induction☆213Updated 3 years ago
- An introduction to using spaCy for NLP and machine learning☆191Updated 3 years ago
- Getting started with AllenNLP and PyTorch by training a tweet classifier☆66Updated 7 years ago
- A Dependency Parser for Tweets☆78Updated 5 years ago
- CrisisLex: Your data and lexical resource in crises☆52Updated last year
- A python module to get the emotion of a word.☆75Updated 6 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 8 years ago
- Automatic labeling for topic model☆57Updated 9 years ago
- SFU Opinion and Comments Corpus☆91Updated last year
- Text classification example in Python using Latent Semantic Analysis (LSA)☆105Updated 6 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆90Updated 6 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated last year
- Tutorial on computational models of language change☆114Updated 5 years ago
- ☆40Updated 9 years ago
- Tokenization and pre-processing for Twitter data used to train classifiers.☆72Updated 8 years ago
- This contains materials for the word embeddings workshop☆126Updated 7 years ago
- Stanford NLP group's shared Python tools.☆137Updated 7 years ago
- Instructions & code for the EuroPython 2014 training session "Topic Modeling for Fun and Profit"☆110Updated 10 years ago
- Data and analysis for the BuzzFeed News article, "Hyperpartisan Facebook Pages Are Publishing False And Misleading Information At An Alar…☆110Updated 8 years ago
- Topic Modelling for Humans☆40Updated 7 years ago
- Retrofitting Word Vectors to Semantic Lexicons☆375Updated 5 years ago