erikavaris / tokenizer
Tokenizer for Twitter and Reddit data
☆45Updated 5 years ago
Alternatives and similar repositories for tokenizer:
Users that are interested in tokenizer are comparing it to the libraries listed below
- A Dependency Parser for Tweets☆78Updated 5 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 8 years ago
- ☆104Updated 6 years ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Updated 6 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆141Updated 6 years ago
- A framework to identify relations between ideas in temporal text corpora.☆28Updated 6 years ago
- Predict edit intentions on Wikipedia☆19Updated 6 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago
- Code to reproduce experiments from the EMNLP 2015 paper about Rumour Stance Classification with Gaussian Processes.☆36Updated 8 years ago
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution☆22Updated 5 years ago
- Visualize word embeddings of a vocabulary in TensorBoard, including the neighbors☆46Updated 7 years ago
- Temporal Word Analogies in Python☆18Updated 7 years ago
- Sparse Additive Generative Model of Text☆87Updated 8 years ago
- public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM☆17Updated 6 years ago
- Utility scripts in Python☆37Updated 7 months ago
- Code to compute topic coherence for several topic cardinalities and aggregate scores across them☆22Updated 2 weeks ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases☆45Updated 4 years ago
- Code & Data for the Paper "Learning Word Relatedness over Time", EMNLP 2017☆12Updated 3 years ago
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- scripts and data for ACL 16 paper☆14Updated 8 years ago
- LexNET: Integrated Path-based and Distributional Method for Lexical Semantic Relation Classification☆62Updated 6 years ago
- ☆54Updated 3 years ago
- Mining Argument Structures with Expressive Inference (Linear and LSTM Engines)☆65Updated 7 years ago
- ☆15Updated 5 years ago
- CLEARumor: ConvoLving ELMo against Rumors☆11Updated 7 months ago
- Incremental learning of word embeddings with context informativeness.☆94Updated last year
- Counter-fitting Word Vectors to Linguistic Constraints☆144Updated 4 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆63Updated 7 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 3 years ago
- Non-distributional linguistic word vector representations.☆62Updated 7 years ago