erikavaris / tokenizer
Tokenizer for Twitter and Reddit data
☆43, updated 5 years ago
Related projects
Alternatives and complementary repositories for tokenizer
- A Dependency Parser for Tweets (☆79, updated 5 years ago)
- A framework to identify relations between ideas in temporal text corpora. (☆29, updated 6 years ago)
- Public repository of the interdisciplinary working group 'Hatespeech' of the research training group UCSM (☆17, updated 5 years ago)
- Sparse Additive Generative Model of Text (☆86, updated 8 years ago)
- Visualize word embeddings of a vocabulary in TensorBoard, including the neighbors (☆45, updated 7 years ago)
- Code to reproduce experiments from the EMNLP 2015 paper about Rumour Stance Classification with Gaussian Processes. (☆35, updated 8 years ago)
- C++ implementation of Generalised Brown clustering and Python scripts for feature generation (☆41, updated 8 years ago)
- (☆104, updated 6 years ago)
- Processing the MPQA Corpus (☆27, updated 6 years ago)
- (☆44, updated 6 years ago)
- Named Entity Disambiguation for Noisy Text (☆67, updated 7 years ago)
- Unsupervised method for extracting quotation-speaker pairs from large news corpora. (☆27, updated 6 years ago)
- Code to accompany "A Neural Framework for Generalized Topic Models" (☆67, updated 6 years ago)
- Python port of the Twokenize class of ark-tweet-nlp (☆141, updated 6 years ago)
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania (☆19, updated 3 years ago)
- Code for the implementation of Tweet2Vec (☆61, updated 6 years ago)
- A Large Automatically-Constructed Resource of Predicate Paraphrases (☆42, updated 4 years ago)
- Predict edit intentions on Wikipedia (☆19, updated 5 years ago)
- EmoInt provides a high-level wrapper to combine various word embeddings and create ensembles from multiple trained models (☆27, updated 4 years ago)
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation (☆63, updated 6 years ago)
- Multi-Annotator Competence Estimation tool (☆63, updated 5 years ago)
- Sentence specificity prediction (☆25, updated 5 years ago)
- A framework to compare entity linking systems. (☆37, updated 6 years ago)
- Computation of the semantic interpretability of topics produced by topic models. (☆179, updated 7 years ago)
- Temporal Word Analogies in Python (☆18, updated 7 years ago)
- (☆42, updated 7 years ago)
- Extract all the fields from the NY Times Corpus to a CSV (☆26, updated 2 years ago)
- (☆66, updated last year)
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution (☆22, updated 5 years ago)