myleott / ark-twokenize-pyView external linksLinks
Python port of the Twokenize class of ark-tweet-nlp
☆142May 4, 2018Updated 7 years ago
Alternatives and similar repositories for ark-twokenize-py
Users that are interested in ark-twokenize-py are comparing it to the libraries listed below
Sorting:
- CMU ARK Twitter Part-of-Speech Tagger☆577Dec 17, 2023Updated 2 years ago
- A Dependency Parser for Tweets☆78Sep 5, 2019Updated 6 years ago
- Simple Python wrapper around runTagger.sh of ark-tweet-nlp☆67Dec 1, 2018Updated 7 years ago
- An implementation of Color2Gray with convolutional neural networks☆11Dec 23, 2015Updated 10 years ago
- Code and data for Teddy https://arxiv.org/abs/2001.05171.☆15Jun 21, 2022Updated 3 years ago
- Crisis Event Extraction Service (CREES)☆15Feb 4, 2019Updated 7 years ago
- C# implementation of Peter Norvig’s spelling corrector☆10Feb 24, 2023Updated 2 years ago
- Named Entity Disambiguation for Noisy Text☆66Jun 26, 2017Updated 8 years ago
- Python library for fitting massive mixture models using DP priors and GPU computation.☆23Apr 7, 2016Updated 9 years ago
- Tokenizer for Twitter and Reddit data☆45Apr 14, 2019Updated 6 years ago
- How (but not why) to do Twitter sociolinguistic analysis in the Unix Shell☆10Apr 19, 2016Updated 9 years ago
- Code and pre-trained model used in ACL 2013 demo paper☆13Dec 7, 2014Updated 11 years ago
- TBEEF, a doubly ensemble framework for recommendation and prediction problems.☆20Apr 16, 2016Updated 9 years ago
- This projects hosts an annotated dataset of 39 transcripts of United States presidential election debates annotated with argument compone…☆12Jun 3, 2019Updated 6 years ago
- An OpenCalais API Interface for Python.☆21Mar 13, 2012Updated 13 years ago
- Extracting Entities with Limited Evidence☆16Dec 26, 2022Updated 3 years ago
- Tweets annotated with coarse-grained sense labels (supersenses)☆13Jun 13, 2014Updated 11 years ago
- A K-Means implementation for KL-Divergence (instead of squared euclidean distance)☆14Jul 7, 2015Updated 10 years ago
- An API for querying the latest Corona / COVID-19 data☆12May 22, 2023Updated 2 years ago
- Interpreting Sarcasm with Sentiment Based Monolingual Machine Translation☆11May 7, 2017Updated 8 years ago
- MiTextExplorer - interactive browser of text and document covariates.☆24Jun 17, 2015Updated 10 years ago
- A Neural Model for User Geolocation and Lexical Dialectology☆16Nov 11, 2018Updated 7 years ago
- Software for the paper "Gender and Lexical Variation in Social Media" with David Bamman and Tyler Schnoebelen☆17Nov 10, 2015Updated 10 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- Python tools for data analysis☆19May 21, 2019Updated 6 years ago
- TREC Real-Time Summarization Tools☆15Jul 19, 2017Updated 8 years ago
- probabilistic language corrector based on google ngrams☆21May 31, 2011Updated 14 years ago
- ☆17Jan 30, 2025Updated last year
- Recurrent versus Recursive Approaches Towards Compositionality in Semantic Vector Spaces.☆13Sep 22, 2021Updated 4 years ago
- ☆36Oct 1, 2020Updated 5 years ago
- Active Learning for text classification using scikit-learn☆24Jun 6, 2019Updated 6 years ago
- UT Austin Machine Learning Group Latent Variable Modeling Toolkit☆26Feb 2, 2012Updated 14 years ago
- Python modules and scripts for working with Concrete, a data serialization format for NLP☆21Oct 20, 2023Updated 2 years ago
- Slides and coding demo for word2vec☆12Nov 14, 2016Updated 9 years ago
- Pretrained Biomedical Name Encoder☆15Jul 28, 2019Updated 6 years ago
- A web interface to understand language-specific BERT-models☆18Apr 16, 2024Updated last year
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 3 years ago
- ☆42Nov 17, 2016Updated 9 years ago
- OKR: A Consolidated Open Knowledge Representation for Multiple Texts☆41Jan 25, 2018Updated 8 years ago