bwbaugh / twitter-corpusLinks
Collects all tweets from the sample Public stream using Twitter's streaming API, and saves them to a file for later use as a corpus.
☆45Updated 4 years ago
Alternatives and similar repositories for twitter-corpus
Users that are interested in twitter-corpus are comparing it to the libraries listed below
Sorting:
- Non-distributional linguistic word vector representations.☆62Updated 7 years ago
- Python wrapper for Stanford CoreNLP tools☆58Updated 9 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 9 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Updated 2 years ago
- Natural Language Inference Dataset Generation☆29Updated 8 years ago
- ☆46Updated 7 years ago
- ESA implementation using Wikiprep output☆56Updated 11 years ago
- A python wrapper around the ZPar parser for English.☆49Updated 4 years ago
- SUMPY: a python automatic text summarization library☆55Updated 9 years ago
- Semantic Textual Similarity in Python☆80Updated 8 years ago
- Labeled examples from wiki dumps in Python☆67Updated 8 years ago
- Code to train and use models from "Charagram: Embedding Words and Sentences via Character n-grams".☆124Updated 8 years ago
- An autoencoder to calculate word embeddings as mentioned in Lebret/Collobert paper 2015☆74Updated 8 years ago
- ☆22Updated 7 years ago
- TREC Real-Time Summarization Tools☆15Updated 7 years ago
- Code of NAACL paper "Unsupervised Multi-Domain Adaptation with Feature Embeddings"☆33Updated 10 years ago
- CogComp's light-weight Python NLP annotators☆115Updated 6 years ago
- Word vectors☆64Updated 7 years ago
- Document context language models☆22Updated 9 years ago
- Python port of Mikolov's word2phrase.c from the word2vec toolkit☆111Updated 5 years ago
- Implementation of Word Embedding-based Antonym Detection using Thesauri and Distributional Information in NAACL2015☆35Updated 3 years ago
- End-to-end relation extraction and knowledge base population pipeline.☆48Updated 8 years ago
- Query-Document Relevance☆42Updated 10 years ago
- Standalone Semanticizer☆32Updated 10 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- ☆24Updated 8 years ago
- WordRank: Learning Word Embeddings via Robust Ranking☆51Updated 6 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings☆53Updated 8 years ago
- A convolutional neural network library for NLP.☆60Updated 7 years ago
- Multi-Perspective Convolutional Neural Networks for modeling textual similarity (He et al., EMNLP 2015)☆106Updated 7 years ago