econpy / google-ngrams
Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram data was originally modified from the script at www.culturomics.org.
☆250Updated 3 years ago
Related projects: ⓘ
- ☆95Updated 3 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆142Updated 6 years ago
- a collection of functions that measure the readability of a given body of text☆189Updated 7 years ago
- A toolkit for corpus linguistics☆199Updated 5 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus☆146Updated 2 years ago
- ☆151Updated 4 years ago
- Data Server for Topic Models☆121Updated last year
- A large corpus of discourse annotations and relations on ~10K forum threads.☆238Updated 5 years ago
- Quickly extract multi-word phrases from a corpus☆190Updated 4 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- A simple interface to the Project Gutenberg corpus.☆320Updated last year
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆152Updated 5 years ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆47Updated last week
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- Package for Statistically significant linguistic change☆52Updated last year
- relationship modeling networks (NAACL 2016)☆87Updated 3 years ago
- topic model visualization☆51Updated 9 years ago
- Python wrapper for Stanford CoreNLP tools☆58Updated 8 years ago
- iPython-based tutorial in Noun Phrase chunking with the NLTK. Written to accompany PyCon 2015 poster presentation.☆17Updated 9 years ago
- An open-source CRF Reference String Parsing Package☆155Updated 4 years ago
- Collection of tools for building diachronic/historical word vectors☆417Updated 9 months ago
- Various utilities for processing the data.☆203Updated this week
- Socially-Equitable Language Identification☆78Updated last year
- Stanford NLP group's shared Python tools.☆139Updated 6 years ago
- Tokenization and pre-processing for Twitter data used to train classifiers.☆71Updated 7 years ago
- NLTK Contrib☆166Updated 6 months ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆106Updated 9 years ago
- A Dependency Parser for Tweets☆79Updated 5 years ago
- Vector Space Model Framework developed for InPhO☆35Updated 4 years ago
- A toolkit for coreference resolution and error analysis.☆129Updated 4 years ago