econpy / google-ngramsLinks
Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram data was originally modified from the script at www.culturomics.org.
☆254Updated 5 years ago
Alternatives and similar repositories for google-ngrams
Users that are interested in google-ngrams are comparing it to the libraries listed below
Sorting:
- ☆98Updated 4 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆108Updated 10 years ago
- a collection of functions that measure the readability of a given body of text☆196Updated 8 years ago
- A toolkit for corpus linguistics☆206Updated 6 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆142Updated 7 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆260Updated 9 years ago
- Quickly extract multi-word phrases from a corpus☆195Updated 5 years ago
- A simple interface to the Project Gutenberg corpus.☆331Updated 3 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆316Updated 4 years ago
- Data Server for Topic Models☆122Updated 2 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆58Updated 7 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus☆148Updated 3 years ago
- ☆151Updated 6 years ago
- An open-source CRF Reference String Parsing Package☆160Updated 5 years ago
- Temporal Expression Recognition and Normalisation in Python☆77Updated 10 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆76Updated 8 months ago
- A Python library to calculate the readability score of a text.☆141Updated 8 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- A large corpus of discourse annotations and relations on ~10K forum threads.☆241Updated 7 years ago
- topic model visualization☆51Updated 10 years ago
- Tokenization and pre-processing for Twitter data used to train classifiers.☆72Updated 9 years ago
- The Python-language successor to the TABARI event-data coding software.☆45Updated 8 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆155Updated last year
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆69Updated 6 years ago
- 💫 Scripts, tools and resources for developing spaCy☆126Updated 6 years ago
- Package for Statistically significant linguistic change☆56Updated 3 years ago
- Python port of Mikolov's word2phrase.c from the word2vec toolkit☆111Updated 5 years ago
- Excitement Open Platform for Recognizing Textual Entailments☆89Updated 8 years ago