econpy / google-ngramsLinks
Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram data was originally modified from the script at www.culturomics.org.
☆254Updated 4 years ago
Alternatives and similar repositories for google-ngrams
Users that are interested in google-ngrams are comparing it to the libraries listed below
Sorting:
- ☆97Updated 4 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆109Updated 10 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆142Updated 7 years ago
- A toolkit for corpus linguistics☆205Updated 6 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆260Updated 9 years ago
- A simple interface to the Project Gutenberg corpus.☆329Updated 2 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆316Updated 3 years ago
- An open-source CRF Reference String Parsing Package☆160Updated 5 years ago
- topic model visualization☆51Updated 10 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- a collection of functions that measure the readability of a given body of text☆195Updated 7 years ago
- Data Server for Topic Models☆121Updated 2 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus☆147Updated 3 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- Python package for stylometry☆63Updated 4 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- Quickly extract multi-word phrases from a corpus☆194Updated 5 years ago
- A large corpus of discourse annotations and relations on ~10K forum threads.☆240Updated 6 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆66Updated 3 years ago
- Collection of tools for building diachronic/historical word vectors☆438Updated last year
- The Python-language successor to the TABARI event-data coding software.☆45Updated 8 years ago
- ☆151Updated 5 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆58Updated 7 years ago
- rapid nlp prototyping☆71Updated 2 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆154Updated 10 months ago
- A command-line program to download text corpora.☆34Updated 8 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- Package for Statistically significant linguistic change☆56Updated 2 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 10 years ago
- Temporal Expression Recognition and Normalisation in Python☆77Updated 9 years ago