econpy / google-ngramsLinks
Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram data was originally modified from the script at www.culturomics.org.
☆254Updated 4 years ago
Alternatives and similar repositories for google-ngrams
Users that are interested in google-ngrams are comparing it to the libraries listed below
Sorting:
- ☆97Updated 4 years ago
- a collection of functions that measure the readability of a given body of text☆195Updated 7 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆109Updated 10 years ago
- topic model visualization☆52Updated 10 years ago
- Quickly extract multi-word phrases from a corpus☆193Updated 5 years ago
- A toolkit for corpus linguistics☆205Updated 6 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆315Updated 3 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Updated 3 years ago
- Data Server for Topic Models☆121Updated 2 years ago
- An open-source CRF Reference String Parsing Package☆160Updated 5 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆142Updated 7 years ago
- Python 2 & 3 wrapper around the Stanford Topic Modeling Toolbox. Intended to be used for hassle-free supervised topic classification with…☆58Updated 7 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus☆147Updated 3 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆260Updated 8 years ago
- Python package for stylometry☆63Updated 4 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- ☆34Updated 3 years ago
- wpcorpus - NLP corpus based on Wikipedia's full article dump☆97Updated 9 years ago
- The Python-language successor to the TABARI event-data coding software.☆45Updated 8 years ago
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.☆315Updated 7 years ago
- A large corpus of discourse annotations and relations on ~10K forum threads.☆240Updated 6 years ago
- A simple interface to the Project Gutenberg corpus.☆330Updated 2 years ago
- Quantitative Text Analysis for the digitale Geisteswissenschaften☆47Updated 10 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆102Updated 9 years ago
- Temporal Expression Recognition and Normalisation in Python☆77Updated 9 years ago
- Practical Natural Language Processing Tools for Humans. Dependency Parsing, Syntactic Constituent Parsing, Semantic Role Labeling, Named …☆194Updated 7 years ago
- Tokenization and pre-processing for Twitter data used to train classifiers.☆72Updated 8 years ago
- Collection of tools for building diachronic/historical word vectors☆437Updated last year
- The Art of Literary Text Analysis☆166Updated 6 years ago