econpy / google-ngrams
Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The retrieval script was adapted from the script at www.culturomics.org.
☆253 Updated 4 years ago
Alternatives and similar repositories for google-ngrams:
Users who are interested in google-ngrams are comparing it to the libraries listed below.
- ☆97 Updated 3 years ago
- ☆151 Updated 5 years ago
- Python port of the Twokenize class of ark-tweet-nlp ☆141 Updated 6 years ago
- a collection of functions that measure the readability of a given body of text ☆191 Updated 7 years ago
- Data Server for Topic Models ☆120 Updated 2 years ago
- A toolkit for corpus linguistics ☆205 Updated 5 years ago
- Socially-Equitable Language Identification ☆78 Updated 2 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus ☆147 Updated 2 years ago
- A Dependency Parser for Tweets ☆78 Updated 5 years ago
- Collection of tools for building diachronic/historical word vectors ☆427 Updated last year
- Stanford NLP group's shared Python tools. ☆137 Updated 7 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks. ☆107 Updated 9 years ago
- Quickly extract multi-word phrases from a corpus ☆191 Updated 4 years ago
- English data ☆206 Updated last week
- A simple interface to the Project Gutenberg corpus. ☆326 Updated 2 years ago
- BLLIP reranking parser (also known as Charniak-Johnson parser, Charniak parser, Brown reranking parser) See http://pypi.python.org/pypi/b… ☆226 Updated 3 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik… ☆259 Updated 8 years ago
- Various utilities for processing the data. ☆208 Updated this week
- An implementation of latent Dirichlet allocation in javascript ☆184 Updated 2 years ago
- topic model visualization ☆52 Updated 10 years ago
- A large corpus of discourse annotations and relations on ~10K forum threads. ☆239 Updated 6 years ago
- Python port of Mikolov's word2phrase.c from the word2vec toolkit ☆111 Updated 5 years ago
- A hack to replace Pride & Prejudice text with closest word2vec model word, and visualize results. ☆61 Updated 10 years ago
- A command-line program to download text corpora. ☆34 Updated 7 years ago
- PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, an… ☆477 Updated last year
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016… ☆67 Updated 2 years ago
- A multilingual dependency parser based on linear programming relaxations. ☆115 Updated 6 years ago
- Python wrapper for Stanford CoreNLP ☆355 Updated 4 years ago
- The Python-language successor to the TABARI event-data coding software. ☆45 Updated 7 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs ☆28 Updated 9 years ago