econpy / google-ngramsLinks
Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram data was originally modified from the script at www.culturomics.org.
☆254Updated 4 years ago
Alternatives and similar repositories for google-ngrams
Users that are interested in google-ngrams are comparing it to the libraries listed below
Sorting:
- ☆97Updated 3 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus☆147Updated 3 years ago
- Python port of the Twokenize class of ark-tweet-nlp☆142Updated 7 years ago
- Package for Statistically significant linguistic change☆56Updated 2 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- This is a mirror of the script by Giuseppe Attardi, and contains history before the official repo started: https://github.com/attardi/wik…☆259Updated 8 years ago
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.☆315Updated 7 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆108Updated 10 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 5 years ago
- 💫 Scripts, tools and resources for developing spaCy☆126Updated 6 years ago
- An open-source CRF Reference String Parsing Package☆158Updated 5 years ago
- Stanford NLP group's shared Python tools.☆137Updated 7 years ago
- Python package for stylometry☆63Updated 4 years ago
- ☆151Updated 5 years ago
- Extract all the fields from the NY Times Corpus to a csv☆27Updated 2 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆154Updated 7 months ago
- Practical Natural Language Processing Tools for Humans. Dependency Parsing, Syntactic Constituent Parsing, Semantic Role Labeling, Named …☆193Updated 7 years ago
- ☆34Updated 3 years ago
- a collection of functions that measure the readability of a given body of text☆194Updated 7 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆315Updated 3 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- A large corpus of discourse annotations and relations on ~10K forum threads.☆240Updated 6 years ago
- topic model visualization☆52Updated 10 years ago
- Another next-generation event coding platform.☆76Updated 6 years ago
- Tokenization and pre-processing for Twitter data used to train classifiers.☆72Updated 8 years ago
- Collection of tools for building diachronic/historical word vectors☆434Updated last year
- Automatically exported from code.google.com/p/universal-pos-tags☆129Updated 3 years ago
- Python port of Mikolov's word2phrase.c from the word2vec toolkit☆111Updated 5 years ago
- A toolkit for corpus linguistics☆204Updated 6 years ago