c-w / gutenberg
A simple interface to the Project Gutenberg corpus.
☆325 Updated 2 years ago
Alternatives and similar repositories for gutenberg:
Users who are interested in gutenberg are comparing it to the libraries listed below.
- An HTTP interface to the Project Gutenberg corpus. ☆77 Updated 5 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆220 Updated last year
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor. ☆315 Updated 7 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆312 Updated 3 years ago
- A corpus of poetry from Project Gutenberg ☆198 Updated 6 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram… ☆253 Updated 4 years ago
- A toolkit for corpus linguistics ☆205 Updated 5 years ago
- Scraper for downloading the entire ebooks repository of Project Gutenberg ☆144 Updated this week
- A simple interface for the CMU pronouncing dictionary ☆311 Updated 7 months ago
- Analyse rhyme scheme, metre and form of poems ☆130 Updated 3 years ago
- A textual corpus database for the digital humanities. ☆61 Updated 4 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆107 Updated 6 years ago
- Metadata from Project Gutenberg ☆41 Updated 2 months ago
- A Corpus of Quotes ☆68 Updated 5 years ago
- A command-line program to download text corpora. ☆34 Updated 7 years ago
- ☆97 Updated 3 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆254 Updated 6 months ago
- Collection of tools for building diachronic/historical word vectors ☆425 Updated last year
- A simple Python interface for Darius Kazemi's Corpora Project. ☆120 Updated 5 years ago
- A collection of functions that measure the readability of a given body of text ☆191 Updated 7 years ago
- ☆33 Updated 3 years ago
- Practical Approaches to Data Science with Text ☆39 Updated 5 years ago
- MediaWiki API wrapper in Python http://pymediawiki.readthedocs.io/en/latest/ ☆182 Updated 2 months ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus ☆148 Updated 2 years ago
- The WordSeer text analysis tool, written in Flask. ☆42 Updated 9 years ago
- ☆30 Updated 8 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET. ☆108 Updated 4 years ago
- An intelligent reading agent that understands text and translates it into Wikidata statements. ☆115 Updated 8 years ago
- Various utilities for processing the data. ☆208 Updated this week
- Official releases of the PROIEL treebank of ancient Indo-European languages ☆37 Updated last year