c-w / gutenbergLinks
A simple interface to the Project Gutenberg corpus.
☆328Updated 2 years ago
Alternatives and similar repositories for gutenberg
Users that are interested in gutenberg are comparing it to the libraries listed below
Sorting:
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆223Updated 2 years ago
- A HTTP interface to the Project Gutenberg corpus.☆77Updated 5 years ago
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.☆315Updated 7 years ago
- Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The Python script for retrieving ngram…☆254Updated 4 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co…☆315Updated 3 years ago
- ☆97Updated 3 years ago
- A corpus of poetry from Project Gutenberg☆203Updated 6 years ago
- A simple interface for the CMU pronouncing dictionary☆313Updated 10 months ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆150Updated this week
- Analyse rhyme scheme, metre and form of poems☆131Updated 4 years ago
- Python package for stylometry☆63Updated 4 years ago
- A command-line program to download text corpora.☆34Updated 7 years ago
- ☆30Updated 8 years ago
- a python package for cleaning Gutenberg books and dataset☆34Updated last month
- Various utilities for processing the data.☆209Updated this week
- Metadata from Project Gutenberg☆41Updated 2 months ago
- System for building, visualizing, and working with LDA topic models☆96Updated last week
- Collection of tools for building diachronic/historical word vectors☆434Updated last year
- a collection of functions that measure the readability of a given body of text☆194Updated 7 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆181Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆259Updated 9 months ago
- LingPy: Python library for quantitative tasks in historical linguistics☆134Updated 3 months ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆108Updated 6 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus☆184Updated last year
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆283Updated 3 months ago
- A simple Python interface for Darius Kazemi's Corpora Project.☆120Updated 5 years ago
- Quickly extract multi-word phrases from a corpus☆191Updated 4 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆154Updated 7 months ago
- 💫 Scripts, tools and resources for developing spaCy☆126Updated 6 years ago
- A Corpus of Quotes☆68Updated 6 years ago