A simple interface to the Project Gutenberg corpus.
☆333 Jan 12, 2023 Updated 3 years ago
Alternatives and similar repositories for gutenberg
Users that are interested in gutenberg are comparing it to the libraries listed below.
- A simple interface to the Project Gutenberg corpus. ☆17 Dec 23, 2015 Updated 10 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus ☆216 Jan 5, 2024 Updated 2 years ago
- Selected code and data for The Online Books Page and related applications ☆11 Apr 1, 2026 Updated last month
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆116 Sep 8, 2018 Updated 7 years ago
- A command-line program to download text corpora. ☆34 Aug 12, 2017 Updated 8 years ago
- Python package for stylometry ☆64 Mar 30, 2021 Updated 5 years ago
- django repo for running the GITenberg website ☆41 Jan 29, 2025 Updated last year
- A corpus of poetry from Project Gutenberg ☆217 Aug 13, 2018 Updated 7 years ago
- A content-based recommender system for books using the Project Gutenberg text corpus ☆29 Feb 20, 2017 Updated 9 years ago
- ☆32 Mar 14, 2017 Updated 9 years ago
- Histonets is an application to convert images of scanned maps into digital networks ☆20 Oct 16, 2017 Updated 8 years ago
- lemon lexicon for DBpedia ☆28 Oct 13, 2015 Updated 10 years ago
- Find whole sentences matching a regex in Project Gutenberg ☆32 Feb 5, 2023 Updated 3 years ago
- This repo contains the exercises of the Next.ML 2015 presentation ☆24 Jan 17, 2015 Updated 11 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models ☆49 Jul 13, 2017 Updated 8 years ago
- A simple Python interface for Darius Kazemi's Corpora Project. ☆124 Feb 7, 2020 Updated 6 years ago
- a python package for cleaning Gutenberg books and dataset ☆35 May 2, 2025 Updated last year
- ☆19 Nov 30, 2021 Updated 4 years ago
- Python port of Kate Compton's Tracery text expansion library. ☆257 Mar 8, 2024 Updated 2 years ago
- Search, download, and process public domain texts from Project Gutenberg ☆114 Apr 14, 2026 Updated 3 weeks ago
- PyPy.js example usage of: https://github.com/pypyjs/pypyjs-release ☆15 Oct 9, 2018 Updated 7 years ago
- Stylometric framework in Python ☆17 Apr 9, 2015 Updated 11 years ago
- Parses Wikipedia citation templates in Python ☆17 Mar 26, 2025 Updated last year
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆316 Feb 4, 2022 Updated 4 years ago
- OH: This is just python 2.7 fanfic isn't it? ☆21 Sep 13, 2019 Updated 6 years ago
- Strips boilerplate from Project Gutenberg text files ☆17 Jul 28, 2021 Updated 4 years ago
- National Novel Generation Month, 2016 edition. ☆161 Sep 30, 2023 Updated 2 years ago
- R package for finding related words through the Datamuse API ☆25 Nov 30, 2025 Updated 5 months ago
- Live clone of https://sourceforge.net/p/docutils/code/HEAD/tree/. ☆21 Apr 17, 2026 Updated 2 weeks ago
- The JSON files from CourtListener.com for the Supreme Court of the United States ☆11 Jul 9, 2015 Updated 10 years ago
- Tools in python for dealing with Google Books Ngram files and other similar data sets. ☆19 May 7, 2014 Updated 11 years ago
- A grunt-init template for text generating pages with twitter/link sharing. ☆10 Aug 13, 2015 Updated 10 years ago
- Django-crowdsourcing is a highly configurable survey and report tool for journalists, with a feature set that supports a wide range of us… ☆25 Nov 5, 2013 Updated 12 years ago
- National Novel Generation Month, 2023 edition. ☆27 Oct 2, 2024 Updated last year
- Python webservices (APIs) for Django, Flask and Twisted. ☆23 Apr 10, 2018 Updated 8 years ago
- CSV on the Web parser ☆17 Mar 2, 2026 Updated 2 months ago
- ☆11 Jun 13, 2020 Updated 5 years ago
- Command line interface to find and download open datasets ☆20 Apr 18, 2015 Updated 11 years ago
- Convert your text ᶦᶰᵗᵒ ᵗᶦᶰᶦᵉʳ ᵗᵉˣᵗ ☆23 Updated this week