puntonim / gutenberg-bulk-downloader
Bulk downloader for free ebooks hosted at Project Gutenberg
☆18 Updated 2 years ago
Alternatives and similar repositories for gutenberg-bulk-downloader:
Users who are interested in gutenberg-bulk-downloader are comparing it to the repositories listed below.
- Wikipedia API wrapper for humans and elk. (en.wikipedia.org/w/api.php, get it?) ☆36 Updated 10 years ago
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆66 Updated last month
- An advanced, extensible web front-end for the Manatee-open corpus search engine ☆63 Updated this week
- eXternally configurable REference and Non Named Entity Recognizer ☆17 Updated 7 months ago
- Wiktionary parser tool for many language editions. ☆53 Updated 2 years ago
- Python/Flask-based website for text analysis workflow. Previous (stable) release is live at: ☆120 Updated 8 months ago
- ☆29 Updated 7 years ago
- A textual corpus database for the digital humanities. ☆60 Updated 4 years ago
- Automate incrementally producing word pronunciation recordings for Wiktionary through Wikimedia Commons ☆22 Updated 6 years ago
- Download, convert and organize Gutenberg books for eBook Readers ☆46 Updated 5 years ago
- A parser and autocorrection tool for Wiktionary. ☆39 Updated 9 years ago
- Multilingual Language Modeling Toolkit ☆11 Updated 7 years ago
- Lightning Fast Language Prediction 🚀 ☆165 Updated 5 years ago
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling… ☆15 Updated last year
- Automatically exported from code.google.com/p/guess-language ☆53 Updated 11 months ago
- Python bindings to the Compact Language Detector ☆33 Updated 4 years ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆34 Updated last year
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more. ☆98 Updated 4 years ago
- Tools for tracking stories on news homepages ☆48 Updated 5 years ago
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more ☆21 Updated 8 years ago
- All the reports and data powering http://weekly.hatnote.com ☆12 Updated this week
- Modernized version of Eric Brill's Part Of Speech tagger. ☆17 Updated last year
- Stylometric framework in Python ☆13 Updated 9 years ago
- 🇪🇺 Resources and Learning Games for European Romance Language Communication ☆20 Updated 7 years ago
- An ultra-simple example of how to use Python to write stories based on a set of data. ☆29 Updated 11 years ago
- A PDF classifier ensemble with REST API service ☆23 Updated 3 years ago
- Pipeline for distributed Natural Language Processing, made in Python ☆65 Updated 7 years ago
- Basic dataset for the linguistic data collection. ☆15 Updated 7 years ago