puntonim / gutenberg-bulk-downloader
Bulk downloader for free ebooks hosted at Project Gutenberg
☆17 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for gutenberg-bulk-downloader
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr… ☆65 · Updated this week
- Wiktionary parser tool for many language editions. ☆53 · Updated 2 years ago
- python-timbl, originally developed by Sander Canisius, is a Python extension module wrapping the full TiMBL C++ programming interface. Wi… ☆18 · Updated 3 weeks ago
- API for WOLF, a free French WordNet ☆13 · Updated 6 years ago
- Basic dataset for the linguistic data collection. ☆15 · Updated 7 years ago
- Pipeline for distributed Natural Language Processing, made in Python