raduangelescu / gutenbergpyLinks
Gutenberg cache and query library
☆46Updated 2 months ago
Alternatives and similar repositories for gutenbergpy
Users that are interested in gutenbergpy are comparing it to the libraries listed below
Sorting:
- tool for collectively summarizing large discussions☆145Updated 3 years ago
- A simple interface to the Project Gutenberg corpus.☆330Updated 3 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆114Updated 7 years ago
- Poetic processing, for Python.☆42Updated last year
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
- Verb forms dictionary☆70Updated 8 years ago
- The WordSeer text analysis tool, written in Flask.☆46Updated 9 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆58Updated 4 years ago
- A Node.js-based server to run Zotero translators☆140Updated 2 months ago
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity☆31Updated 2 years ago
- JSON representation of the Zotero data model☆63Updated 3 weeks ago
- Inspect a URL and estimate if it contains a news story☆39Updated this week
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆147Updated last year
- Scraper for downloading the entire ebooks repository of project Gutenberg☆155Updated last week
- QnA Markup editor and interpreter.☆50Updated 4 years ago
- python library to validate, clean, transform and get metadata of ISBN strings (for devs).☆273Updated last year
- Find legal citations in any block of text☆208Updated 4 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆18Updated 2 years ago
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆292Updated 10 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆67Updated last week
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆34Updated 2 years ago
- Decompose, transform, and recombine prose into mutated forms.☆12Updated last year
- AnyStyle Command Line Interface☆62Updated 9 months ago
- Sanskrit Tibetan Parallel Dataset☆11Updated 7 months ago
- Add website scraping abilities to Datasette☆66Updated 2 years ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- A database of court reporters, tests and other experiments☆122Updated this week
- ☆210Updated 4 years ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- linguistics backend☆42Updated 2 years ago