raduangelescu / gutenbergpyLinks
Gutenberg cache and query library
☆42Updated last week
Alternatives and similar repositories for gutenbergpy
Users that are interested in gutenbergpy are comparing it to the libraries listed below
Sorting:
- a python package for cleaning Gutenberg books and dataset☆34Updated 6 months ago
- A simple interface to the Project Gutenberg corpus.☆330Updated 2 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆114Updated 7 years ago
- Find legal citations in any block of text☆182Updated last month
- tool for collectively summarizing large discussions☆145Updated 2 years ago
- Poetic processing, for Python.☆42Updated last year
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆137Updated last year
- ☆113Updated 2 weeks ago
- A database of court reporters, tests and other experiments☆117Updated this week
- Verb forms dictionary☆67Updated 8 years ago
- Reading legal authority for the last time☆41Updated 8 months ago
- An open-source archive that gathers, saves, shares and analyzes news homepages☆148Updated 3 weeks ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆65Updated last week
- Python wrapper library for the Datamuse API☆81Updated 2 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus☆203Updated last year
- Export Airtable data to YAML, JSON or SQLite files on disk☆129Updated last year
- Python based Wikidata framework for easy dataframe extraction☆45Updated 2 years ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆153Updated this week
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆287Updated 8 months ago
- Add website scraping abilities to Datasette☆66Updated 2 years ago
- H2O is a web app for creating and reading open educational resources, primarily in the legal field☆43Updated last month
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆56Updated 4 years ago
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆224Updated 2 years ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆34Updated 2 years ago
- A collection of regular expressions for matching citations to state, federal, and even international law☆40Updated 4 years ago
- linguistics backend☆42Updated 2 years ago
- A Node.js-based server to run Zotero translators☆135Updated this week
- Inspect a URL and estimate if it contains a news story☆39Updated 3 weeks ago
- A place for me to share VisiData plugins I've written.☆39Updated 4 years ago