raduangelescu / gutenbergpy
Gutenberg cache and query library
☆39 Updated last year
Alternatives and similar repositories for gutenbergpy
Users who are interested in gutenbergpy are comparing it to the libraries listed below.
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts. ☆15 Updated 3 years ago
- Poetic processing, for Python. ☆42 Updated last year
- ☆107 Updated last month
- a python package for cleaning Gutenberg books and dataset ☆34 Updated 4 months ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023 ☆19 Updated 2 years ago
- Frontend for Korp, a tool using the IMS Open Corpus Workbench (CWB). ☆16 Updated this week
- Reference datasets for folktale motifs, tale types, and annotated texts ☆15 Updated 4 months ago
- Source files for "An Introduction to VisiData" ☆74 Updated 7 months ago
- tool for collectively summarizing large discussions ☆145 Updated 2 years ago
- Inspect a URL and estimate if it contains a news story ☆39 Updated 10 months ago
- A Node.js-based server to run Zotero translators ☆134 Updated 7 months ago
- ☆210 Updated 4 years ago
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every … ☆31 Updated 2 years ago
- Abbreviations for use with the Abbreviation Filter developed for use with Multilingual Zotero. ☆18 Updated last year
- A simple interface to the Project Gutenberg corpus. ☆330 Updated 2 years ago
- JSON representation of the Zotero data model ☆57 Updated 7 months ago
- linguistics backend ☆41 Updated 2 years ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4. ☆34 Updated 2 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆111 Updated 7 years ago
- AnyStyle Command Line Interface ☆60 Updated 4 months ago
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support. ☆286 Updated 6 months ago
- Python wrapper library for the Datamuse API ☆80 Updated 2 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists. ☆17 Updated 3 weeks ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty… ☆130 Updated last year
- Text Corpus of African American Fiction and Poetry, from 1853-1923 ☆10 Updated 5 years ago
- An open-source archive that gathers, saves, shares and analyzes news homepages ☆144 Updated 3 weeks ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆53 Updated 4 years ago
- A Python scraper for Goodreads books and reviews. ☆297 Updated 7 months ago
- Simple command line tool for quickly analysing the structure of an arbitrary XML file ☆34 Updated 2 years ago
- Extract networks of entities from journalistic reporting ☆48 Updated 2 years ago