raduangelescu / gutenbergpyLinks
Gutenberg cache and query library
☆39Updated last year
Alternatives and similar repositories for gutenbergpy
Users that are interested in gutenbergpy are comparing it to the libraries listed below
Sorting:
- a python package for cleaning Gutenberg books and dataset☆34Updated 4 months ago
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
- A simple interface to the Project Gutenberg corpus.☆329Updated 2 years ago
- Reference datasets for folktale motifs, tale types, and annotated texts☆15Updated 3 months ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆34Updated 2 years ago
- Poetic processing, for Python.☆42Updated last year
- ☆73Updated 2 years ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆130Updated last year
- tool for collectively summarizing large discussions☆145Updated 2 years ago
- ☆211Updated 4 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus☆196Updated last year
- ☆104Updated 2 weeks ago
- eXtensible Interlinear Glossed Text☆33Updated 3 years ago
- LLM plugin for embeddings using sentence-transformers☆70Updated 4 months ago
- Text Corpus of African American Fiction and Poetry, from 1853-1923☆10Updated 5 years ago
- A Node.js-based server to run Zotero translators☆133Updated 6 months ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆110Updated 6 years ago
- Add website scraping abilities to Datasette☆64Updated 2 years ago
- Use visual programming to build data tables based on text data within the Orange data mining software environment☆29Updated 2 months ago
- ☆55Updated last year
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆223Updated 2 years ago
- Web service to generate citations and bibliographies using citeproc-js☆63Updated 3 weeks ago
- Source files for "An Introduction to VisiData"☆74Updated 6 months ago
- AnyStyle Command Line Interface☆60Updated 3 months ago
- A Python scraper for Goodreads books and reviews.☆296Updated 6 months ago
- A textual corpus database for the digital humanities.☆61Updated 5 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 3 weeks ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆152Updated last month
- Tracking the history of trees in San Francisco☆46Updated last week
- Source files for the Open, Transparent, and Reproducible Data Science Handbook☆49Updated 2 months ago