raduangelescu / gutenbergpyLinks
Gutenberg cache and query library
☆42Updated last year
Alternatives and similar repositories for gutenbergpy
Users that are interested in gutenbergpy are comparing it to the libraries listed below
Sorting:
- A simple interface to the Project Gutenberg corpus.☆330Updated 2 years ago
- a python package for cleaning Gutenberg books and dataset☆34Updated 6 months ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆113Updated 7 years ago
- An open etymology dataset created using Wiktionary data. Contains 3.8M entries, 1.8M terms, 2900 languages, and 31 unique relationship ty…☆133Updated last year
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
- Poetic processing, for Python.☆42Updated last year
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆34Updated 2 years ago
- Find legal citations in any block of text☆178Updated last month
- ☆55Updated last year
- Reference datasets for folktale motifs, tale types, and annotated texts☆15Updated 5 months ago
- tool for collectively summarizing large discussions☆145Updated 2 years ago
- Verb forms dictionary☆67Updated 8 years ago
- poetry from dirty ocr☆62Updated 4 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆78Updated last week
- Scraper for downloading the entire ebooks repository of project Gutenberg☆152Updated last week
- ☆110Updated last month
- This repository makes available the Talk of Norway (ToN) dataset, a collection of Norwegian parliament speeches from 1998 to 2016. Every …☆31Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Python parser for the Archie Markup Language (ArchieML)☆12Updated 3 years ago
- A textual corpus database for the digital humanities.☆62Updated 5 years ago
- A place for me to share VisiData plugins I've written.☆39Updated 4 years ago
- spaCy extension for Visual Studio Code☆31Updated 7 months ago
- ☆73Updated last week
- linguistics backend☆41Updated 2 years ago
- Plot suggestions for writers of creative fiction☆140Updated last year
- Automatically exported from code.google.com/p/guess-language☆53Updated 2 weeks ago
- QnA Markup editor and interpreter.☆51Updated 3 years ago
- ☆73Updated 2 years ago
- Group thousands of similar spreadsheet or database text entries in seconds☆157Updated 2 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago