raduangelescu / gutenbergpyLinks
Gutenberg cache and query library
☆38Updated 11 months ago
Alternatives and similar repositories for gutenbergpy
Users that are interested in gutenbergpy are comparing it to the libraries listed below
Sorting:
- Poetic processing, for Python.☆42Updated last year
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts.☆15Updated 3 years ago
- A textual corpus database for the digital humanities.☆62Updated 4 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆30Updated 3 weeks ago
- ☆100Updated last week
- Inspect a URL and estimate if it contains a news story☆39Updated 7 months ago
- Source files for "An Introduction to VisiData"☆73Updated 5 months ago
- Reference datasets for folktale motifs, tale types, and annotated texts☆13Updated last month
- Scraper for downloading the entire ebooks repository of project Gutenberg☆151Updated this week
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆109Updated 6 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 4 years ago
- Python wrapper library for the Datamuse API☆80Updated 2 years ago
- Dataset: BuzzFeed News “Trending” Strip, 2018–2023☆19Updated 2 years ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 9 months ago
- Free-for-all repository of TEI and plain text files for you (to do cool stuff) provided by the Digital Collections Services group at the …☆27Updated 8 years ago
- tool for collectively summarizing large discussions☆144Updated 2 years ago
- Automatically exported from code.google.com/p/guess-language☆53Updated last year
- Tool for the Automatic Analysis of Syntactic Sophistication and Complexity☆25Updated last year
- Datasette plugin for uploading CSV files and converting them to database tables☆27Updated last year
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.☆34Updated 2 years ago
- Tag news stories based on models trained on the NYT corpus.☆42Updated 2 years ago
- A command-line tool for interacting with books in git☆111Updated 11 months ago
- an experimental implementation of Burrow's delta in Python 3☆21Updated 3 years ago
- A maximum-strength name parser for record linkage.☆37Updated last month
- A place for me to share VisiData plugins I've written.☆37Updated 3 years ago
- Web service to generate citations and bibliographies using citeproc-js☆63Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Explore your own text collection with a topic model – without prior knowledge.☆63Updated 7 months ago