raduangelescu / gutenbergpy
Gutenberg cache and query library
☆35 Updated 6 months ago
Alternatives and similar repositories for gutenbergpy:
Users that are interested in gutenbergpy are comparing it to the libraries listed below
- A Python package for cleaning Gutenberg books and dataset ☆34 Updated last year
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4. ☆32 Updated last year
- Multilingual syllable annotation pipeline component for spaCy ☆39 Updated last year
- Poetic processing, for Python. ☆40 Updated 9 months ago
- ☆67 Updated 11 months ago
- WordWanderer – take your text for a walk ☆12 Updated 5 years ago
- ☆54 Updated last year
- Explore your own text collection with a topic model – without prior knowledge. ☆62 Updated last month
- JSON representation of the Zotero data model ☆52 Updated 2 weeks ago
- A textual corpus database for the digital humanities. ☆60 Updated 4 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆104 Updated 6 years ago
- 🌸 Train floret vectors ☆18 Updated last year
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated 11 months ago
- Scrollership through 20m pubmed abstracts. ☆26 Updated last year
- Free-for-all repository of TEI and plain text files for you (to do cool stuff) provided by the Digital Collections Services group at the … ☆27 Updated 7 years ago
- Python-based Wikidata framework for easy dataframe extraction ☆41 Updated last year
- ☆84 Updated this week
- Interactive Visualization Interface for Multidimensional Datasets ☆56 Updated 2 weeks ago
- This is a collection of sentence-level aligned Sanskrit-Tibetan Etexts. ☆14 Updated 2 years ago
- Inspect a URL and estimate if it contains a news story ☆39 Updated 2 months ago
- The official repository for the Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels. ☆39 Updated last year
- A simple interface to the Project Gutenberg corpus. ☆324 Updated 2 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages. ☆20 Updated 2 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆163 Updated 8 months ago
- ☆27 Updated last week
- Easy PDF to text to spaCy text extraction in Python. ☆38 Updated 4 months ago
- ☆55 Updated last year
- Scraper for downloading the entire ebooks repository of Project Gutenberg ☆140 Updated 3 months ago