hugovk / gutenberg-metadata
Metadata from Project Gutenberg
☆41Updated last week
Alternatives and similar repositories for gutenberg-metadata:
Users that are interested in gutenberg-metadata are comparing it to the libraries listed below
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 2 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆103Updated 6 years ago
- Poetic processing, for Python.☆40Updated 8 months ago
- Text Re-use Alignment Visualization☆38Updated 7 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated 2 years ago
- Official releases of the PROIEL treebank of ancient Indo-European languages☆36Updated last year
- ☆29Updated 7 years ago
- TEI Reader Python Library☆17Updated last year
- Discourse Analysis Tool Suite☆18Updated this week
- Analysis of gutenberg dataset☆42Updated 6 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆75Updated 7 years ago
- In-browser OCR of Ancient Greek and Latin☆25Updated 2 months ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 7 years ago
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆39Updated last year
- Python 3 library for processing historical English☆64Updated 5 months ago
- A textual corpus database for the digital humanities.☆60Updated 4 years ago
- The Tesserae project aims to provide a flexible and robust web interface for exploring intertextual parallels. Select two poems below to …☆30Updated 2 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- Latin BERT☆58Updated 6 months ago
- NERD and wiKIData (NERD KID) is a machine learning application for classifying Wikidata items into 27 classes (as defined by the Grobid-…☆8Updated last year
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated 8 months ago
- A tool for analyzing the word histories of a text.☆34Updated last month
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu…☆60Updated 8 months ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆31Updated 4 months ago
- A simple interface to the Project Gutenberg corpus.☆323Updated 2 years ago
- Wikidata authority file mapping tool☆11Updated 6 years ago
- Detect and align similar passages☆92Updated last month
- a python package for cleaning Gutenberg books and dataset☆32Updated last year
- PoKi: A Large Dataset of Poems by Children☆34Updated 4 years ago