hugovk / gutenberg-metadata
Metadata from Project Gutenberg
☆41Updated 2 months ago
Alternatives and similar repositories for gutenberg-metadata:
Users that are interested in gutenberg-metadata are comparing it to the libraries listed below
- a python package for cleaning Gutenberg books and dataset☆34Updated last year
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆38Updated last year
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- Practical Approaches to Data Science with Text☆39Updated 5 years ago
- Python package for stylometry☆61Updated 3 years ago
- ☆30Updated 8 years ago
- A simple interface to the Project Gutenberg corpus.☆325Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- A simple web application for searching Word2Vec embeddings derived from approximately 2,000 law reports published by the The Incorporated…☆26Updated 2 years ago
- Scripts for scraping metadata from Project Gutenberg books, via GITenberg.☆19Updated 6 years ago
- Finds linguistic patterns effortlessly☆35Updated last year
- Python 3 library for processing historical English☆66Updated 7 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- Analysis of gutenberg dataset☆43Updated 6 years ago
- Text Re-use Alignment Visualization☆38Updated 7 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo…☆107Updated 6 years ago
- The curation repository for the data behind Concepticon.☆38Updated last month
- PoKi: A Large Dataset of Poems by Children☆35Updated last month
- Legal Reference Extraction☆29Updated 7 months ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆95Updated 3 years ago
- This is a public repository for sharing, improving, and versioning "The Topic Modeling Game," a lesson developed by Lisa Rhody to teach t…☆10Updated 6 years ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆13Updated 5 years ago
- Berkeley DLab Python Intensive May 23-26☆28Updated 8 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆29Updated 3 years ago
- wrapper for the crossref events api☆21Updated last year
- Examples for getting started using https://case.law☆65Updated 2 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- A textual corpus database for the digital humanities.☆61Updated 4 years ago