hugovk / gutenberg-metadata
Metadata from Project Gutenberg
☆41 · Updated last week
Alternatives and similar repositories for gutenberg-metadata
Users who are interested in gutenberg-metadata are comparing it to the libraries listed below.
- A deep learning model for extracting references from text ☆29 · Updated last year
- Explore your own text collection with a topic model – without prior knowledge. ☆63 · Updated 6 months ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine ☆71 · Updated last week
- Poetic processing, for Python. ☆42 · Updated last year
- Python tools for interacting with Wikidata ☆154 · Updated last year
- A deep learning architecture for reference mining from literature in the arts and humanities. ☆16 · Updated 5 years ago
- Wrapper for DKPro Core to extract linguistic information from books. ☆16 · Updated 3 years ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆30 · Updated 2 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g… ☆113 · Updated 5 months ago
- An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4. ☆34 · Updated 2 years ago
- A gold-standard dataset of software mentions in research publications. ☆37 · Updated last year
- A Python library for topic modeling and visualization ☆65 · Updated 4 years ago
- A Python package for cleaning Gutenberg books and datasets ☆34 · Updated 2 months ago
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆49 · Updated 2 years ago
- A Named-Entity Recogniser based on Grobid. ☆55 · Updated 2 months ago
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆65 · Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 · Updated 2 years ago
- The official repository for the Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels. ☆39 · Updated last year
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆109 · Updated 6 years ago
- Libraries, Archives and Museums (LAM) ☆84 · Updated 2 years ago
- Text Re-use Alignment Visualization ☆38 · Updated 7 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models ☆51 · Updated 8 years ago
- Textstelle is a collection of corpora for the creation of bots and other things that generate text 🤖 ☆20 · Updated 3 years ago
- This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c… ☆32 · Updated 6 years ago
- The GitHub repository containing all the material related to the Computational Thinking and Programming course of the Digital Humanities … ☆30 · Updated 5 years ago
- Analysis of gutenberg dataset ☆45 · Updated 6 years ago
- ☆30 · Updated 8 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene ☆113 · Updated this week
- Discourse Analysis Tool Suite ☆29 · Updated this week
- Interactive TOpic Model and MEtadata Visualization. Live at: tome.lmc.gatech.edu ☆13 · Updated 6 years ago