hugovk / gutenberg-metadata
Metadata from Project Gutenberg
☆41 Updated 2 months ago
Alternatives and similar repositories for gutenberg-metadata
Users who are interested in gutenberg-metadata are comparing it to the libraries listed below.
- Citation Classification using hybrid neural network model for Wikipedia References ☆31 Updated 3 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts ☆28 Updated 4 years ago
- Python tools for interacting with Wikidata ☆159 Updated 2 years ago
- Practical Approaches to Data Science with Text ☆39 Updated 6 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800) ☆23 Updated last year
- System for building, visualizing, and working with LDA topic models ☆97 Updated this week
- FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (inclu… ☆66 Updated last week
- Text Re-use Alignment Visualization ☆38 Updated 8 years ago
- A deep learning model for extracting references from text ☆30 Updated 2 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization ☆13 Updated 8 years ago
- Pipeline to generate the Standardized Project Gutenberg Corpus ☆203 Updated last year
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io ☆146 Updated 6 months ago
- A Python library for topic modeling and visualization ☆67 Updated 5 years ago
- A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books fo… ☆114 Updated 7 years ago
- A simple interface to the Project Gutenberg corpus. ☆330 Updated 2 years ago
- A Named-Entity Recogniser based on Grobid. ☆54 Updated 7 months ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆47 Updated this week
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents. ☆39 Updated 7 years ago
- Scripts that clean up OCR and munge Hathi metadata. ☆77 Updated 8 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆39 Updated 3 years ago
- Berkeley DLab Python Intensive May 23-26 ☆28 Updated 9 years ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine ☆77 Updated last week
- Wikidata embedding ☆51 Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated 2 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models ☆51 Updated 8 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆62 Updated last year
- A point-and-click tool for creating and analyzing topic models produced by MALLET. ☆112 Updated 4 years ago
- Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine. ☆74 Updated 8 years ago
- Project on the history of genre. ☆24 Updated 5 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate… ☆52 Updated 5 years ago