JonathanReeve / gitenberg-experiments
Scripts for scraping metadata from Project Gutenberg books, via GITenberg.
☆19 · Updated 6 years ago
Alternatives and similar repositories for gitenberg-experiments:
Users who are interested in gitenberg-experiments are comparing it to the repositories listed below:
- Named-Entity Recognition extension for Google Refine / OpenRefine ☆72 · Updated 7 years ago
- Interactive TOpic Model and MEtadata Visualization. Live at: tome.lmc.gatech.edu ☆13 · Updated 5 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts ☆27 · Updated 3 years ago
- System for building, visualizing, and working with LDA topic models ☆95 · Updated 3 weeks ago
- A Python library for topic modeling and visualization ☆65 · Updated 4 years ago
- Wrapper for DKPro Core to extract linguistic information from books. ☆16 · Updated 3 years ago
- A textual corpus database for the digital humanities. ☆61 · Updated 4 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description… ☆11 · Updated 2 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project ☆26 · Updated 3 years ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents. ☆39 · Updated 6 years ago
- Topic Modeling Workflow in Python ☆16 · Updated 2 years ago
- Berkeley DLab Python Intensive May 23-26 ☆28 · Updated 8 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET. ☆108 · Updated 4 years ago
- A Mashup Interface for Text Analysis Operations ☆13 · Updated 3 months ago
- Events and Situations Ontology ☆14 · Updated 6 years ago
- A temporal ordering system for events and time expressions in written text. ☆43 · Updated 3 years ago
- ☆19 · Updated 8 years ago
- Citation Classification using hybrid neural network model for Wikipedia References ☆28 · Updated 2 years ago
- Text Re-use Alignment Visualization ☆38 · Updated 7 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats) ☆25 · Updated 2 years ago
- Topic Words in Context (TWiC) is a highly interactive, browser-based visualization for MALLET topic models ☆51 · Updated 7 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms. ☆24 · Updated 3 years ago
- This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c… ☆31 · Updated 6 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents. ☆31 · Updated 7 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation ☆48 · Updated 2 years ago
- Practical Approaches to Data Science with Text ☆39 · Updated 5 years ago
- A tool for analyzing the word histories of a text. ☆34 · Updated 4 months ago
- A command-line program to download text corpora. ☆34 · Updated 7 years ago
- Python API for KB data-services ☆19 · Updated 5 years ago
- Scripts that clean up OCR and munge Hathi metadata. ☆76 · Updated 7 years ago