tedunderwood / DataMunging
Scripts that clean up OCR and munge Hathi metadata.
☆75Updated 7 years ago
Alternatives and similar repositories for DataMunging:
Users that are interested in DataMunging are comparing it to the libraries listed below
- Tools for working with HTRC Feature Extraction files☆39Updated 3 weeks ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Digital Humanities Across Borders☆47Updated 9 months ago
- An R package for analysis of dramatic texts☆15Updated 2 years ago
- Project on the history of genre.☆22Updated 4 years ago
- A textual corpus database for the digital humanities.☆60Updated 4 years ago
- Detect and align similar passages☆92Updated last month
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆24Updated 2 years ago
- ☆19Updated 7 years ago
- Code and data to support the article, "How quickly do literary standards change?"☆22Updated 6 years ago
- Early Novels Database dataset