DH-Box / corpus-downloader
A command-line program to download text corpora.
☆33Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for corpus-downloader
- A textual corpus database for the digital humanities.☆59Updated 4 years ago
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- Practical Approaches to Data Science with Text☆38Updated 4 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆62Updated 3 weeks ago
- The Art of Literary Text Analysis☆163Updated 5 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆106Updated 3 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- Course materials for Introduction to Computational Literary Analysis, taught at UC Berkeley in Summer 2018, 2019, and 2020, at Columbia U…☆89Updated 2 years ago
- Project on the history of genre.☆22Updated 4 years ago
- Named Entities Recognition Annotator Tool for Europeana Newspapers☆60Updated 6 years ago
- the EEBO TCP texts☆32Updated 6 years ago
- Topic Modeling Workflow in Python☆16Updated last year
- ☆28Updated 3 years ago
- Text Re-use Alignment Visualization☆37Updated 7 years ago
- Detect and align similar passages☆88Updated 2 months ago
- ☆19Updated 7 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- Python implementation of the Zeta score for contrastive text analysis☆14Updated 3 years ago
- Digital Humanities Across Borders☆46Updated 8 months ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- System for building, visualizing, and working with LDA topic models☆92Updated 3 weeks ago
- Scripts that clean up OCR and munge Hathi metadata.☆74Updated 7 years ago
- Quantitative Text Analysis for the digitale Geisteswissenschaften☆47Updated 9 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- This is a public repository for sharing, improving, and versioning "The Topic Modeling Game," a lesson developed by Lisa Rhody to teach t…☆9Updated 6 years ago
- ENGL 87400 - Text Transformations (Graduate Center, CUNY - Spring 2015)☆12Updated 9 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆39Updated 2 years ago