htrc / ht-text-prep
☆11Updated 7 months ago
Alternatives and similar repositories for ht-text-prep:
Users that are interested in ht-text-prep are comparing it to the libraries listed below
- Tools for working with HTRC Feature Extraction files☆39Updated last month
- Python SDK for Data API and Solr API access☆10Updated 8 months ago
- Teaching materials for the Applied Data Analysis course at DHOxSS. Data science methods to analyse humanities data.☆38Updated last year
- Code and data supporting "NovelTM Data Sets for English-Language Fiction."☆23Updated 4 years ago
- Project on the history of genre.☆22Updated 4 years ago
- ☆28Updated 3 years ago
- Early Novels Database dataset☆16Updated 6 years ago
- An R package for analysis of dramatic texts☆15Updated 2 years ago
- The GitHub repository for the AI for Humanists Project☆18Updated 11 months ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Digital Humanities Across Borders☆47Updated 10 months ago
- The Digital Humanities Literacy Guidebook☆62Updated 2 years ago
- High-performance text aligner for large collections of texts☆47Updated 3 months ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆107Updated 3 years ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- A collection of Jupyter notebooks in many human and computer languages for doing digital humanities. PRs welcome!☆125Updated last year
- Practical Approaches to Data Science with Text☆39Updated 5 years ago
- A textual corpus database for the digital humanities.☆60Updated 4 years ago
- Official syllabus and course materials for English 184E: “Literary Text Mining” (Spring 2019)☆18Updated 4 years ago
- the EEBO TCP texts☆33Updated 6 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- Text collections made available by the CLiGS group.☆22Updated 2 years ago
- Sharable scripts and stylesheets from the Northeastern University Women Writers Project☆23Updated 2 months ago
- Schema for modelling parliamentary debates☆21Updated 2 years ago
- Data and code to support Distant Horizons (University of Chicago Press, 2019).☆11Updated 5 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- I created this repository to provide the DH Community a compilation of free, open-source tools for creating and developing digital humani…☆32Updated last year
- The Reference Stylesheets developed and released by EpiDoc for use with XML documents following the EpiDoc schema.☆16Updated this week
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆39Updated 3 years ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Updated 3 years ago