htrc / ht-text-prep
☆11Updated 10 months ago
Alternatives and similar repositories for ht-text-prep:
Users that are interested in ht-text-prep are comparing it to the libraries listed below
- Tools for working with HTRC Feature Extraction files☆39Updated 3 months ago
- Python SDK for Data API and Solr API access☆10Updated 11 months ago
- Text collections made available by the CLiGS group.☆23Updated 3 years ago
- Project on the history of genre.☆22Updated 5 years ago
- ☆19Updated 8 years ago
- Code and data supporting "NovelTM Data Sets for English-Language Fiction."☆24Updated 4 years ago
- This is a public repository for sharing, improving, and versioning "The Topic Modeling Game," a lesson developed by Lisa Rhody to teach t…☆10Updated 6 years ago
- Teaching materials for the Applied Data Analysis course at DHOxSS. Data science methods to analyse humanities data.☆40Updated last year
- The GitHub repository for the AI for Humanists Project☆18Updated last year
- Download and manipulate HathiTrust wordcount data in the tidyverse☆9Updated 3 years ago
- Umbrella repository that describes the collections contained in any given release of ELTeC☆13Updated 3 years ago
- An R package for analysis of dramatic texts☆15Updated 2 years ago
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- Within-book topic modeling on HTRC feature extraction files☆23Updated 8 years ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- A tool to extract canonical references from text.☆20Updated 3 years ago
- This repository provides bulk download materials for potential collections as data projects, with materials adapted and extended from the…☆10Updated 5 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆40Updated 3 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Early Novels Database dataset☆16Updated 6 years ago
- A textual corpus database for the digital humanities.☆62Updated 4 years ago
- the EEBO TCP texts☆34Updated 7 years ago
- The Digital Humanities Literacy Guidebook☆66Updated 2 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- A Mashup Interface for Text Analysis Operations☆13Updated 3 months ago
- ☆33Updated 10 months ago
- High-performance text aligner for large collections of texts☆51Updated 5 months ago
- Digital Humanities Across Borders☆47Updated last year
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 3 years ago