Anterotesis / historical-textsLinks
Collections of english historical texts and data relating to them
☆18Updated 4 years ago
Alternatives and similar repositories for historical-texts
Users that are interested in historical-texts are comparing it to the libraries listed below
Sorting:
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Tools for TICCL☆14Updated last month
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Tools for tracking stories on news homepages☆48Updated 5 years ago
- A design prototype for DocNow to learn with☆14Updated 8 years ago
- SerendipSlim is a visualization tool for exploring topic models built on large collections of text documents.☆39Updated 7 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Various functions to make bag-of-words approaches to text analysis more user-friendly☆24Updated 8 years ago
- An R package for analysis of dramatic texts☆15Updated 2 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- Humanities Data Curation Record☆11Updated 8 years ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 8 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- A web-based, token-level annotation tool for non-standard language data☆10Updated 4 years ago
- ☆25Updated 8 years ago
- “Open terminals”, “load CSVs”, “start hacking”☆15Updated 8 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆29Updated 2 years ago
- Open Access PDF harvester☆40Updated last year
- A powerful, tagset-independent and theory-neutral meta model and API for storing, manipulating, and representing nearly all types of ling…☆15Updated 2 years ago
- a repository to help introduce and orient students to the GitHub collaboration environment, and to support DH classes.☆27Updated 4 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 5 years ago
- Berkeley DLab Python Intensive May 23-26☆28Updated 9 years ago
- Basic dataset for the linguistic data collection.☆15Updated 8 years ago
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆108Updated 10 years ago
- Personal nouns assembled from the 1890 Webster's Unabridged Dictionary.☆9Updated 8 years ago
- A Twitter data collection and appraisal application.☆51Updated 2 years ago
- Original 2016 take at what is now Linked Paths, the demonstrator for GeoJSON-T developed under a Pelagios micro-grant☆89Updated 8 years ago