Bookworm-project / BookwormDBLinks
Tools for text tokenization and encoding
☆84Updated 3 years ago
Alternatives and similar repositories for BookwormDB
Users that are interested in BookwormDB are comparing it to the libraries listed below
Sorting:
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- ☆19Updated 8 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- A push-button Digital Humanities laboratory.☆127Updated 6 years ago
- A textual corpus database for the digital humanities.☆62Updated 4 years ago
- An API implementing a grammar for text analysis☆13Updated 9 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆108Updated 4 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js☆55Updated 5 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- Documentation for Bookworm: particularly focusing on creation aspects -☆10Updated 8 years ago
- GUI for a Bookworm web app☆15Updated 4 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Personal modeling application for Linked Data.☆26Updated 6 years ago
- Free-for-all repository of TEI and plain text files for you (to do cool stuff) provided by the Digital Collections Services group at the …☆27Updated 8 years ago
- Amsterdam Content Analysis Toolkit☆46Updated 2 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆108Updated 9 years ago
- ☆10Updated 9 years ago
- Using social media to steer web archiving and curation.☆15Updated 9 years ago
- The what (and how) digital humanities and news nerds want to explore together☆64Updated 9 years ago
- A design prototype for DocNow to learn with☆14Updated 8 years ago
- This is a public repository for sharing, improving, and versioning "The Topic Modeling Game," a lesson developed by Lisa Rhody to teach t…☆10Updated 7 years ago
- Humanities Data Curation Record☆11Updated 7 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- An experiment to standardize individual donor names in campaign finance data using simple graph theory and machine learning.☆65Updated 12 years ago
- Python implementation of the Zeta score for contrastive text analysis☆14Updated 3 years ago
- A command-line program to download text corpora.☆34Updated 7 years ago
- The Open Scholarly Edition of James Joyce's A Portrait of the Artist as a Young Man☆20Updated 6 years ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago