Bookworm-project / BookwormDB
Tools for text tokenization and encoding
☆84Updated 3 years ago
Alternatives and similar repositories for BookwormDB:
Users that are interested in BookwormDB are comparing it to the libraries listed below
- ☆19Updated 8 years ago
- A textual corpus database for the digital humanities.☆61Updated 4 years ago
- A push-button Digital Humanities laboratory.☆126Updated 6 years ago
- The what (and how) digital humanities and news nerds want to explore together☆64Updated 9 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- Free-for-all repository of TEI and plain text files for you (to do cool stuff) provided by the Digital Collections Services group at the …☆27Updated 7 years ago
- A digital humanities operating system that runs on a USB disk.☆31Updated 7 years ago
- Humanities Data Curation Record☆11Updated 7 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆108Updated 4 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Documents for the project Libraccess☆13Updated 10 years ago
- Tools for working with HTRC Feature Extraction files☆39Updated 3 months ago
- ☆10Updated 8 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆108Updated 9 years ago
- Personal modeling application for Linked Data.☆26Updated 6 years ago
- An API implementing a grammar for text analysis☆13Updated 9 years ago
- Using social media to steer web archiving and curation.☆15Updated 9 years ago
- Within-book topic modeling on HTRC feature extraction files☆23Updated 8 years ago
- Original 2016 take at what is now Linked Paths, the demonstrator for GeoJSON-T developed under a Pelagios micro-grant☆89Updated 8 years ago
- Data conversions and examples for generating reports from twarc collections using tools such as D3.js☆55Updated 4 years ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Code and data to support the article, "How quickly do literary standards change?"☆22Updated 6 years ago
- LOD-enabled version of OpenRefine. (This project is not actively maintained anymore)☆61Updated 5 years ago
- Download and manipulate HathiTrust wordcount data in the tidyverse☆9Updated 3 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.☆86Updated last year
- Warcbase is an open-source platform for managing analyzing web archives☆162Updated 7 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- ☆12Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago