Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
☆317Feb 4, 2022Updated 4 years ago
Alternatives and similar repositories for book-nlp
Users that are interested in book-nlp are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆376Dec 8, 2022Updated 3 years ago
- Download and manipulate HathiTrust wordcount data in the tidyverse☆10Jan 31, 2022Updated 4 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆41Nov 29, 2021Updated 4 years ago
- ☆35Feb 4, 2022Updated 4 years ago
- relationship modeling networks (NAACL 2016)☆86Jan 25, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A french litbank corpus☆10Jan 22, 2026Updated 4 months ago
- Course repo for Applied Natural Language Processing (Spring 2019)☆407Feb 2, 2022Updated 4 years ago
- Data and code for analyzing language associated with fictional characters.☆15Jan 6, 2018Updated 8 years ago
- Data and code to support Distant Horizons (University of Chicago Press, 2019).☆12Feb 28, 2019Updated 7 years ago
- Code and data supporting "NovelTM Data Sets for English-Language Fiction."☆26Dec 22, 2020Updated 5 years ago
- An approximate nearest-neighbor search for text reuse.☆12Oct 5, 2020Updated 5 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆49Jul 13, 2017Updated 8 years ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆36Feb 2, 2026Updated 4 months ago
- A simple vector space model based tool for sentiment analysis of literary texts☆18Sep 17, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- MiTextExplorer - interactive browser of text and document covariates.☆24Jun 17, 2015Updated 10 years ago
- The Art of Literary Text Analysis☆170Apr 4, 2019Updated 7 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆115Mar 1, 2021Updated 5 years ago
- Digital Humanities Across Borders☆51Mar 21, 2024Updated 2 years ago
- Practical Approaches to Data Science with Text☆40Dec 6, 2019Updated 6 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29May 13, 2020Updated 6 years ago
- Cornell INFO 3350: Text mining for history and literature, Fall 2020☆11Jan 14, 2021Updated 5 years ago
- Official syllabus and course materials for English 184E: “Literary Text Mining” (Spring 2019)☆18Jul 15, 2020Updated 5 years ago
- Visual Text Analytics for Digital Humanities☆17Apr 22, 2015Updated 11 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Collection of tools for building diachronic/historical word vectors☆449Dec 18, 2023Updated 2 years ago
- Code and data to support the article, "How quickly do literary standards change?"☆23Apr 27, 2018Updated 8 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆24Jul 18, 2019Updated 6 years ago
- A Python tool to pull the complete edit history of a Wikipedia page☆21Apr 21, 2026Updated last month
- Word generation based on n-gram models, and a cli utility to generate said models.☆17Sep 1, 2016Updated 9 years ago
- A Python Twitter bot posting recently active questions from Stack Overflow. Tweaked to run on AWS Lambda.☆10Jan 14, 2020Updated 6 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆28Mar 21, 2022Updated 4 years ago
- An implementation of latent Dirichlet allocation in javascript☆186Aug 1, 2022Updated 3 years ago
- A textual corpus database for the digital humanities.☆64Jul 26, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Take a MALLET to disciplinary history☆99Jul 11, 2022Updated 3 years ago
- Collect, discuss and manage feedback on OntoME☆12Dec 7, 2023Updated 2 years ago
- Text generation with entities as context☆30Jun 13, 2018Updated 8 years ago
- Tool to convert JSON formatted discussion posts on Canvas LMS into HTML files - similar to saving student text-entry assignments☆13May 20, 2022Updated 4 years ago
- Jekyll-based static site for The Programming Historian☆545Updated this week
- ☆16Apr 9, 2019Updated 7 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆128Jun 14, 2021Updated 5 years ago