dbamman / book-nlpView external linksLinks
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
☆315Feb 4, 2022Updated 4 years ago
Alternatives and similar repositories for book-nlp
Users that are interested in book-nlp are comparing it to the libraries listed below
Sorting:
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆369Dec 8, 2022Updated 3 years ago
- BookNLP, a natural language processing pipeline for books☆889Jul 31, 2024Updated last year
- relationship modeling networks (NAACL 2016)☆86Jan 25, 2021Updated 5 years ago
- A french litbank corpus☆10Jan 22, 2026Updated 3 weeks ago
- Course repo for Applied Natural Language Processing (Spring 2019)☆407Feb 2, 2022Updated 4 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆41Nov 29, 2021Updated 4 years ago
- The Digital Humanities Literacy Guidebook☆68Nov 11, 2022Updated 3 years ago
- (Mental) maps of texts with kernel density estimation and force-directed networks.☆108Jun 22, 2015Updated 10 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Jul 13, 2017Updated 8 years ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆34Feb 2, 2026Updated last week
- Code and data supporting "NovelTM Data Sets for English-Language Fiction."☆26Dec 22, 2020Updated 5 years ago
- Data and code for analyzing language associated with fictional characters.☆15Jan 6, 2018Updated 8 years ago
- MiTextExplorer - interactive browser of text and document covariates.☆24Jun 17, 2015Updated 10 years ago
- Official syllabus and course materials for English 184E: “Literary Text Mining” (Spring 2019)☆18Jul 15, 2020Updated 5 years ago
- Practical Approaches to Data Science with Text☆39Dec 6, 2019Updated 6 years ago
- Collection of tools for building diachronic/historical word vectors☆445Dec 18, 2023Updated 2 years ago
- A Python Twitter bot posting recently active questions from Stack Overflow. Tweaked to run on AWS Lambda.☆10Jan 14, 2020Updated 6 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29May 13, 2020Updated 5 years ago
- Digital Humanities Across Borders☆50Mar 21, 2024Updated last year
- A command-line program to download text corpora.☆34Aug 12, 2017Updated 8 years ago
- ☆10Apr 26, 2016Updated 9 years ago
- ☆10Jul 17, 2015Updated 10 years ago
- Neural network poetry rewriter☆21Feb 4, 2022Updated 4 years ago
- The Art of Literary Text Analysis☆168Apr 4, 2019Updated 6 years ago
- ☆47May 22, 2017Updated 8 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆77Nov 4, 2017Updated 8 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated last year
- An approximate nearest-neighbor search for text reuse.☆12Oct 5, 2020Updated 5 years ago
- An implementation of latent Dirichlet allocation in javascript☆185Aug 1, 2022Updated 3 years ago
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.☆319Sep 26, 2017Updated 8 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Jul 18, 2019Updated 6 years ago
- Tools for text tokenization and encoding☆84Sep 27, 2021Updated 4 years ago
- Code and data to support the article, "How quickly do literary standards change?"☆23Apr 27, 2018Updated 7 years ago
- Literature and Data - Spring 2016 Data Science Connector Course☆25Oct 25, 2024Updated last year
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆37Mar 26, 2019Updated 6 years ago
- ☆17Feb 14, 2018Updated 7 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆225Apr 27, 2023Updated 2 years ago
- Word generation based on n-gram models, and a cli utility to generate said models.☆17Sep 1, 2016Updated 9 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆113Mar 1, 2021Updated 4 years ago