dbamman / book-nlpLinks
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
☆315Updated 3 years ago
Alternatives and similar repositories for book-nlp
Users that are interested in book-nlp are comparing it to the libraries listed below
Sorting:
- Collection of tools for building diachronic/historical word vectors☆434Updated last year
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities.☆358Updated 2 years ago
- A point-and-click tool for creating and analyzing topic models produced by MALLET.☆108Updated 4 years ago
- Socially-Equitable Language Identification☆78Updated 2 years ago
- Various utilities for processing the data.☆209Updated this week
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- Take a MALLET to disciplinary history☆99Updated 2 years ago
- A command-line program to download text corpora.☆34Updated 7 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆56Updated 3 weeks ago
- The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts☆139Updated 2 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus☆147Updated 3 years ago
- Practical Natural Language Processing Tools for Humans. Dependency Parsing, Syntactic Constituent Parsing, Semantic Role Labeling, Named …☆193Updated 7 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated 2 years ago
- A Python wrapper around the topic modeling functions of MALLET.☆103Updated 7 months ago
- The Art of Literary Text Analysis☆166Updated 6 years ago
- PredPatt: Predicate-Argument Extraction from Universal Dependencies☆112Updated 4 years ago
- A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.☆315Updated 7 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- Retrofitting Word Vectors to Semantic Lexicons☆375Updated 6 years ago
- Lexicon of frame files used by Propbank annotation. A searchable, readable version of the latest release is here: http://propbank.github…☆100Updated this week
- ☆55Updated 9 years ago
- An NLP processing pipeline for characters in fanfiction. Developed by students at Carnegie Mellon University from 2019-2021.☆33Updated 9 months ago
- Package for Statistically significant linguistic change☆56Updated 2 years ago
- A toolkit for corpus linguistics☆204Updated 6 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆194Updated 4 years ago
- Digital Humanities Across Borders☆48Updated last year
- OpenCCG library for parsing and realization with CCG☆211Updated 4 years ago
- Software and resources for natural language processing.☆131Updated 8 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- A library for topic modeling and browsing☆89Updated 6 years ago