julianbrooke / GutenTag
☆30Updated 8 years ago
Alternatives and similar repositories for GutenTag
Users that are interested in GutenTag are comparing it to the libraries listed below
Sorting:
- Practical Approaches to Data Science with Text☆39Updated 5 years ago
- [LREC 2020] EtymDB, an Etymological DataBase (v2.1)☆24Updated 3 years ago
- A tool for analyzing the word histories of a text.☆34Updated 5 months ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 5 years ago
- New York Times Word Innovation Types dataset☆21Updated 4 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated last year
- ☆34Updated 3 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- Project on the history of genre.☆23Updated 5 years ago
- Python tools for text☆15Updated 5 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Updated 3 years ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆95Updated 3 years ago
- The curation repository for the data behind Concepticon.☆38Updated 2 weeks ago
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- Poetic processing, for Python.☆40Updated last year
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 4 months ago
- a python package for cleaning Gutenberg books and dataset☆35Updated last week
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this☆223Updated 2 years ago
- A command-line program to download text corpora.☆34Updated 7 years ago
- English web corpus with 4M tokens and several annotation types☆26Updated last year
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆39Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- The Unicode Cookbook for Linguists☆54Updated 4 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Python framework for processing Universal Dependencies data☆57Updated last week
- Building and Using A Seed Corpus for the Human Language Project☆11Updated 7 years ago
- Python package for stylometry☆63Updated 4 years ago
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- The GitHub repository containing all the material related to the Computational Thinking and Programming course of the Digital Humanities …☆30Updated 4 years ago