julianbrooke / GutenTag
☆31 Updated 8 years ago
Alternatives and similar repositories for GutenTag
Users who are interested in GutenTag are comparing it to the libraries listed below
- A tool for analyzing the word histories of a text. ☆35 Updated 11 months ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆223 Updated 2 years ago
- A corpus of poetry from Project Gutenberg ☆209 Updated 7 years ago
- Practical Approaches to Data Science with Text ☆39 Updated 5 years ago
- Poetic processing, for Python. ☆42 Updated last year
- Grammar Induction using a Template Tree Approach ☆45 Updated 5 months ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆315 Updated 3 years ago
- This is the repository for 2018's collaborative NaNoLiPo project. ☆34 Updated 6 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project ☆27 Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts ☆84 Updated last year
- a python package for cleaning Gutenberg books and dataset ☆34 Updated 5 months ago
- Various utilities for processing the data. ☆213 Updated this week
- linguistics tree drawing to SVG in python, aimed at Jupyter ☆65 Updated last year
- CONLL-U to Pandas DataFrame ☆31 Updated 7 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆152 Updated last week
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016… ☆68 Updated 3 years ago
- The curation repository for the data behind Concepticon. ☆40 Updated last month
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX). ☆50 Updated 2 years ago
- A command-line program to download text corpora. ☆34 Updated 8 years ago
- An annotated corpus of argumentative microtexts ☆40 Updated 3 years ago
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text. ☆98 Updated 3 years ago
- A Python library for topic modeling and visualization ☆66 Updated 5 years ago
- Repository for the Georgetown University Multilayer Corpus (GUM) ☆101 Updated 3 weeks ago
- PoKi: A Large Dataset of Poems by Children ☆36 Updated 8 months ago
- A list of resources for conservation, development, and documentation of endangered, minority, and low or under-resourced human languages. ☆35 Updated 2 years ago
- New York Times Word Innovation Types dataset ☆21 Updated 4 years ago
- German Morphological Analyzer ☆48 Updated 3 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ☆19 Updated 2 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆23 Updated 3 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆57 Updated 2 months ago