yuvalpinter / nytwit
New York Times Word Innovation Types dataset
☆21Updated 4 years ago
Alternatives and similar repositories for nytwit:
Users that are interested in nytwit are comparing it to the libraries listed below
- PoKi: A Large Dataset of Poems by Children☆35Updated last month
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- English Small World of Words SWOWEN-2018☆66Updated 2 years ago
- Python tools for text☆15Updated 4 years ago
- Practical Approaches to Data Science with Text☆39Updated 5 years ago
- The RadioTalk dataset of talk radio transcripts☆58Updated 4 years ago
- Discovery of Rhyme Schemes in Poetry☆17Updated 13 years ago
- ☆33Updated 3 years ago
- ☆30Updated 8 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Bayesian pragmatic models implemented in Python☆19Updated 8 years ago
- Matrix tools for building and inspecting latent spaces☆27Updated 6 years ago
- This is the repository for 2018's collaborative NaNoLiPo project.☆33Updated 6 years ago
- How (but not why) to do Twitter sociolinguistic analysis in the Unix Shell☆10Updated 8 years ago
- Easy to use ML model for spelling and sounding out words☆91Updated 8 months ago
- Quick implementation of Monroe et al.'s algorithm for comparing languages☆52Updated 4 years ago
- An API to access data from The New Yorker Caption Contest☆61Updated 2 years ago
- ☆11Updated 5 years ago
- Notebooks and other course materials for Emory QTM 340 (Fall 2021)☆23Updated 2 years ago
- National Poetry Generation Month 2021☆9Updated 4 years ago
- A corpus of poetry from Project Gutenberg☆201Updated 6 years ago
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆39Updated last year
- ☆70Updated 3 months ago
- Python library for extracting quantitative, reproducible metrics of multi-level alignment between speakers in naturalistic language corpo…☆45Updated 2 weeks ago
- Generating books from GANs trained on bitmaps of whole words☆21Updated 5 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- ☆11Updated 7 years ago
- Data Gardens -- Fall 2019, CMU☆32Updated 2 years ago
- Code for learning geographically-informed word embeddings☆22Updated 3 years ago
- Data and code for the book Enumerations: Data and Literary Study (Chicago 2018)☆25Updated 6 years ago