markriedl / WikiPlots
A dataset containing story plots from Wikipedia (books, movies, etc.) and the code for the extractor.
☆306 Updated 6 years ago
Related projects:
- Collection of tools for building diachronic/historical word vectors ☆417 Updated 9 months ago
- A large corpus of discourse annotations and relations on ~10K forum threads. ☆238 Updated 5 years ago
- Sample implementation of a politeness model, trained on the Stanford Politeness Corpus ☆146 Updated 2 years ago
- Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.co… ☆308 Updated 2 years ago
- I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this ☆209 Updated last year
- Source code to accompany my paper "Poetic sound similarity vectors using phonetic features" ☆166 Updated 6 years ago
- A corpus of poetry from Project Gutenberg ☆187 Updated 6 years ago
- Generating gradients, exploring neighborhoods. ☆196 Updated last year
- A simple interface to the Project Gutenberg corpus. ☆320 Updated last year
- A simple Python interface for Darius Kazemi's Corpora Project. ☆119 Updated 4 years ago
- Uses a distributed word representation to find words along the hyperchord of two input words. ☆101 Updated 4 years ago
- Annotated dataset of 100 works of fiction to support tasks in natural language processing and the computational humanities. ☆340 Updated last year
- This repository contains the three WikiReading datasets as used and described in WikiReading: A Novel Large-scale Language Understanding … ☆270 Updated 6 years ago
- Code and data for inducing domain-specific sentiment lexicons. ☆195 Updated last month
- Socially-Equitable Language Identification ☆78 Updated last year
- Theano code for experiments in the paper "A Hybrid Convolutional Variational Autoencoder for Text Generation." ☆205 Updated 5 years ago
- I have this big list of links to text stuff that I like, so I thought I'd make it into a repository. ☆67 Updated 6 years ago
- A python framework for learning and producing verse poetry ☆73 Updated 9 years ago
- Democratizing NLP! ☆105 Updated 9 months ago
- Python port of the Twokenize class of ark-tweet-nlp ☆142 Updated 6 years ago
- Code accompanying our EMNLP paper Learning Language Representations for Typology Prediction ☆71 Updated 7 years ago
- A corpus of 100,000 happy moments ☆357 Updated 6 years ago
- Generates poetry from images using convolutional and recurrent neural networks ☆307 Updated 7 years ago
- ☆32 Updated 2 years ago
- A toolkit for corpus linguistics ☆199 Updated 5 years ago
- Quickly extract multi-word phrases from a corpus ☆190 Updated 4 years ago
- relationship modeling networks (NAACL 2016) ☆87 Updated 3 years ago
- A simple interface for the CMU pronouncing dictionary ☆301 Updated last month
- Data and analysis for the BuzzFeed News article, "Hyperpartisan Facebook Pages Are Publishing False And Misleading Information At An Alar… ☆109 Updated 7 years ago
- PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, an… ☆479 Updated last year