ageitgey / Gutenberg
A simple interface to the Project Gutenberg corpus.
☆17 Updated 10 years ago
Alternatives and similar repositories for Gutenberg
Users that are interested in Gutenberg are comparing it to the libraries listed below
- Maps clauses from a text corpus onto the metrical structure of a poem ☆17 Updated 10 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings ☆53 Updated 9 years ago
- Collection of scripts for visualizing high dimensional data with scikit-learn and bh_tsne ☆34 Updated 10 years ago
- Fast Word Clustering Software ☆79 Updated 11 months ago
- ☆98 Updated 4 years ago
- The RadioTalk dataset of talk radio transcripts ☆61 Updated 4 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs ☆28 Updated 10 years ago
- Non-distributional linguistic word vector representations. ☆62 Updated 8 years ago
- ☆34 Updated 3 years ago
- Uses a distributed word representation to find words along the hyperchord of two input words. ☆102 Updated 5 years ago
- Code for morphological transformations ☆29 Updated 8 years ago
- Python wrapper for Apache OpenNLP tools ☆34 Updated 9 years ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the… ☆32 Updated 9 years ago
- Supervised learning for novelty detection in text ☆78 Updated 9 years ago
- An API to access data from The New Yorker Caption Contest ☆62 Updated 2 years ago
- A hack to replace Pride & Prejudice text with closest word2vec model word, and visualize results. ☆61 Updated 11 years ago
- Python package for stylometry ☆64 Updated 4 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian. ☆155 Updated last year
- A Recurrent Neural Network trained on all existing TED Talk Transcripts. The model outputs machine generated TED Talks. ☆51 Updated 7 years ago
- create a browser of a corpus using a topic model; original TMVE implementation (static pages) ☆47 Updated 10 years ago
- Tokenize English sentences using neural networks. ☆63 Updated 8 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!". ☆63 Updated 10 years ago
- Parse a text corpus and generate sentences in the same style using context-free grammar combined with a Markov chain. ☆34 Updated 6 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg… ☆130 Updated last year
- A neural network based StoryTeller that outputs a short story from an input image ☆13 Updated 7 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases ☆45 Updated 5 years ago
- framework for doing NER and other types of entity recognition, in Python ☆68 Updated 3 years ago
- Code accompanying our EMNLP paper Learning Language Representations for Typology Prediction ☆72 Updated 8 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies ☆69 Updated 6 years ago
- Source code to accompany my paper "Poetic sound similarity vectors using phonetic features" ☆171 Updated 8 years ago