ageitgey / Gutenberg
A simple interface to the Project Gutenberg corpus.
☆17Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for Gutenberg
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 9 years ago
- Fast Word Clustering Software☆74Updated 3 months ago
- The Non-Official Characterization (NOC) List is a knowledge-base containing semantic triples about famous people, living and dead, fictio…☆24Updated 5 years ago
- Simple rules based grapheme to phoneme in Python☆10Updated 7 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings☆52Updated 7 years ago
- 2016 Presidential Campaign Speeches☆15Updated 8 years ago
- A tool for detecting sentence fragments.☆7Updated 8 years ago
- Code for learning geographically-informed word embeddings☆22Updated 2 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Updated 10 years ago
- Python SDK for the TextRazor Text Analytics API☆20Updated last year
- "Translate" a plot from Mark Riedl's WikiPlots corpus into a poem. For NaPoGenMo 2017.☆20Updated 7 years ago
- Jupyter extension to visualize dependency structures☆28Updated 6 years ago
- ☆97Updated 3 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!".☆63Updated 9 years ago
- A temporal ordering system for events and time expressions in written text.☆43Updated 2 years ago
- ☆32Updated 2 years ago
- Collection of scripts for visualizing high dimensional data with scikit-learn and bh_tsne☆34Updated 9 years ago
- Code for morphological transformations☆29Updated 7 years ago
- The RadioTalk dataset of talk radio transcripts☆57Updated 3 years ago
- Multilingual Language Modeling Toolkit☆11Updated 7 years ago
- MiTextExplorer - interactive browser of text and document covariates.☆24Updated 9 years ago
- New York Times Word Innovation Types dataset☆21Updated 3 years ago
- A re-implementation of redpony/cdec's tokenize-anything.pl script in python☆8Updated 8 years ago
- Python package for stylometry☆56Updated 3 years ago
- Parse a text corpus and generate sentences in the same style using context-free grammar combined with a Markov chain.☆34Updated 5 years ago
- A framework to identify relations between ideas in temporal text corpora.☆29Updated 6 years ago
- Text simplification using RNNs☆56Updated 8 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- BlackboxNLP 2019: Analyzing and interpreting neural networks for NLP☆18Updated 5 years ago
- Non-distributional linguistic word vector representations.☆62Updated 7 years ago