ageitgey / Gutenberg
A simple interface to the Project Gutenberg corpus.
☆17 Updated 9 years ago
Alternatives and similar repositories for Gutenberg:
Users who are interested in Gutenberg are comparing it to the libraries listed below.
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings ☆52 Updated 8 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem ☆17 Updated 9 years ago
- The RadioTalk dataset of talk radio transcripts ☆57 Updated 4 years ago
- Fast Word Clustering Software ☆78 Updated last month
- Support library for NLP and machine learning. ☆26 Updated 7 years ago
- Multilingual Language Modeling Toolkit ☆11 Updated 7 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs ☆28 Updated 9 years ago
- A framework to identify relations between ideas in temporal text corpora. ☆28 Updated 6 years ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the… ☆33 Updated 8 years ago
- Jupyter extension to visualize dependency structures ☆28 Updated 6 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Dependencies ☆70 Updated 5 years ago
- "Translate" a plot from Mark Riedl's WikiPlots corpus into a poem. For NaPoGenMo 2017. ☆20 Updated 7 years ago
- numeric fused-head identification and resolution ☆33 Updated 5 years ago
- Code for morphological transformations ☆29 Updated 7 years ago
- Code for my blog post on Generating Words from Embeddings ☆23 Updated 7 months ago
- 2016 Presidential Campaign Speeches ☆15 Updated 8 years ago
- This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity. ☆35 Updated 5 years ago
- MiTextExplorer - interactive browser of text and document covariates. ☆24 Updated 9 years ago
- Normalizes lexically ill-formed text to its most likely clean text, e.g. "c u thr 2nite!" -> "see you there tonight!". ☆63 Updated 9 years ago
- ADS Project ☆14 Updated 9 years ago
- Collection of scripts for visualizing high dimensional data with scikit-learn and bh_tsne ☆34 Updated 9 years ago
- A tool for detecting sentence fragments. ☆7 Updated 8 years ago
- How (but not why) to do Twitter sociolinguistic analysis in the Unix Shell ☆10 Updated 8 years ago
- New York Times Word Innovation Types dataset ☆21 Updated 4 years ago
- Analysis of gutenberg dataset ☆43 Updated 6 years ago
- Construction and Analysis of an Emotion Proposition Store ☆28 Updated 2 years ago
- ☆18 Updated 6 years ago
- A neural network based StoryTeller that outputs a short story from an input image ☆13 Updated 6 years ago
- Implementation of a simple frame identification approach (SimpleFrameId) described in the paper "Out-of-domain FrameNet Semantic Role Lab… ☆15 Updated 7 years ago
- Python package for stylometry ☆61 Updated 3 years ago