ageitgey / Gutenberg
A simple interface to the Project Gutenberg corpus.
☆17 · Updated 9 years ago
Alternatives and similar repositories for Gutenberg:
Users who are interested in Gutenberg compare it to the libraries listed below:
- Maps clauses from a text corpus onto the metrical structure of a poem ☆17 · Updated 9 years ago
- The RadioTalk dataset of talk radio transcripts ☆59 · Updated 4 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings ☆53 · Updated 8 years ago
- ☆34 · Updated 3 years ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the… ☆33 · Updated 8 years ago
- Collection of scripts for visualizing high dimensional data with scikit-learn and bh_tsne ☆34 · Updated 9 years ago
- Fast Word Clustering Software ☆78 · Updated 3 months ago
- Exploring the shapes of stories using indico sentiment analysis APIs ☆28 · Updated 9 years ago
- A tool for detecting sentence fragments. ☆7 · Updated 8 years ago
- Browser-based annotation tool for Framenet ☆16 · Updated 10 years ago
- Tokenize tweets to determine net sentiments and locations, and generate visualizations of mean sentiment by state ☆46 · Updated 10 years ago
- A hack to replace Pride & Prejudice text with closest word2vec model word, and visualize results. ☆61 · Updated 10 years ago
- Python package for stylometry ☆63 · Updated 4 years ago
- Non-distributional linguistic word vector representations. ☆62 · Updated 7 years ago
- MetroMaps Release ☆16 · Updated 11 years ago
- Code for morphological transformations ☆29 · Updated 7 years ago
- Jupyter extension to visualize dependency structures ☆28 · Updated 7 years ago
- Visualize large text collections with WebGL ☆25 · Updated 8 months ago
- A Recurrent Neural Network trained on all existing TED Talk Transcripts. The model outputs machine generated TED Talks. ☆51 · Updated 7 years ago
- Library for Geo-Inferencing in Twitter Data ☆28 · Updated 8 years ago
- ADS Project ☆14 · Updated 9 years ago
- Using word2vec and t-SNE to compare text sources. ☆19 · Updated 9 years ago
- ☆97 · Updated 3 years ago
- Utility scripts in Python ☆37 · Updated 8 months ago
- 2016 Presidential Campaign Speeches ☆15 · Updated 8 years ago
- A tool for text normalisation via character-level machine translation ☆13 · Updated 4 years ago
- Opinion miner based on machine learning that can be trained on a corpus of KAF/NAF files ☆9 · Updated 6 years ago
- Generalized Language Modeling toolkit ☆51 · Updated 2 years ago
- A re-implementation of redpony/cdec's tokenize-anything.pl script in Python ☆8 · Updated 9 years ago
- A framework to identify relations between ideas in temporal text corpora. ☆28 · Updated 7 years ago