mechanicalscribe / cmu_tf_idf
Google books word frequencies for words in the CMU Pronunciation Dictionary
☆14Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for cmu_tf_idf
- A repository of materials for a proposed class on automated story bots.☆49Updated 6 years ago
- Library for Geo-Inferencing in Twitter Data☆28Updated 8 years ago
- An implementation of latent Dirichlet allocation in javascript☆183Updated 2 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 9 years ago
- All the Harry Potter clusters you could ever want☆34Updated 9 years ago
- rapid nlp prototyping☆72Updated 2 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 10 years ago
- a framework and language for exploring and analyzing feeds of social media data.☆23Updated 12 years ago
- A platform for collecting, analyzing, and visualizing social media data.☆12Updated 3 years ago
- Tools for working with Optical Character Recognition output☆16Updated 10 years ago
- Parser and standardizer for politician, individual and organization names.☆128Updated 7 years ago
- Demo code for learning_text_transformer☆25Updated 9 years ago
- Tools for text tokenization and encoding☆84Updated 3 years ago
- Intro to some NLP concepts in Python for a class☆96Updated 9 years ago
- a set of services that provide NLP facilities☆25Updated 3 years ago
- A tool for detecting sentence fragments.☆7Updated 8 years ago
- Literate data analysis with iPython notebooks and Jekyll.☆92Updated 10 years ago
- bigram / trigram analysis of wikipedia; mainly mutual info☆22Updated 12 years ago
- Generates rhyming sonnets in (mostly) iambic pentameter from any text corpus☆52Updated 10 years ago
- Supervised learning for novelty detection in text☆79Updated 8 years ago
- Python natural language processing work☆29Updated 15 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 8 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- Multivariate data visualization using D3.js and React.js☆18Updated 9 years ago
- Strips boilerplate from Project Gutenberg text files☆16Updated 3 years ago
- Publicly available data for Paperscape☆44Updated 6 years ago
- The Face-o-Matic 2000 finds known faces on TV☆19Updated 6 years ago
- Embedding data into immersive environments☆23Updated 7 years ago