mit-ccc / RadioTalk
The RadioTalk dataset of talk radio transcripts
☆57Updated 4 years ago
Alternatives and similar repositories for RadioTalk:
Users that are interested in RadioTalk are comparing it to the libraries listed below
- An API to access data from The New Yorker Caption Contest☆61Updated last year
- dataset of podcasts and episodes☆14Updated 7 years ago
- New York Times Word Innovation Types dataset☆21Updated 4 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 9 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 3 years ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the…☆33Updated 8 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Code for my blog post on Generating Words from Embeddings☆23Updated 7 months ago
- Text Thresher crowd sourced text annotator☆17Updated 7 years ago
- Compare coverage across different media sources using the Juicer☆12Updated 8 years ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain☆60Updated 5 years ago
- Python tools for text☆15Updated 4 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 5 years ago
- Easy to use ML model for spelling and sounding out words☆90Updated 7 months ago
- Jupyter extension to visualize dependency structures☆28Updated 6 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- Experiments to help discussion on Wikipedia talk pages☆66Updated 3 months ago
- A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archi…☆17Updated 6 years ago
- A web application tagging and retrieval of arguments in text☆29Updated last year
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Source code to accompany my paper "Poetic sound similarity vectors using phonetic features"☆170Updated 7 years ago
- Barista is an open-source framework for concurrent speech processing.☆36Updated 10 years ago
- A framework to identify relations between ideas in temporal text corpora.☆28Updated 6 years ago
- ☆14Updated 6 years ago
- Uses NLP methods to parse and classify contracts from The City of New Orleans☆10Updated 9 years ago
- PoKi: A Large Dataset of Poems by Children☆35Updated last week