mit-ccc / RadioTalkLinks
The RadioTalk dataset of talk radio transcripts
☆60Updated 4 years ago
Alternatives and similar repositories for RadioTalk
Users that are interested in RadioTalk are comparing it to the libraries listed below
Sorting:
- New York Times Word Innovation Types dataset☆21Updated 4 years ago
- Code for my blog post on Generating Words from Embeddings☆23Updated 11 months ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29Updated 5 years ago
- Experiments to help discussion on Wikipedia talk pages☆66Updated last week
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- An API to access data from The New Yorker Caption Contest☆62Updated 2 years ago
- A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archi…☆17Updated 7 years ago
- ☆34Updated 3 years ago
- dataset of podcasts and episodes☆14Updated 7 years ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 9 years ago
- Code for learning geographically-informed word embeddings☆22Updated 3 years ago
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- A guide to building language technology in new languages.☆58Updated 3 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- SPARK-n-SPELL [WARNING: inactive project, not being updated]☆7Updated 8 years ago
- Code and data from our ACL 2014 paper "Humans Require Context to Infer Ironic Intent (so Computers Probably do, too)"☆15Updated 11 years ago
- A repository of materials for a proposed class on automated story bots.☆49Updated 6 years ago
- Code and data for Koenecke et al. (2020)☆29Updated 2 years ago
- Matrix tools for building and inspecting latent spaces☆27Updated 6 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Markdown template for Dataseets for Datasets☆63Updated 3 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆55Updated last year
- How (but not why) to do Twitter sociolinguistic analysis in the Unix Shell☆10Updated 9 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 6 years ago
- Gamma Agreement in Python☆44Updated last year
- Visualize large text collections with WebGL☆25Updated 9 months ago
- Compare coverage across different media sources using the Juicer☆12Updated 9 years ago