btelle / podcasts-datasetLinks
dataset of podcasts and episodes
☆14Updated 7 years ago
Alternatives and similar repositories for podcasts-dataset
Users that are interested in podcasts-dataset are comparing it to the libraries listed below
Sorting:
- The RadioTalk dataset of talk radio transcripts☆61Updated 4 years ago
- ADS Project☆14Updated 9 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 4 years ago
- A Python interface to OpenFst☆88Updated 6 years ago
- Next word prediction based on N-gram language model☆12Updated 10 years ago
- A Python package to facilitate research on building and evaluating automated scoring models.☆71Updated 11 months ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆155Updated last year
- ☆32Updated 5 years ago
- Closed Caption Transcripts of News Videos from archive.org 2014--2023☆50Updated 8 months ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 3 years ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 5 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 3 years ago
- Calculate readability scores☆43Updated 6 years ago
- ☆21Updated 7 years ago
- Markdown template for Dataseets for Datasets☆63Updated 3 years ago
- Using embedding-based loss functions for phonetics/speech recognition.☆17Updated 11 years ago
- Determines the ethnicity based on your last name☆10Updated 11 years ago
- Gamma Agreement in Python☆45Updated last year
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆24Updated 2 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Updated 12 years ago
- Experiments to help discussion on Wikipedia talk pages☆68Updated 2 weeks ago
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- Fast Word Clustering Software☆79Updated 10 months ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 4 years ago
- An API to access data from The New Yorker Caption Contest☆62Updated 2 years ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 9 years ago
- Data and experiments with world population densities for comparison to addresses☆12Updated 8 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 5 years ago
- A free dataset of (almost) all publicly available podcasts.☆133Updated 11 years ago