btelle / podcasts-dataset
dataset of podcasts and episodes
☆14Updated 7 years ago
Alternatives and similar repositories for podcasts-dataset
Users that are interested in podcasts-dataset are comparing it to the libraries listed below
- The RadioTalk dataset of talk radio transcripts☆60Updated 4 years ago
- Markdown template for Datasheets for Datasets☆63Updated 3 years ago
- Text Thresher crowd sourced text annotator☆17Updated 7 years ago
- Python tool for normalizing text and text canonicalization (DISCONTINUED)☆41Updated 12 years ago
- Dataset of approximately 10,000 podcasts from iTunes.☆91Updated 7 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 9 years ago
- motivational website to do something special this month☆21Updated last year
- ☆13Updated 9 years ago
- This is the text partitioner project for Python.☆21Updated 6 years ago
- A Python package to facilitate research on building and evaluating automated scoring models.☆70Updated 9 months ago
- Multilingual grapheme-to-phoneme conversion☆20Updated 7 years ago
- Python wrapper for a C++ Double Metaphone☆15Updated last month
- ADS Project☆14Updated 9 years ago
- A Python interface to OpenFst☆88Updated 6 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆41Updated 3 weeks ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 5 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 2 years ago
- Creating audio preprocessing features in TensorFlow Keras layers☆14Updated 4 years ago
- An open-source tool for automatic speech recognition (ASR) quality estimation.☆23Updated 5 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- A free dataset of (almost) all publicly available podcasts.☆134Updated 11 years ago
- High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.☆154Updated 10 months ago
- Aggressive reddit scraper in node js☆13Updated 10 years ago
- ☆81Updated 8 years ago
- Gamma Agreement in Python☆45Updated last year
- Collaborative audio annotation tool☆17Updated 3 years ago
- ☆46Updated last month
- A Python 3 phonetics library.☆134Updated 5 years ago
- ☆74Updated last week