btelle / podcasts-dataset
dataset of podcasts and episodes
☆14Updated 6 years ago
Related projects: ⓘ
- The RadioTalk dataset of talk radio transcripts☆55Updated 3 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated last year
- Aggressive reddit scraper in node js☆13Updated 9 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Featurize words into orthographic and phonological vectors.☆39Updated last year
- Code for my blog post on Generating Words from Embeddings☆23Updated last month
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 3 years ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 8 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆34Updated last year
- New York Times Word Innovation Types dataset☆21Updated 3 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- motivational website to do something special this month☆20Updated 8 months ago
- Visualize large text collections with WebGL☆25Updated 2 weeks ago
- Self-supervised neural network for music recommendations.☆17Updated last year
- SPARK-n-SPELL [WARNING: inactive project, not being updated]☆7Updated 8 years ago
- Analysis of gutenberg dataset☆40Updated 5 years ago
- ADS Project☆14Updated 8 years ago
- Gamma Agreement in Python☆43Updated 6 months ago
- Dataset Release for Intent Classification from Speech☆43Updated last year
- Exploring the shapes of stories using indico sentiment analysis APIs☆28Updated 9 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 2 weeks ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆22Updated last year
- Simple CTC implementation for PyTorch☆14Updated 6 years ago
- A framework to identify relations between ideas in temporal text corpora.☆28Updated 6 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)☆14Updated 7 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated last year
- ☆29Updated 2 years ago
- Multilingual grapheme-to-phoneme conversion☆19Updated 6 years ago
- An API to access data from The New Yorker Caption Contest☆60Updated last year