btelle / podcasts-dataset
dataset of podcasts and episodes
☆14Updated 7 years ago
Alternatives and similar repositories for podcasts-dataset:
Users that are interested in podcasts-dataset are comparing it to the libraries listed below
- The RadioTalk dataset of talk radio transcripts☆57Updated 4 years ago
- creating audio preprocessing features in TensorFlow keras layers,☆14Updated 3 years ago
- SPARK-n-SPELL [WARNING: inactive project, not being updated]☆7Updated 8 years ago
- A simple interface to the Project Gutenberg corpus.☆17Updated 9 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 2 years ago
- Collaborative audio annotation tool☆17Updated 2 years ago
- Experiments with Hugging Face 🔬 🤗☆45Updated 6 months ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 4 years ago
- Markdown template for Dataseets for Datasets☆62Updated 2 years ago
- Data and experiments with world population densities for comparison to addresses☆12Updated 7 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 5 months ago
- Determines the ethnicity based on your last name☆10Updated 10 years ago
- ☆11Updated 6 years ago
- ☆22Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)☆14Updated 7 years ago
- Labeled data for homograph disambiguation☆55Updated last year
- Gamma Agreement in Python☆43Updated 11 months ago
- A Playground for Variational Autoencoders☆12Updated 7 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Updated 2 months ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison.☆14Updated 3 years ago
- ADS Project☆14Updated 9 years ago
- Python library for n-gram models in ARPA format☆40Updated 2 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- Repository for subjective and objective evaluation of source separation algorithms☆12Updated 6 years ago
- Code for my blog post on Generating Words from Embeddings☆23Updated 6 months ago
- This is the text partitioner project for Python.☆21Updated 6 years ago
- Compare accuracies of udpipe models and spacy models which can be used for NLP annotation☆14Updated 7 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Calculate readability scores☆40Updated 5 years ago