trecpodcasts / podcast-audio-feature-extractionLinks
Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.
☆12Updated 4 years ago
Alternatives and similar repositories for podcast-audio-feature-extraction
Users that are interested in podcast-audio-feature-extraction are comparing it to the libraries listed below
Sorting:
- Gamma Agreement in Python☆45Updated last year
- Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformers☆39Updated 2 years ago
- A family of efficient speech models for multilingual phone recognition☆30Updated last month
- Train a fiwGAN or ciwGAN model using your own training data☆14Updated 3 years ago
- phone inventory library☆17Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆19Updated last year
- ☆14Updated last year
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆36Updated 9 months ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 3 years ago
- Bias Tests for Voice Technologies (bt4vt)☆11Updated last year
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Updated 5 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Updated 2 years ago
- ☆11Updated 4 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆56Updated 5 years ago
- Repo for the Wasabi datasets☆114Updated 8 months ago
- ☆34Updated 4 years ago
- XED multilingual emotion datasets☆64Updated 2 years ago
- Behavioral probing of language acquisition models at the lexical and syntactic level☆17Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 3 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 4 years ago
- ☆18Updated last year
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆18Updated 2 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Updated 2 months ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- ☆19Updated 3 years ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…☆21Updated 5 years ago
- ☆14Updated 2 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆36Updated 4 months ago