trecpodcasts / podcast-audio-feature-extraction
Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.
☆12Updated 3 years ago
Alternatives and similar repositories for podcast-audio-feature-extraction:
Users that are interested in podcast-audio-feature-extraction are comparing it to the libraries listed below
- ☆12Updated 9 months ago
- Gamma Agreement in Python☆43Updated last year
- phone inventory library☆16Updated last year
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 3 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- XED multilingual emotion datasets☆58Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆14Updated 2 years ago
- Unsupervised spoken sentence embeddings☆14Updated 2 years ago
- [ICASSP'23] This repo contains code for the Demux & MEmo emotion recognition models (https://arxiv.org/abs/2210.15842), as well as code t…☆21Updated last year
- A merged version of multiple open-source German speech datasets.☆31Updated 10 months ago
- Pre-training Cross-modal Transformer for Audio-and-Language Representations☆39Updated 3 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model☆47Updated 3 years ago
- ☆17Updated 7 months ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆21Updated 7 months ago
- ☆13Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated last year
- A guide to building language technology in new languages.☆58Updated 3 years ago
- ASCEND Chinese-English code-switching dataset☆24Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- VAD analysis of text using some affective lexicon (ANEW, SENTIWORDNET, and VADER)☆25Updated 3 years ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated last week
- Repository for multilingual speech data resources for native languages of Zambia.☆15Updated 5 months ago
- Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformers☆26Updated 2 years ago
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset☆12Updated 3 years ago
- ☆42Updated 3 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆27Updated 3 weeks ago
- ☆10Updated 2 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆16Updated last year
- A spoken question answering dataset on SQUAD☆46Updated 2 years ago