trecpodcasts / podcast-audio-feature-extraction
Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.
☆12 · Updated 3 years ago
Alternatives and similar repositories for podcast-audio-feature-extraction
Users who are interested in podcast-audio-feature-extraction are comparing it to the libraries listed below.
- ☆12 · Updated 11 months ago
- ☆11 · Updated last year
- phone inventory library ☆16 · Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆14 · Updated 2 years ago
- Train a fiwGAN or ciwGAN model using your own training data ☆13 · Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Updated last year
- Gamma Agreement in Python ☆44 · Updated last year
- Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformers ☆27 · Updated 2 years ago
- XED multilingual emotion datasets ☆58 · Updated 2 years ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras ☆11 · Updated 4 years ago
- ☆14 · Updated 2 years ago
- VAD analysis of text using some affective lexicon (ANEW, SENTIWORDNET, and VADER) ☆25 · Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆82 · Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆32 · Updated 4 years ago
- A tiny BERT for low-resource monolingual models ☆31 · Updated 7 months ago
- ☆11 · Updated 3 years ago
- Improving Disfluency Detection by Self-Training a Self-Attentive Model ☆47 · Updated 4 years ago
- asr2k ☆50 · Updated 11 months ago
- ASCEND Chinese-English code-switching dataset ☆24 · Updated 2 years ago
- Explore different ways to mix speech models (wav2vec2, hubert) and NLP models (BART, T5, GPT) together ☆47 · Updated 2 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆22 · Updated 8 months ago
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S. ☆45 · Updated 4 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court ☆22 · Updated 2 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch ☆21 · Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 · Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 · Updated 3 years ago
- 56 language, 1 model Multilingual ASR ☆25 · Updated 3 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models ☆26 · Updated last year
- Bias Tests for Voice Technologies (bt4vt) ☆12 · Updated 11 months ago
- Generating artificial disfluencies from fluent text easily and promptly ☆13 · Updated 2 years ago