trecpodcasts / podcast-audio-feature-extractionLinks
Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.
β12Updated 4 years ago
Alternatives and similar repositories for podcast-audio-feature-extraction
Users that are interested in podcast-audio-feature-extraction are comparing it to the libraries listed below
Sorting:
- Gamma Agreement in Pythonβ45Updated last year
- π― Speech Recognition Challenge by Speech Lab - IIT Madrasβ11Updated 4 years ago
- phone inventory libraryβ17Updated 2 years ago
- Repository for Quantifying Valence and Arousal in Text with Multilingual Pre-trained Transformersβ37Updated 2 years ago
- Repository for multilingual speech data resources for native languages of Zambia.β18Updated last year
- β13Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β18Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ24Updated 4 years ago
- A family of efficient speech models for multilingual phone recognitionβ23Updated last month
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The texβ¦β55Updated 5 years ago
- XED multilingual emotion datasetsβ63Updated 2 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Courtβ22Updated 2 years ago
- Train a fiwGAN or ciwGAN model using your own training dataβ13Updated 3 years ago
- babyLM WhisBERT codeβ19Updated last year
- Scripts to create speech corpora from open.bibleβ13Updated 3 years ago
- Repository containing the open source code of works published at the FBK MT unit.β54Updated 3 months ago
- asr2kβ52Updated last year
- β34Updated 4 years ago
- Behavioral probing of language acquisition models at the lexical and syntactic levelβ17Updated 2 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/β¦β35Updated 2 months ago
- A PyPI package for fast word/character error rate (WER/CER) calculationβ72Updated 2 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.β34Updated 7 months ago
- β10Updated 2 years ago
- β10Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- A guide to building language technology in new languages.β59Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated 2 years ago
- A merged version of multiple open-source German speech datasets.β33Updated last year
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech represβ¦β21Updated last year
- Implementation of the DIVA model of speech acquisition and production using PyTorchβ21Updated 2 years ago