Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSIVE textual corpus.
☆24Oct 8, 2025Updated 4 months ago
Alternatives and similar repositories for Speech-MASSIVE
Users that are interested in Speech-MASSIVE are comparing it to the libraries listed below
Sorting:
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- ☆15Nov 10, 2025Updated 3 months ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- ☆20Sep 20, 2024Updated last year
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- Phoneme alignment representation compatible with multiple forced aligners☆22Apr 12, 2024Updated last year
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆63Dec 26, 2025Updated 2 months ago
- ☆28Sep 5, 2024Updated last year
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆11Dec 19, 2025Updated 2 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- VoxAngeles Corpus☆13Aug 23, 2025Updated 6 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".☆10Dec 2, 2024Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆10Mar 20, 2021Updated 4 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- Collection of Open Source Speech Data☆164Oct 3, 2025Updated 5 months ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition