naver / multilingual-distilwhisperView external linksLinks
This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.
☆33Oct 23, 2025Updated 3 months ago
Alternatives and similar repositories for multilingual-distilwhisper
Users that are interested in multilingual-distilwhisper are comparing it to the libraries listed below
Sorting:
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Neural model for prediction of stress position in Russian words☆12Jun 22, 2025Updated 7 months ago
- ☆14Aug 19, 2024Updated last year
- ☆13Dec 7, 2022Updated 3 years ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- ☆18Mar 13, 2024Updated last year
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- asr2k☆52Jun 2, 2024Updated last year
- ☆20Jul 22, 2022Updated 3 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆54Dec 11, 2024Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Mar 24, 2023Updated 2 years ago
- Bilingual-TTS (Japanese and Korean)☆32Jul 1, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- ☆38Apr 15, 2024Updated last year
- Labeled data for homograph disambiguation☆62Jun 1, 2023Updated 2 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- Russian phonetical transcription☆11Nov 19, 2025Updated 2 months ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆27Feb 21, 2025Updated 11 months ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆25Aug 11, 2024Updated last year
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- 🎵 muse: Music Separation☆11Feb 14, 2024Updated 2 years ago
- ☆13Nov 22, 2022Updated 3 years ago
- text to speech☆10Mar 19, 2024Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆174Jun 9, 2023Updated 2 years ago
- ☆11Jun 14, 2024Updated last year
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago