skit-ai / emotion-tts-dataset
Dataset release for Emotional TTS in Indian Accent
☆35Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for emotion-tts-dataset
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆23Updated last year
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆15Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆15Updated last year
- Dataset Release for Intent Classification from Speech☆45Updated last year
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Updated 7 months ago
- follow NVIDIA, simplify it and support data parallel.☆13Updated 5 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆29Updated 3 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆30Updated 2 years ago
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- A collection of papers related to speech model compression☆24Updated last year
- with alignment learning and continuous wavelet transform☆19Updated 2 years ago
- Temporary anonymous version☆22Updated 7 months ago
- asr2k☆48Updated 5 months ago
- Google's TPGST reimplementation.☆34Updated 4 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆35Updated 4 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- ☆42Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speech☆50Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 3 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆61Updated 7 months ago
- ☆24Updated 4 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago