W-Wu / ERC-SLT22
Code for "Distribution-based Emotion Recognition in Conversation"
☆19Feb 6, 2023Updated 3 years ago
Alternatives and similar repositories for ERC-SLT22
Users who are interested in ERC-SLT22 are comparing it to the repositories listed below:
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ☆25Apr 24, 2019Updated 6 years ago
- ☆22Jul 30, 2025Updated 6 months ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Speech synthesis using LPC☆23Jun 5, 2021Updated 4 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- ☆25Mar 12, 2022Updated 3 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- Fast Inference in Denoising Diffusion Models via MMD Finetuning☆18Dec 4, 2023Updated 2 years ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- ☆37May 8, 2021Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"☆44Apr 10, 2023Updated 2 years ago
- Official code for Generative Marginalization Models [ICML 2024] [SPGIM 2023 Workshop Oral]☆24Aug 19, 2024Updated last year
- ☆22Nov 18, 2022Updated 3 years ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆68Dec 13, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- ☆10Apr 8, 2024Updated last year
- ☆49Apr 1, 2025Updated 10 months ago
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- ☆24Mar 30, 2024Updated last year
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆26Nov 4, 2023Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- ☆11May 7, 2022Updated 3 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- Text normalization before TTS: converts digits and letters into Chinese characters☆12Apr 27, 2024Updated last year
- Multi-lingual AudioCaps☆12Nov 20, 2023Updated 2 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- Pytorch implementation of "f0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Nov 6, 2020Updated 5 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆39Sep 18, 2024Updated last year
- using world vocoder to extract features and make data for training neural networks☆11Oct 9, 2017Updated 8 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆83Jan 7, 2023Updated 3 years ago