nene1212 / MaskGCT-Training
Training code for MaskGCT-T2S model.
☆19Updated 3 months ago
Alternatives and similar repositories for MaskGCT-Training:
Users that are interested in MaskGCT-Training are comparing it to the libraries listed below
- ☆30Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆24Updated 3 weeks ago
- ☆69Updated last year
- ☆51Updated 4 months ago
- ☆26Updated 10 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆75Updated 3 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆42Updated 3 months ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆22Updated 3 months ago
- A low-bitrate single-codebook 16 kHz speech codec based on focal modulation☆79Updated last month
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆52Updated 2 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆48Updated 8 months ago
- VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling☆67Updated 4 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆58Updated 2 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆43Updated 2 weeks ago
- ☆65Updated last year
- Source code for DM-Codec.☆40Updated 5 months ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆51Updated 5 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆65Updated 2 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆95Updated last year
- This is official repository of new SOTA diffusion models based method for speech enhancement☆38Updated 7 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆38Updated 9 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆57Updated 7 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆35Updated 2 weeks ago
- ☆58Updated 5 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆55Updated 4 months ago
- ☆69Updated 2 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆79Updated 3 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 6 months ago
- faster inference☆27Updated 2 months ago