BrightGu / RLVC
☆15 Updated 2 years ago
Alternatives and similar repositories for RLVC
Users that are interested in RLVC are comparing it to the libraries listed below
- Any-to-one voice conversion using the data augment strategy: pitch shifted and duration remained. ☆33 Updated 4 years ago
- ☆13 Updated 2 years ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature ☆93 Updated 10 months ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation ☆251 Updated last year
- used to evaluate wavenet vocoder by rmse f0, MCD, rmse ap... ☆15 Updated 6 years ago
- ☆14 Updated 3 months ago
- A low-bitrate single-codebook 16 / 24 kHz speech codec based on focal modulation ☆140 Updated 2 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification ☆21 Updated 2 years ago
- AGI_HER_LLM ☆36 Updated last month
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR". ☆86 Updated this week
- ☆82 Updated last year
- Official data preparation scripts for the URGENT 2024 Challenge ☆87 Updated 8 months ago
- This is official repository of new SOTA diffusion models based method for speech enhancement ☆41 Updated last year
- Official implementation of SpeechSplit2 ☆133 Updated 3 years ago
- ☆30 Updated 2 years ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter ☆93 Updated last year
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio) ☆15 Updated 3 months ago
- ☆22 Updated 2 months ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆45 Updated 10 months ago
- ☆48 Updated 11 months ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024) ☆152 Updated 2 years ago
- Objective metrics used in several text-to-speech (TTS) papers. ☆52 Updated 7 months ago
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for … ☆170 Updated 8 months ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding ☆23 Updated last year
- The open source code for SimpleSpeech series ☆145 Updated last year
- Official implementation of YingMusic-SVC. ☆113 Updated last month
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging ☆16 Updated last year
- (ICASSP 2025, official code) FlowSE: Flow Matching-based Speech Enhancement ☆87 Updated 6 months ago
- [Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a… ☆69 Updated last year
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.… ☆106 Updated 2 years ago