South-Twilight / SingMOS
☆14Updated last week
Alternatives and similar repositories for SingMOS:
Users that are interested in SingMOS are comparing it to the libraries listed below
- ☆38Updated 5 months ago
- Music generation☆24Updated 9 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆23Updated 9 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆49Updated last month
- ☆22Updated 10 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆63Updated 10 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆36Updated 8 months ago
- ☆99Updated 5 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆35Updated 6 months ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆51Updated 11 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆69Updated 3 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆52Updated 3 months ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- Source code of APNet2, a vocoder☆54Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆50Updated 2 years ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆40Updated 3 weeks ago
- ☆43Updated 8 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆48Updated 2 weeks ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆66Updated 7 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆54Updated last month
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- ☆63Updated last year
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆36Updated this week
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆35Updated last month
- AudioSR-Upsampling (any -> 48kHz)☆38Updated last year
- Vocal Remover using Deep Neural Networks☆16Updated last month
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆31Updated last month
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆24Updated 4 months ago