The source code for the paper CrossSinger (ASRU 2023)
☆18Oct 12, 2023Updated 2 years ago
Alternatives and similar repositories for CrossSinger
Users who are interested in CrossSinger are comparing it to the libraries listed below
- ☆19Feb 2, 2023Updated 3 years ago
- The source code for the paper XiaoiceSing2 (Interspeech 2023)☆49Jan 15, 2024Updated 2 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- singing voice conversion based on glow-tts☆12Aug 20, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- This is the official repository of "Scalable Neural Vocoder from Range-Null Space Decomposition", which has been submitted to TPAMI.☆46Oct 11, 2025Updated 5 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Jul 5, 2024Updated last year
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral☆46Jan 16, 2026Updated 2 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Aug 21, 2023Updated 2 years ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Oct 23, 2024Updated last year
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru…☆27Oct 31, 2025Updated 4 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- [WIP] Direction-based Multi-Channel Speech Separation☆14Jan 25, 2024Updated 2 years ago
- ☆15Aug 22, 2025Updated 6 months ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- VI-SVC model is just VITS without MAS and DurationPredictor.☆10Nov 9, 2023Updated 2 years ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Fea…☆11Sep 27, 2022Updated 3 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆23Jan 16, 2024Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- ☆52Jun 24, 2025Updated 8 months ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Interspeech Tutorial - Resource Efficient and Cross-Modal Learning Toward Foundation Modeling☆15Oct 9, 2023Updated 2 years ago
- ☆15Apr 2, 2025Updated 11 months ago
- Official code for Dense-TSNet☆12Sep 17, 2024Updated last year
- ☆26Sep 22, 2022Updated 3 years ago
- A pytorch template for beginners based on pytorch_lightning☆49Feb 1, 2024Updated 2 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- ☆33Jun 29, 2023Updated 2 years ago
- ☆41May 15, 2023Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- Multiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- ICASSP 2021 accepted papers on voice conversion (VC)☆18Apr 11, 2021Updated 4 years ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios" (AAAI 2026)☆91Jan 31, 2026Updated last month
- An unofficial PyTorch implementation of VALL-E☆88Aug 3, 2025Updated 7 months ago