☆49Feb 12, 2026Updated 3 weeks ago
Alternatives and similar repositories for YingMusic-Singer
Users that are interested in YingMusic-Singer are comparing it to the libraries listed below
Sorting:
- Official code for SongEcho☆41Feb 21, 2026Updated last week
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆38Jan 22, 2026Updated last month
- Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation☆135Jan 21, 2026Updated last month
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 6 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated last month
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆23Feb 11, 2026Updated 3 weeks ago
- ☆83Dec 31, 2025Updated 2 months ago
- Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"☆36Feb 10, 2026Updated 3 weeks ago
- X Studio · 歌手 UI 自动化 | UI Automation for X Studio Singer☆28Mar 8, 2022Updated 3 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆42Jan 17, 2025Updated last year
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆64Nov 5, 2025Updated 4 months ago
- A spoken version of the textual story cloze benchmark☆20Aug 6, 2023Updated 2 years ago
- ☆13Sep 1, 2023Updated 2 years ago
- ☆17Jan 20, 2025Updated last year
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference☆53Sep 20, 2025Updated 5 months ago
- The Multi-band Excited WaveNet☆15Feb 2, 2023Updated 3 years ago
- A Foundation Model for Industrial Signal Comprehensive Representation☆57Feb 13, 2026Updated 2 weeks ago
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆124Sep 2, 2025Updated 6 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆96Oct 9, 2025Updated 4 months ago
- Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textg…☆14Feb 9, 2024Updated 2 years ago
- ☆15Sep 20, 2023Updated 2 years ago
- AudioBERT 📢 : Audio Knowledge Augmented Language Model (ICASSP 2025)☆41Feb 1, 2025Updated last year
- AnyAccomp: Generalizable accompaniment generation for vocals and solo instruments, powered by a quantized melodic bottleneck.☆33Dec 22, 2025Updated 2 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Jun 18, 2025Updated 8 months ago
- dog-can-sing-song☆51Jan 9, 2026Updated last month
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated 10 months ago
- A full collection of Music Informatic Retrieval (MIR) and AI Music labs.☆50Dec 27, 2024Updated last year
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆30Sep 11, 2025Updated 5 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆35Sep 9, 2025Updated 5 months ago
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 11 months ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆126Feb 13, 2026Updated 2 weeks ago
- ☆18May 4, 2025Updated 10 months ago
- Self-supervised Generative LM-based Voice Conversion☆54Apr 24, 2025Updated 10 months ago