☆73 · Sep 13, 2024 · Updated last year
Alternatives and similar repositories for RapBank
Users that are interested in RapBank are comparing it to the libraries listed below
- Bilingual Singing Voice Synthesis ☆18 · Mar 25, 2024 · Updated last year
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale … ☆18 · Aug 17, 2025 · Updated 6 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations ☆33 · Oct 15, 2025 · Updated 4 months ago
- ☆15 · Mar 31, 2025 · Updated 11 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆35 · May 7, 2025 · Updated 9 months ago
- A toolkit for researchers in the multimodal sound separation. ☆16 · Oct 20, 2023 · Updated 2 years ago
- The source code for the paper CrossSinger (ASRU 2023) ☆18 · Oct 12, 2023 · Updated 2 years ago
- A neural vocoder supporting Ring Attention, Conformer and NSF. ☆24 · Aug 1, 2025 · Updated 7 months ago
- ☆15 · Aug 22, 2025 · Updated 6 months ago
- The baselines of ARC-Challenge-Interspeech2026 ☆56 · Dec 1, 2025 · Updated 3 months ago
- Vocal Remover using Deep Neural Networks ☆19 · Dec 31, 2024 · Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS ☆24 · Sep 9, 2024 · Updated last year
- A song aesthetic evaluation toolkit trained on SongEval. ☆285 · Jun 15, 2025 · Updated 8 months ago
- An AR+AR TTS attempt. ☆18 · Jan 13, 2025 · Updated last year
- ☆18 · May 4, 2025 · Updated 10 months ago
- G2P for English TTS ☆19 · Nov 10, 2022 · Updated 3 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition. ☆21 · May 26, 2025 · Updated 9 months ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers. ☆12 · Nov 7, 2024 · Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" … ☆10 · Nov 6, 2024 · Updated last year
- ☆11 · Mar 13, 2023 · Updated 2 years ago
- ☆11 · Aug 11, 2023 · Updated 2 years ago
- CTC decoder with hotwords for ASR. ☆34 · Apr 13, 2025 · Updated 10 months ago
- Reimplementation of Miipher ☆29 · Aug 16, 2023 · Updated 2 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆12 · Mar 11, 2025 · Updated 11 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics ☆15 · Oct 28, 2024 · Updated last year
- Text to speech ☆10 · Mar 19, 2024 · Updated last year
- Onset-and-Offset-Aware Sound Event Detection ☆21 · Feb 10, 2025 · Updated last year
- SimplifiedTransformer simplifies transformer block without affecting training. Skip connections, projection parameters, sequential sub-bl… ☆15 · Feb 6, 2026 · Updated 3 weeks ago
- ☆11 · Nov 7, 2024 · Updated last year
- Public female English corpus used for Project AI❤dol ☆14 · Dec 28, 2025 · Updated 2 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. … ☆13 · Dec 4, 2024 · Updated last year
- Just another FastSpeech 2 but cleaner code :) ☆29 · Jun 28, 2024 · Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present… ☆26 · Oct 5, 2022 · Updated 3 years ago
- The source code for the paper XiaoiceSing2 (Interspeech 2023) ☆49 · Jan 15, 2024 · Updated 2 years ago
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆131 · Oct 2, 2025 · Updated 5 months ago
- ☆15 · Nov 11, 2024 · Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆13 · Feb 5, 2025 · Updated last year
- SinGlow is a part of my singing voice synthesis system. It can extract features of sound, particularly songs and music. Then we can use … ☆11 · Oct 9, 2021 · Updated 4 years ago
- CCMusic, an open Chinese music database, integrates diverse datasets. It ensures data consistency via cleaning, label refinement and stru… ☆26 · Oct 31, 2025 · Updated 4 months ago