☆75 · Sep 13, 2024 · Updated last year
Alternatives and similar repositories for RapBank
Users that are interested in RapBank are comparing it to the libraries listed below.
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations ☆41 · Oct 15, 2025 · Updated 8 months ago
- A song aesthetic evaluation toolkit trained on SongEval. ☆309 · Apr 8, 2026 · Updated 2 months ago
- The source code for the paper CrossSinger (asru2023) ☆18 · Oct 12, 2023 · Updated 2 years ago
- Bilingual Singing Voice Synthesis ☆18 · Mar 25, 2024 · Updated 2 years ago
- ☆18 · Jan 20, 2025 · Updated last year
- The baselines of ARC-Challenge-Interspeech2026 ☆60 · Dec 1, 2025 · Updated 7 months ago
- ☆18 · May 4, 2025 · Updated last year
- A toolkit for researchers in the multimodal sound separation. ☆16 · Oct 20, 2023 · Updated 2 years ago
- ☆13 · Sep 1, 2023 · Updated 2 years ago
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators. ☆251 · Feb 26, 2026 · Updated 4 months ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale … ☆18 · Aug 17, 2025 · Updated 10 months ago
- Vocal Remover using Deep Neural Networks ☆21 · Dec 31, 2024 · Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆38 · May 7, 2025 · Updated last year
- ICASSP2026 HumDial Challenge ☆48 · May 28, 2026 · Updated last month
- Event Relation in Text-to-Audio (TTA) Generation ☆21 · Feb 26, 2025 · Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆35 · Apr 22, 2024 · Updated 2 years ago
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆25 · Aug 1, 2025 · Updated 11 months ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU. ☆494 · Nov 23, 2025 · Updated 7 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024). ☆24 · Sep 19, 2025 · Updated 9 months ago
- X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech… ☆216 · Updated this week
- ☆15 · Apr 16, 2026 · Updated 2 months ago
- ☆15 · Mar 31, 2025 · Updated last year
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter ☆98 · Jul 4, 2024 · Updated 2 years ago
- ☆33 · Sep 15, 2025 · Updated 9 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment ☆14 · Feb 5, 2025 · Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆46 · Mar 10, 2025 · Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆135 · Mar 31, 2026 · Updated 3 months ago
- ☆31 · Nov 4, 2025 · Updated 8 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching ☆100 · Apr 2, 2026 · Updated 3 months ago
- A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations ☆157 · Feb 6, 2026 · Updated 4 months ago
- Evaluation tool used in the BigVSAN paper ☆14 · Mar 22, 2024 · Updated 2 years ago
- An AR+AR TTS attempt. ☆18 · Jan 13, 2025 · Updated last year
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems". ☆24 · Jun 10, 2024 · Updated 2 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers. ☆12 · Nov 7, 2024 · Updated last year
- ☆20 · Nov 3, 2021 · Updated 4 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS ☆33 · Mar 16, 2023 · Updated 3 years ago
- ☆11 · Mar 13, 2023 · Updated 3 years ago
- ☆19 · Aug 23, 2024 · Updated last year
- ☆11 · Dec 17, 2025 · Updated 6 months ago