NZqian / RapBankLinks
☆73Updated last year
Alternatives and similar repositories for RapBank
Users that are interested in RapBank are comparing it to the libraries listed below
Sorting:
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆131Updated last month
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆188Updated last year
- LongCat Audio Tokenizer and Detokenizer☆268Updated 3 weeks ago
- ☆29Updated 6 months ago
- Text-audio foundation model from Boson AI☆116Updated 4 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆117Updated 7 months ago
- Github repository for ACL 2025 paper: Recent Advances in Speech Language Models: A Survey.☆168Updated 6 months ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆103Updated 5 months ago
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆131Updated 3 months ago
- ☆61Updated 6 months ago
- Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation☆418Updated last month
- ☆112Updated 2 months ago
- Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。☆30Updated 5 months ago
- ☆94Updated 2 months ago
- A curated list of Video to Audio Generation☆90Updated last month
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]☆214Updated 8 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆86Updated last week
- ☆76Updated 3 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆216Updated 10 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆182Updated 6 months ago
- ☆112Updated 7 months ago
- Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptat…☆210Updated last month
- Curated list for papers, codes and resources related to Text-to-Audio (TTA) Generation☆68Updated last week
- Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。☆247Updated this week
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆235Updated 2 months ago
- ☆105Updated 2 months ago
- ☆11Updated 10 months ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆295Updated 2 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆24Updated last year
- An Open-Source Project to Unify Audio Processing and Generation☆159Updated 2 weeks ago