NZqian / RapBank
☆64Updated 6 months ago
Alternatives and similar repositories for RapBank:
Users that are interested in RapBank are comparing it to the libraries listed below
- CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages☆129Updated last month
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆161Updated 10 months ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆106Updated 2 months ago
- ☆46Updated 2 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆94Updated 5 months ago
- Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models☆31Updated last month
- ☆54Updated 8 months ago
- ☆23Updated 3 months ago
- ☆58Updated last month
- ☆210Updated 2 weeks ago
- ☆74Updated 5 months ago
- flow mirror models from JZX AI Labs☆43Updated 6 months ago
- PodAgent: A Comprehensive Framework for Podcast Generation☆63Updated 2 weeks ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆84Updated 3 months ago
- 重构GPT-SOVITS的项目,重写了部分代码,优化了webui的使用以及增加了api调用☆27Updated 3 months ago
- Official implementation for FlowSep☆35Updated 3 months ago
- ☆18Updated 3 weeks ago
- ☆80Updated 4 months ago
- A curated list of Video to Audio Generation☆35Updated 5 months ago
- F5-TTS 推理加速,速度提升约4倍!☆64Updated 2 months ago
- The official repo for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation☆34Updated this week
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆23Updated 6 months ago
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆23Updated 3 weeks ago
- Follow the rapid development of AIGC models and applications. | 跟上AIGC模型和应用快速发展的步伐 🚀☆81Updated last year
- official code for CVPR'24 paper Diff-BGM☆60Updated 5 months ago
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆32Updated 4 months ago
- small audio language model for reasoning☆50Updated last week
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆93Updated 3 months ago
- Awesome music generation model——MG²☆145Updated this week
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆181Updated last year