multimodal-art-projection / Open-Suno
trying to reproduce suno v3
☆25Updated 9 months ago
Alternatives and similar repositories for Open-Suno:
Users that are interested in Open-Suno are comparing it to the libraries listed below
- ☆24Updated 6 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆42Updated 2 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆19Updated this week
- LLaSA: Scaling Train Time and Test Time Compute for LLaMA based Speech Synthesis☆24Updated this week
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"☆33Updated last year
- ☆33Updated last month
- The open source code for LLM-Codec☆120Updated 5 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆51Updated 2 months ago
- ☆18Updated 2 weeks ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆15Updated 10 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆60Updated 2 months ago
- Official release of StyleTalk dataset.☆60Updated 6 months ago
- ☆18Updated 8 months ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆41Updated 7 months ago
- ☆34Updated 9 months ago
- A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos.☆28Updated 2 months ago
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆50Updated 3 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 5 months ago
- Official repo for CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations☆43Updated this week
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Updated 2 months ago
- (WIP)long form speech generatoins☆29Updated last month
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆63Updated 2 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆76Updated 3 weeks ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆25Updated 6 months ago
- ☆37Updated this week
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models☆23Updated 5 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆74Updated last year
- ☆9Updated 7 months ago
- Temporary anonymous version☆22Updated 9 months ago