multimodal-art-projection / Open-Suno
trying to reproduce suno v3
☆33 · Updated 2 months ago
Alternatives and similar repositories for Open-Suno:
Users who are interested in Open-Suno are comparing it to the repositories listed below.
- ☆24 · Updated 3 months ago
- ☆46 · Updated 3 months ago
- GPT-style network for phonemization with durations of text ☆64 · Updated last year
- small audio language model for reasoning ☆58 · Updated last week
- A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos. ☆48 · Updated 5 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr… ☆18 · Updated last year
- This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture. ☆49 · Updated last month
- ☆35 · Updated last year
- ☆107 · Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer ☆52 · Updated 5 months ago
- ☆40 · Updated 2 months ago
- ☆23 · Updated last month
- ☆29 · Updated 9 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024) ☆65 · Updated 5 months ago
- ☆20 · Updated 6 months ago
- Official implementation for FlowSep ☆42 · Updated 3 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆86 · Updated 4 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆27 · Updated 9 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization ☆48 · Updated this week
- SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" ☆35 · Updated last year
- Official Code for ParrotTTS ☆48 · Updated 6 months ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial… ☆41 · Updated 2 months ago
- LUCY: Linguistic Understanding and Control Yielding Early Stage of Her ☆37 · Updated last week
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation ☆33 · Updated last year
- GPT for FACodec ☆13 · Updated last year
- ☆41 · Updated last year
- Codebase and project page for EDMSound ☆34 · Updated last year
- My vocoder experiments ☆28 · Updated 6 months ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale ☆27 · Updated last year
- The open source code for LLM-Codec ☆133 · Updated 8 months ago