ElectricAlexis / NotaGenLinks
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
☆1,011Updated last month
Alternatives and similar repositories for NotaGen
Users that are interested in NotaGen are comparing it to the libraries listed below
Sorting:
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆1,671Updated last week
- InspireMusic: Music, Song, Audio Generation.☆1,107Updated 2 weeks ago
- ☆875Updated last month
- YuE: Open Full-song Generation Foundation for the GPU Poor☆397Updated 3 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆2,288Updated last week
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆731Updated 3 months ago
- ☆1,629Updated 2 months ago
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open☆5,034Updated 2 weeks ago
- OpenMusic: SOTA Text-to-music (TTM) Generation☆565Updated last month
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,091Updated last month
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,547Updated 3 weeks ago
- Interface for OuteTTS models.☆1,294Updated this week
- Midi event transformer for symbolic music generation☆278Updated 5 months ago
- Generate music based on natural language prompts using LLMs running locally☆1,024Updated 3 months ago
- Official implementations for paper: VACE: All-in-One Video Creation and Editing☆2,273Updated 2 weeks ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,460Updated 2 weeks ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆276Updated last month
- A Fast TTS Engine☆502Updated 4 months ago
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆391Updated last month
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆962Updated 2 weeks ago
- MCP Server implementation for Ableton Live OSC control☆260Updated 2 months ago
- FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,288Updated 2 weeks ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆908Updated 7 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆567Updated last month
- Model for MDX23 music separation contest☆736Updated last month
- Unified automatic quality assessment for speech, music, and sound.☆493Updated last month
- first base model for full-duplex conversational audio☆1,746Updated 4 months ago
- Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields☆798Updated last month
- ☆559Updated this week
- Transform PDFs into AI podcasts for engaging on-the-go audio content.☆660Updated last month