ElectricAlexis / NotaGen
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
☆1,076 Updated 4 months ago
Alternatives and similar repositories for NotaGen
Users who are interested in NotaGen are comparing it to the libraries listed below.
- ☆817 Updated last week
- A fundamental toolkit designed for music, song, and audio generation ☆1,182 Updated 3 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching ☆773 Updated last month
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion ☆1,899 Updated last month
- YuE: Open Full-song Generation Foundation for the GPU Poor ☆434 Updated 6 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model ☆2,912 Updated 2 months ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis ☆1,817 Updated last week
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits f… ☆1,259 Updated 4 months ago
- ☆714 Updated 3 weeks ago
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open ☆5,435 Updated 2 months ago
- ☆1,882 Updated 5 months ago
- Generate music based on natural language prompts using LLMs running locally ☆1,133 Updated 6 months ago
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI Foley master: add vivid, synchronized sound effects to your silent videos 😝 ☆618 Updated last year
- ☆514 Updated last week
- Midi event transformer for symbolic music generation ☆309 Updated 8 months ago
- ☆631 Updated last month
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models ☆460 Updated 2 weeks ago
- Interface for OuteTTS models. ☆1,366 Updated 2 months ago
- CVPR2025 ☆888 Updated 3 months ago
- Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields ☆818 Updated last month
- Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model ☆1,997 Updated this week
- ☆1,813 Updated 2 months ago
- Generative models for conditional audio generation ☆3,411 Updated last month
- ☆269 Updated last year
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆307 Updated last month
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆626 Updated last year
- A curated compilation of AI-driven generative music resources and projects. Explore the blend of machine learning algorithms and musical … ☆387 Updated last year
- first base model for full-duplex conversational audio ☆1,752 Updated 7 months ago
- Text-to-Music Generation with Rectified Flow Transformers ☆1,709 Updated 8 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ☆2,276 Updated this week