ElectricAlexis / NotaGen
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
☆840 Updated last week
Alternatives and similar repositories for NotaGen:
Users who are interested in NotaGen are comparing it to the repositories listed below:
- InspireMusic: A Unified Framework for Music, Song, Audio Generation. ☆1,010 Updated last week
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion ☆1,268 Updated this week
- YuE: Open Full-song Generation Foundation for the GPU Poor ☆350 Updated last month
- ☆289 Updated 2 weeks ago
- ☆692 Updated this week
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching ☆690 Updated 3 weeks ago
- OpenMusic: SOTA Text-to-music (TTM) Generation ☆543 Updated last month
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆926 Updated last month
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open ☆4,547 Updated last week
- Implementation of F5-TTS in MLX ☆509 Updated last week
- Interface for OuteTTS models. ☆957 Updated last month
- [CVPR 2025] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis ☆1,237 Updated 2 weeks ago
- first base model for full-duplex conversational audio ☆1,725 Updated 2 months ago
- A Fast TTS Engine ☆471 Updated 2 months ago
- A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple S… ☆366 Updated last week
- Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields ☆708 Updated this week
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆261 Updated 3 weeks ago
- Generate music based on natural language prompts using LLMs running locally ☆925 Updated last month
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models ☆331 Updated 2 weeks ago
- Transform PDFs into AI podcasts for engaging on-the-go audio content. ☆599 Updated this week
- TTS Towards Human-Sounding Speech ☆3,162 Updated this week
- Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds ☆862 Updated this week
- 🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity ☆767 Updated last week
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,213 Updated this week
- Official implementation of SVFR. ☆778 Updated 2 months ago
- ☆715 Updated last month
- Convert any PDF into a podcast episode! ☆705 Updated last week
- ☆591 Updated last week
- Text-to-Music Generation with Rectified Flow Transformers ☆1,681 Updated 3 months ago
- CVPR2025 ☆810 Updated last week