ElectricAlexis / NotaGen
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
☆968Updated 3 weeks ago
Alternatives and similar repositories for NotaGen
Users that are interested in NotaGen are comparing it to the libraries listed below
Sorting:
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion☆1,565Updated this week
- ☆786Updated last week
- ACE-Step: A Step Towards Music Generation Foundation Model☆1,766Updated this week
- InspireMusic: A Unified Framework for Music, Song, Audio Generation.☆1,086Updated this week
- YuE: Open Full-song Generation Foundation for the GPU Poor☆385Updated 3 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆717Updated 2 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,024Updated 3 weeks ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,486Updated this week
- Interface for OuteTTS models.☆1,214Updated 2 weeks ago
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆375Updated last week
- OpenMusic: SOTA Text-to-music (TTM) Generation☆559Updated 2 weeks ago
- YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open☆4,945Updated this week
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝☆578Updated 9 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆269Updated 3 weeks ago
- Implementation of F5-TTS in MLX☆535Updated last month
- ☆1,530Updated last month
- Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields☆786Updated 2 weeks ago
- An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applica…☆740Updated 3 weeks ago
- Awesome music generation model——MG²☆154Updated last month
- ☆742Updated 2 months ago
- Run Orpheus 3B Locally With LM Studio☆396Updated last month
- Sesame CSM 1B Voice Cloning☆293Updated last month
- ☆223Updated last month
- A Fast TTS Engine☆495Updated 3 months ago
- ☆289Updated last week
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆331Updated last week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆335Updated 3 weeks ago
- first base model for full-duplex conversational audio☆1,738Updated 4 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆902Updated 6 months ago
- FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,073Updated this week