magenta / magenta-realtimeLinks
☆605Updated last week
Alternatives and similar repositories for magenta-realtime
Users that are interested in magenta-realtime are comparing it to the libraries listed below
Sorting:
- ☆498Updated 3 weeks ago
- NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms☆1,057Updated 2 months ago
- ☆620Updated 2 weeks ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆756Updated last month
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆298Updated 3 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆301Updated last week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆261Updated last month
- ☆222Updated last week
- Kyutai with an "eye"☆207Updated 3 months ago
- Sesame CSM 1B Voice Cloning☆312Updated 3 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆414Updated 5 months ago
- Generative models for conditional audio generation☆158Updated 5 months ago
- Examples of using the llasa-tts models locally☆175Updated 2 months ago
- Awesome music generation model——MG²☆159Updated 3 months ago
- Run Orpheus 3B Locally With LM Studio☆432Updated 3 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆97Updated 3 months ago
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey".☆208Updated 10 months ago
- Make text LLMs listen and speak☆512Updated this week
- ☆546Updated this week
- Streaming and Fine-tuning for Chatterbox TTS☆128Updated 3 weeks ago
- Repository of AudioX☆1,027Updated 2 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆175Updated 2 months ago
- ☆257Updated this week
- OpenMusic: SOTA Text-to-music (TTM) Generation☆601Updated 2 weeks ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆277Updated last month
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆245Updated last week
- PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.☆496Updated this week
- Realtime demo, Streaming and Finetuning code for CSM☆341Updated last month
- Unified automatic quality assessment for speech, music, and sound.☆531Updated last month
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open☆69Updated 2 months ago