magenta / magenta-realtime
☆181 Updated this week
Alternatives and similar repositories for magenta-realtime
Users who are interested in magenta-realtime are comparing it to the repositories listed below.
- Streaming and Fine-tuning for Chatterbox TTS ☆99 Updated last week
- Delayed Streams Modeling (DSM) is a flexible formulation for streaming, multimodal sequence-to-sequence learning. ☆211 Updated this week
- Awesome music generation model: MG² ☆157 Updated 2 months ago
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey". ☆206 Updated 9 months ago
- Examples of using the llasa-tts models locally ☆173 Updated 2 months ago
- A random walk voice style cloning application for Kokoro text to speech ☆98 Updated this week
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆173 Updated 2 months ago
- Generative models for conditional audio generation ☆157 Updated 4 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆258 Updated 3 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆84 Updated last month
- ☆113 Updated 3 months ago
- Fine-tune your own MusicGen with LoRA ☆136 Updated last year
- Sing an idea ➡️ AI music sample🔥🎶 ☆110 Updated last year
- YuE with mp3 extend, exllama and GUI ☆53 Updated 3 months ago
- Text-to-Music Generation with Rectified Flow Transformer ☆64 Updated 3 weeks ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆59 Updated last year
- Unified automatic quality assessment for speech, music, and sound. ☆512 Updated 2 weeks ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆98 Updated 3 months ago
- ☆190 Updated last year
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆284 Updated last week
- ☆238 Updated 2 months ago
- Open Audio Watermarking Tool ☆209 Updated last month
- Fine-tune Stable Audio Open with DiT ControlNet. ☆235 Updated last month
- Fourier Dual Diffusion ☆53 Updated this week
- A simple, hackable text-to-speech system in PyTorch and MLX ☆164 Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆104 Updated last month
- A notebook containing scripts, documentation, and examples for finetuning musicgen ☆92 Updated last year
- Metrics for evaluating music and audio generative models, with a focus on long-form, full-band, and stereo generations. ☆228 Updated this week
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆174 Updated 2 months ago
- ☆174 Updated 5 months ago