JosefAlbers / e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆23Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for e2tts-mlx
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆18Updated last month
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆71Updated 9 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆81Updated last month
- ☆61Updated 3 months ago
- ☆107Updated last year
- Implementation of F5-TTS in Swift using MLX☆39Updated 3 weeks ago
- Text-to-Music Generation with Rectified Flow Transformer☆45Updated 2 months ago
- Sing an idea ➡️ AI music sample🔥🎶☆90Updated 6 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 4 months ago
- VALL-E 2 reproduction☆83Updated 3 months ago
- Running the F5-TTS by ONNX Runtime☆27Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated last week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆135Updated 3 months ago
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning☆12Updated last week
- ASR + diarization model server with speculative decoding☆49Updated 5 months ago
- Collection of Open Source Speech Data☆144Updated this week
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 5 months ago
- Gradio UI for a Cog API☆64Updated 7 months ago
- ☆87Updated 6 months ago
- Swift implementation of Flux.1 using mlx-swift☆63Updated last week
- Cog wrapper for collabora/WhisperSpeech☆24Updated 8 months ago
- Supervoice diffusion enhance☆25Updated 3 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆38Updated last month
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆13Updated 2 weeks ago
- Scripts to create your own moe models using mlx☆86Updated 8 months ago
- ☆34Updated 6 months ago
- Python tools for WhisperKit: Model conversion, optimization and evaluation☆167Updated this week
- Find out why your CoreML model isn't running on the Neural Engine!☆24Updated 4 months ago
- Blazing fast whisper turbo for ASR (speech-to-text) tasks☆147Updated 3 weeks ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆16Updated last month