lucasnewman / e2-tts-mlx
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆18Updated last month
Related projects ⓘ
Alternatives and complementary repositories for e2-tts-mlx
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆23Updated 3 weeks ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆13Updated last week
- ☆61Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated this week
- Find out why your CoreML model isn't running on the Neural Engine!☆24Updated 4 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆38Updated last month
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆81Updated last month
- Implementation of F5-TTS in Swift using MLX☆39Updated 3 weeks ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 5 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 5 months ago
- Training Models Daily☆17Updated 10 months ago
- ☆107Updated last year
- Joint speech-language model - respond directly to audio!☆30Updated 5 months ago
- Text-To-Speech for NotebookLM☆16Updated last week
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning☆12Updated this week
- Supervoice diffusion enhance☆25Updated 3 months ago
- Collection of scripts from mHuBERT-147.☆22Updated 4 months ago
- Codebase and project page for EDMSound☆29Updated 11 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆71Updated 9 months ago
- MLX Swift implementation of Andrej Karpathy's Let's build GPT video☆53Updated 6 months ago
- Real-time end-to-end singing voice convertion☆18Updated last week
- VALL-E 2 reproduction☆83Updated 3 months ago
- ☆25Updated 10 months ago
- Examples of apps built with Nendo, the AI Audio Tool Suite☆56Updated 8 months ago
- Profile your CoreML models directly from Python 🐍☆23Updated 3 weeks ago
- Gradio Client in Rust.☆23Updated last month
- ☆34Updated 6 months ago
- CLIP-Finder enables semantic offline searches of images from gallery photos using natural language descriptions or the camera. Built on A…☆60Updated 3 months ago
- a new family of super small music generation models focusing on experimental music and latent space exploration capabilities☆34Updated 6 months ago
- Speaker Diarization with Transformers☆59Updated 5 months ago