JosefAlbers / e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆23Updated last month
Related projects ⓘ
Alternatives and complementary repositories for e2tts-mlx
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆18Updated last month
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆71Updated 9 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆84Updated last month
- ☆61Updated 3 months ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated 2 weeks ago
- Collection of Open Source Speech Data☆146Updated last week
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning☆12Updated 2 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆138Updated 4 months ago
- ☆87Updated 6 months ago
- Speaker Diarization with Transformers☆59Updated 6 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆55Updated last week
- Cog wrapper for collabora/WhisperSpeech☆25Updated 8 months ago
- VoiceBox neural network implementation☆96Updated 3 months ago
- Implementation of F5-TTS in Swift using MLX☆41Updated last month
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆43Updated last month
- Joint speech-language model - respond directly to audio!☆30Updated 6 months ago
- VALL-E 2 reproduction☆87Updated 4 months ago
- Supervoice diffusion enhance☆24Updated 4 months ago
- All the world is a play, we are but actors in it.☆47Updated 4 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆222Updated 2 months ago
- Audio tokenization, in the fastest way possible!☆45Updated 2 months ago
- [WIP] AI Try-On plugin for Chrome☆25Updated 8 months ago
- Video+code lecture on building nanoGPT from scratch☆64Updated 5 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆39Updated 3 weeks ago
- ASR + diarization model server with speculative decoding☆50Updated 5 months ago
- Scripts to create your own moe models using mlx☆86Updated 8 months ago
- A ggml (C++) re-implementation of tortoise-tts☆159Updated 3 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆13Updated 3 weeks ago
- Grammar checker with a keyboard shortcut for Ollama and Apple MLX with Automator on macOS.☆76Updated 9 months ago