JosefAlbers / e2tts-mlx
Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX
☆27Updated 6 months ago
Alternatives and similar repositories for e2tts-mlx:
Users that are interested in e2tts-mlx are comparing it to the libraries listed below
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 6 months ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆80Updated last year
- ☆62Updated 9 months ago
- Open TTS models, built for streaming on the edge☆41Updated last month
- StyleTTS 2 Optimized Training Fork☆28Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- Audio tokenization, in the fastest way possible!☆51Updated 8 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆141Updated 3 weeks ago
- TTS support with GGML☆32Updated this week
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 11 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆16Updated last month
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆17Updated 6 months ago
- Supervoice diffusion enhance☆26Updated 9 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated last month
- Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture☆26Updated 3 weeks ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆28Updated last year
- A simple, hackable text-to-speech system in PyTorch and MLX☆154Updated 2 months ago
- VoiceBox neural network implementation☆106Updated 9 months ago
- ☆89Updated last month
- Open-source and reproducible benchmarks for Speaker Diarization☆23Updated 2 weeks ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆14Updated last year
- Speaker Diarization with Transformers☆64Updated 11 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 6 months ago
- A lightweight Python library for running TTS models with a unified API.☆18Updated 2 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆53Updated 3 weeks ago
- A little file for doing LLM-assisted prompt expansion and image generation using Flux.schnell - complete with prompt history, prompt queu…☆26Updated 8 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆34Updated 2 weeks ago
- High quality text-to-speech based on StyleTTS 2.☆37Updated this week
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year