lucasnewman / e2-tts-mlx
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆21Oct 8, 2024Updated last year
Alternatives and similar repositories for e2-tts-mlx
Users who are interested in e2-tts-mlx compare it to the libraries listed below:
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆29Oct 15, 2024Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Oct 30, 2024Updated last year
- ☆70Sep 3, 2024Updated last year
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago
- ☆14Aug 1, 2025Updated 6 months ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆33Feb 2, 2025Updated last year
- PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.☆63Sep 8, 2025Updated 5 months ago
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated last month
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- 🧠 Retrieval Augmented Generation (RAG) example☆19Aug 18, 2025Updated 5 months ago
- Python library with a set of tools for simple debugging of Python programs☆19Apr 17, 2023Updated 2 years ago
- ☆25Oct 7, 2025Updated 4 months ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆63Jan 7, 2023Updated 3 years ago
- vibevoice real time 0.5B swift port☆28Dec 12, 2025Updated 2 months ago
- ☆16Dec 12, 2023Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Introduction to MLX for Swift developers☆45Jun 23, 2025Updated 7 months ago
- End-to-end speech synthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
- A simple, hackable text-to-speech system in PyTorch and MLX☆186Aug 3, 2025Updated 6 months ago
- ☆25Jan 21, 2026Updated 3 weeks ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", which was accepted at EUSIPCO 2022☆15Jun 18, 2022Updated 3 years ago
- ☆20Jul 13, 2022Updated 3 years ago
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- GUI tools for WORLD vocoder☆22Dec 19, 2024Updated last year
- ☆55Nov 5, 2024Updated last year
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Dec 12, 2025Updated 2 months ago
- Implementation of F5-TTS in MLX☆606Mar 19, 2025Updated 10 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer"☆21Jun 23, 2022Updated 3 years ago
- Unofficial vits2-TTS implementation in PyTorch, adapted to support 44100 Hz Japanese audio sources.☆24Sep 1, 2023Updated 2 years ago
- ☆24Sep 27, 2022Updated 3 years ago
- Perceptual image hashing in the browser without using HTML canvas☆23Sep 23, 2021Updated 4 years ago