JosefAlbers/e2tts-mlx

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JosefAlbers/e2tts-mlx)

JosefAlbers / e2tts-mlx

Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX

☆29

Alternatives and similar repositories for e2tts-mlx

Users that are interested in e2tts-mlx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucasnewman / e2-tts-mlx
View on GitHub
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆21Oct 8, 2024Updated last year
ArenAcikgoz / Whisper-Alignment
View on GitHub
Forced alignment decoder for Whisper.
☆16Mar 13, 2024Updated 2 years ago
ORI-Muchim / Efficient-Speech
View on GitHub
Lightweight Korean TTS Model based on FastSpeech2
☆15Mar 4, 2026Updated 4 months ago
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago
PecholaL / MAIN-VC
View on GitHub
Lightweight Speech Representation Learning for One-Shot Voice Conversion
☆23Dec 12, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
mzbac / vibevoice.swift
View on GitHub
vibevoice real time 0.5B swift port
☆31Dec 12, 2025Updated 7 months ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
View on GitHub
☆13Mar 23, 2026Updated 4 months ago
duyichao / NPDA-KNN-ST
View on GitHub
Official implementation of EMNLP'2022 paper "Non-Parametric Domain Adaptation for End-to-End Speech Translation"
☆11Oct 26, 2022Updated 3 years ago
bshall / dusted
View on GitHub
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Oct 2, 2024Updated last year
j-csc / mlx_bark
View on GitHub
Port of Suno's Bark TTS transformer in Apple's MLX Framework
☆89Feb 11, 2024Updated 2 years ago
xiaoxue1117 / speech-mamba-public
View on GitHub
☆15Nov 26, 2024Updated last year
fakerybakery / simpletts
View on GitHub
A lightweight Python library for running TTS models with a unified API.
☆20Feb 18, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jeffrey-fong / Invoker
View on GitHub
The one who calls upon functions - Function-Calling Language Model
☆36Oct 2, 2023Updated 2 years ago
lucasnewman / f5-tts-mlx
View on GitHub
Implementation of F5-TTS in MLX
☆644Mar 19, 2025Updated last year
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
kyegomez / USM
View on GitHub
Implementation of Google's USM speech model in Pytorch
☆36Jul 20, 2026Updated last week
vincentamato / mlx-coconut
View on GitHub
An MLX port of Meta's Coconut reasoning model
☆16Sep 2, 2025Updated 10 months ago
ferologics / Piwork
View on GitHub
Work with Pi
☆15Feb 9, 2026Updated 5 months ago
johnmartinsson / differentiable-mel-spectrogram
View on GitHub
The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …
☆24Dec 21, 2024Updated last year
thamquocdung / eCMU
View on GitHub
eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)
☆10Oct 30, 2024Updated last year
alexisdmacintyre / SpeechBreathingToolbox
View on GitHub
Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.
☆11Feb 17, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
lavendery / UUG
View on GitHub
☆21Sep 14, 2025Updated 10 months ago
projectlucas / efficient_whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆19Dec 1, 2022Updated 3 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
YoshikiMas / madeon-asr
View on GitHub
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆19Dec 1, 2024Updated last year
AI-S2-Lab / GPT-Talker
View on GitHub
[ACMMM'2024] Generative Expressive Conversational Speech Synthesis
☆45Oct 28, 2024Updated last year
apple / ml-acn-embed
View on GitHub
Acoustic Neighbor Embeddings
☆33Jul 13, 2025Updated last year
unixpickle / honeycrisp
View on GitHub
🍎 A Swift framework for deep learning on Apple Silicon
☆33May 26, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wangzhaode / mnn-asr
View on GitHub
mnn asr demo.
☆27Mar 24, 2025Updated last year
xmos / sln_voice
View on GitHub
XCORE-VOICE Solution
☆20Apr 8, 2026Updated 3 months ago
nineninesix-ai / KaniTTS-Finetune-pipeline
View on GitHub
☆27Nov 3, 2025Updated 8 months ago
stockeh / mlx-optimizers
View on GitHub
A collection of optimizers for MLX
☆57Dec 12, 2025Updated 7 months ago
lucadellalib / audiocodecs
View on GitHub
A collections of audio codecs with a standardized API
☆43Apr 15, 2026Updated 3 months ago
R1ckShi / FrontEnd-AEC
View on GitHub
Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.
☆19Apr 22, 2019Updated 7 years ago
zjlww / ardit-web
View on GitHub
☆27Aug 2, 2024Updated last year