lucasnewman / f5-tts-mlxLinks

Implementation of F5-TTS in MLX

☆594

Alternatives and similar repositories for f5-tts-mlx

Users that are interested in f5-tts-mlx are comparing it to the libraries listed below

Sorting:

edwko / OuteTTS
Interface for OuteTTS models.
☆1,400Updated 4 months ago
senstella / csm-mlx
An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.
☆380Updated 3 months ago
mustafaaljadery / lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
☆805Updated last year
isaiahbjork / orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
☆485Updated 7 months ago
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆863Updated 3 months ago
amanvirparhar / weebo
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
☆246Updated 9 months ago
senstella / parakeet-mlx
An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.
☆575Updated last week
JosefAlbers / whisper-turbo-mlx
Blazing fast whisper turbo for ASR (speech-to-text) tasks
☆217Updated last year
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,767Updated 10 months ago
astramind-ai / Auralis
A Fast TTS Engine
☆566Updated 9 months ago
isaiahbjork / csm-voice-cloning
Sesame CSM 1B Voice Cloning
☆324Updated 8 months ago
freddyaboulton / orpheus-cpp
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆343Updated 7 months ago
RayFernando1337 / MLX-Auto-Subtitled-Video-Generator
Generate accurate transcripts using Apple's MLX framework
☆442Updated 6 months ago
argmaxinc / DiffusionKit
On-device Image Generation for Apple Silicon
☆664Updated 7 months ago
revdotcom / reverb
Open source inference code for Rev's model
☆433Updated 6 months ago
argmaxinc / whisperkittools
Python tools for WhisperKit: Model conversion, optimization and evaluation
☆231Updated last week
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆739Updated last year
Lex-au / Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆598Updated 4 months ago
phildougherty / sesame_csm_openai
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆421Updated last month
facebookresearch / spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
☆925Updated last year
Marvis-Labs / marvis-tts
☆300Updated 2 months ago
fluxions-ai / vui
☆635Updated this week
arcee-ai / fastmlx
FastMLX is a high performance production ready API to host MLX models.
☆332Updated 7 months ago
madroidmaq / mlx-omni-server
MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…
☆596Updated 3 weeks ago
pipecat-ai / smart-turn
☆1,013Updated 2 months ago
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,322Updated 7 months ago
ivanfioravanti / qwen-image-mps
Qwen Image models through MPS
☆220Updated last week
davidbrowne17 / csm-streaming
Realtime demo, Streaming and Finetuning code for CSM
☆415Updated 2 months ago
playht / PlayDiffusion
☆527Updated last month
devnen / Dia-TTS-Server
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…
☆329Updated 5 months ago