Blaizzy / mlx-audio
A text-to-speech (TTS) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
☆728 Updated this week
Alternatives and similar repositories for mlx-audio:
Users who are interested in mlx-audio are comparing it to the libraries listed below.
- An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX. ☆328 Updated this week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I… ☆350 Updated 3 weeks ago
- Implementation of F5-TTS in MLX ☆527 Updated last month
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,228 Updated this week
- Generate accurate transcripts using Apple's MLX framework ☆397 Updated last week
- FastMLX is a high performance production ready API to host MLX models. ☆297 Updated last month
- Blazing fast whisper turbo for ASR (speech-to-text) tasks ☆204 Updated 6 months ago
- Make Mac apps accessible for AI agents ☆976 Updated 2 months ago
- Run LLMs with MLX ☆599 Updated this week
- Apple MLX engine for LM Studio ☆535 Updated last week
- 🤖✨ ChatMLX is a modern, open-source, high-performance chat application for macOS based on large language models. ☆781 Updated last month
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX. ☆696 Updated 11 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,013 Updated 3 weeks ago
- Real Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗 ☆639 Updated last month
- On-device Image Generation for Apple Silicon ☆614 Updated 3 weeks ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆544 Updated last month
- An MLX port of FLUX based on the Hugging Face Diffusers implementation. ☆1,343 Updated this week
- Run Orpheus 3B Locally With LM Studio ☆392 Updated last month
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT ☆316 Updated 2 weeks ago
- Interface for OuteTTS models. ☆1,209 Updated last week
- Big & Small LLMs working together ☆733 Updated this week
- An AI cursor for desktop using Gemini 2.0 Flash (Experimental) ☆322 Updated 2 months ago
- The easiest way to run the fastest MLX-based LLMs locally ☆279 Updated 6 months ago
- ☆694 Updated 2 weeks ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆272 Updated 3 weeks ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M. ☆224 Updated 3 months ago
- ☆381 Updated last week
- Collection of Apple-native tools for the Model Context Protocol. ☆1,491 Updated 3 weeks ago
- The official ElevenLabs MCP server ☆647 Updated last week
- Lightweight coding agent that runs in your terminal ☆1,660 Updated this week