Blaizzy / mlx-audioLinks
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
☆5,842Updated this week
Alternatives and similar repositories for mlx-audio
Users that are interested in mlx-audio are comparing it to the libraries listed below
Sorting:
- Simultaneous speech-to-text model☆9,644Updated 3 weeks ago
- Run LLMs with MLX☆3,577Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆2,108Updated this week
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,832Updated 2 weeks ago
- The python library for real-time communication☆4,519Updated 3 weeks ago
- On-device TTS model by Neuphonic☆4,768Updated last week
- Towards Human-Sounding Speech☆5,935Updated 2 months ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆5,715Updated 2 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆3,128Updated 2 months ago
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation☆4,477Updated 7 months ago
- https://hf.co/hexgrad/Kokoro-82M☆5,574Updated 6 months ago
- A fast multimodal LLM for real-time voice☆4,349Updated last month
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,582Updated last week
- An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Co…☆5,996Updated 2 months ago
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai…☆3,256Updated last month
- AI edge infrastructure for macOS. Run local or cloud models, share tools across apps via MCP, and power AI workflows with a native, alway…☆3,339Updated this week
- Make Mac apps accessible for AI agents☆1,751Updated 11 months ago
- Local-first AI Notepad for Private Meetings☆7,674Updated this week
- SoTA open-source TTS☆22,346Updated last week
- Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full…☆12,404Updated this week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆4,384Updated last month
- MLX native implementations of state-of-the-art generative image models☆1,807Updated this week
- Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…☆6,994Updated this week
- Send a phone call from AI agent, in an API call. Or, directly call the bot from the configured phone number!☆6,167Updated 3 months ago
- An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.☆797Updated last month
- Have a natural, spoken conversation with AI!☆3,506Updated 6 months ago
- On-device Speech Recognition for Apple Silicon☆5,574Updated last week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I…☆661Updated last month
- Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.☆4,909Updated 3 weeks ago
- Generate audiobooks from EPUBs, PDFs and text with synchronized captions.☆4,102Updated last month