Blaizzy / mlx-audioLinks
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
☆5,842Updated this week
Alternatives and similar repositories for mlx-audio
Users that are interested in mlx-audio are comparing it to the libraries listed below
Sorting:
- Run LLMs with MLX☆3,577Updated this week
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,832Updated 2 weeks ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆2,108Updated this week
- On-device TTS model by Neuphonic☆4,768Updated last week
- Local-first AI Notepad for Private Meetings☆7,674Updated this week
- Connect any LLM to your internal knowledge sources and chat with it in real time alongside your team. OSS alternative to NotebookLM, Perp…☆12,819Updated this week
- Simultaneous speech-to-text model☆9,673Updated 3 weeks ago
- SoTA open-source TTS☆22,346Updated last week
- Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and se…☆4,582Updated last week
- The python library for real-time communication☆4,519Updated 3 weeks ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆5,851Updated 2 weeks ago
- The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usa…☆5,849Updated 3 months ago
- Make Mac apps accessible for AI agents☆1,751Updated 11 months ago
- State-of-the-art TTS model under 25MB 😻☆9,590Updated last week
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,378Updated 9 months ago
- A fast multimodal LLM for real-time voice☆4,349Updated last month
- https://hf.co/hexgrad/Kokoro-82M☆5,574Updated 6 months ago
- Generate code from the terminal!☆2,730Updated this week
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai…☆3,286Updated last month
- A TTS that fits in your CPU (and pocket)☆2,995Updated last week
- Local-first AI coworker, with memory☆4,351Updated this week
- Collection of apple-native tools for the model context protocol.☆2,993Updated 6 months ago
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.☆2,588Updated 2 weeks ago
- Fast and accurate automatic speech recognition (ASR) for edge devices☆3,128Updated 2 months ago
- An implementation of the Nvidia's Parakeet models for Apple Silicon using MLX.☆797Updated last month
- Towards Human-Sounding Speech☆5,935Updated 2 months ago
- AI edge infrastructure for macOS. Run local or cloud models, share tools across apps via MCP, and power AI workflows with a native, alway…☆3,339Updated this week
- A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your …☆3,178Updated 2 months ago
- Voice-to-text app for macOS to transcribe what you say to text almost instantly☆3,626Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval…☆13,011Updated this week