Blaizzy / mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
☆2,470 Updated this week
Alternatives and similar repositories for mlx-audio
Users interested in mlx-audio are comparing it to the libraries listed below.
- Secure AI-powered meeting notetaker that runs on your device and keeps your data private. ☆2,592 Updated this week
- Make Mac apps accessible for AI agents ☆1,402 Updated 4 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework. ☆1,657 Updated this week
- AI-powered multi-agent builder ☆3,306 Updated this week
- Towards Human-Sounding Speech ☆5,196 Updated 2 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,222 Updated 2 months ago
- A fast multimodal LLM for real-time voice ☆4,087 Updated this week
- The python library for real-time communication ☆4,115 Updated this week
- Open Source Alternative to NotebookLM / Perplexity / Glean, connected to external sources such as search engines (Tavily, Linkup), Slack,… ☆5,883 Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,470 Updated this week
- c/ua is the Docker Container for Computer-Use AI Agents. ☆8,945 Updated this week
- SoTA open-source TTS ☆9,264 Updated 3 weeks ago
- Local realtime voice AI ☆2,330 Updated 4 months ago
- Agent S: an open agentic framework that uses computers like a human ☆5,679 Updated 3 weeks ago
- StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language mo… ☆3,939 Updated 2 months ago
- Airweave lets agents search any app ☆2,739 Updated this week
- Run LLMs with MLX ☆1,276 Updated this week
- A community driven registry service for Model Context Protocol (MCP) servers. ☆1,858 Updated this week
- Real Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗 ☆665 Updated this week
- Building blocks for rapid development of GenAI applications ☆1,486 Updated this week
- Exa is a Web Search API | This is Exa MCP (Model Context Protocol) ☆1,805 Updated last week
- https://hf.co/hexgrad/Kokoro-82M ☆3,484 Updated last week
- No need to switch browsers, just use Dia on Chrome or on Arc. ☆1,099 Updated last week
- Lightweight coding agent that runs in your terminal ☆1,900 Updated 2 months ago
- ☆1,287 Updated 2 months ago
- Open Source Application for Advanced LLM + Diffusion Engineering: interact, train, fine-tune, and evaluate large language models on your … ☆3,519 Updated last week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching ☆3,197 Updated last week
- ACI.dev is the open source tool-calling platform that hooks up 600+ tools into any agentic IDE or custom AI agent through direct function… ☆4,268 Updated this week
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ☆2,297 Updated last month
- Implementation of F5-TTS in MLX ☆560 Updated 3 months ago