Blaizzy / mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
☆2,263 · Updated this week
Alternatives and similar repositories for mlx-audio
Users interested in mlx-audio are comparing it to the libraries listed below.
- AI-powered multi-agent builder ☆2,921 · Updated this week
- SoTA open-source TTS ☆2,296 · Updated this week
- The python library for real-time communication ☆3,970 · Updated last week
- Make Mac apps accessible for AI agents ☆1,065 · Updated 2 months ago
- Airweave lets agents search any app ☆2,471 · Updated this week
- Claude can perform Web Search | Exa with MCP (Model Context Protocol) ☆1,641 · Updated last week
- c/ua is the Docker Container for Computer-Use AI Agents. ☆8,005 · Updated last week
- Privacy-first AI Notepad for back-to-back meetings ☆2,320 · Updated this week
- Towards Human-Sounding Speech ☆4,842 · Updated 3 weeks ago
- Lightweight coding agent that runs in your terminal ☆1,821 · Updated 3 weeks ago
- Open Source Alternative to NotebookLM / Perplexity / Glean, connected to external sources such as search engines (Tavily, Linkup), Slack,… ☆4,603 · Updated last week
- Connect Supabase to your AI assistants ☆1,481 · Updated 2 weeks ago
- Agent S: an open agentic framework that uses computers like a human ☆5,221 · Updated last week
- A fast multimodal LLM for real-time voice ☆3,968 · Updated 3 months ago
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web ☆2,195 · Updated last week
- Official repository for LTX-Video ☆6,281 · Updated this week
- Interface for OuteTTS models. ☆1,283 · Updated this week
- Allow LLMs to control a browser with Browserbase and Stagehand ☆1,760 · Updated last week
- Collection of apple-native tools for the model context protocol. ☆1,624 · Updated last month
- Have a natural, spoken conversation with AI! ☆2,375 · Updated 2 weeks ago
- ☆2,226 · Updated this week
- An implementation of the CSM (Conversation Speech Model) for Apple Silicon using MLX. ☆344 · Updated 2 weeks ago
- TTS with kokoro and onnx runtime ☆2,008 · Updated 3 weeks ago
- ⚙️ Create and run workflows (RPA 2.0) ☆3,004 · Updated this week
- Open source multi-modal RAG for building AI apps over private knowledge. ☆2,385 · Updated this week
- This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025 ☆3,965 · Updated 3 weeks ago
- Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer… ☆3,329 · Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,293 · Updated this week
- Fast and accurate automatic speech recognition (ASR) for edge devices ☆2,718 · Updated 2 weeks ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f… ☆1,054 · Updated last month