neuphonic / neutts-airLinks
On-device TTS model by Neuphonic
☆3,965Updated last week
Alternatives and similar repositories for neutts-air
Users that are interested in neutts-air are comparing it to the libraries listed below
Sorting:
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,588Updated last month
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…☆680Updated 3 weeks ago
- Make text LLMs listen and speak☆966Updated last week
- Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages☆1,058Updated this week
- Optimized Whisper models for streaming and on-device use☆507Updated this week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆598Updated 4 months ago
- Build, enrich, and transform datasets using AI models with no code☆1,559Updated 3 weeks ago
- ☆635Updated this week
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,322Updated 7 months ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆2,067Updated last month
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,022Updated this week
- ☆2,131Updated last week
- Interface for OuteTTS models.☆1,400Updated 4 months ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆899Updated 2 months ago
- Open-Source Memory Engine for LLMs, AI Agents & Multi-Agent Systems☆1,663Updated this week
- An open-source implementation of Whisper☆455Updated 2 weeks ago
- SoTA open-source TTS☆14,470Updated last month
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…☆2,858Updated last month
- ☆253Updated 2 weeks ago
- ☆527Updated last month
- Open-source framework for developing real-time multimodal conversational AI agents.☆522Updated this week
- Lemonade helps users run local LLMs with the highest performance by configuring state-of-the-art inference engines for their NPUs and GPU…☆1,590Updated last week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,901Updated this week
- The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly during meetings, interviews, a…☆1,187Updated this week
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆689Updated 4 months ago
- Run Orpheus 3B Locally With LM Studio☆485Updated 7 months ago
- Frontier Open-Source Text-to-Speech☆9,887Updated 2 months ago
- Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.☆805Updated 4 months ago
- AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/B…☆2,378Updated 3 weeks ago
- ☆1,013Updated last month