neuphonic / neutts-air
On-device TTS model by Neuphonic
☆4,093Updated 2 weeks ago
Alternatives and similar repositories for neutts-air
Users who are interested in neutts-air are comparing it to the libraries listed below.
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,616Updated last week
- Lightning-fast, on-device TTS — running natively via ONNX.☆1,579Updated last week
- TTS model capable of streaming conversational audio in realtime.☆631Updated last week
- Make text LLMs listen and speak☆1,008Updated 2 weeks ago
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…☆698Updated last month
- Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages☆2,358Updated 2 weeks ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆2,221Updated last month
- ☆2,197Updated last week
- Optimized Whisper models for streaming and on-device use☆681Updated this week
- Build, enrich, and transform datasets using AI models with no code☆1,586Updated last month
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆2,979Updated this week
- Frontier Open-Source Text-to-Speech☆10,088Updated 3 months ago
- Interface for OuteTTS models.☆1,414Updated 5 months ago
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)☆778Updated this week
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆934Updated 2 weeks ago
- Towards Human-Sounding Speech☆5,781Updated 7 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,332Updated 7 months ago
- ☆635Updated 3 weeks ago
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆692Updated 4 months ago
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆3,000Updated 4 months ago
- Generate code from the terminal!☆2,599Updated this week
- Have a natural, spoken conversation with AI!☆3,372Updated 4 months ago
- Recursive-Open-Meta-Agent v0.1 (Beta). A meta-agent framework to build high-performance multi-agent systems.☆4,594Updated this week
- OCR model that handles complex tables, forms, handwriting with full layout.☆3,036Updated 2 weeks ago
- An open-source implementation of Whisper☆466Updated last month
- ☆530Updated 2 months ago
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…☆3,018Updated last month
- State-of-the-art TTS model under 25MB 😻☆9,162Updated 3 months ago
- Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows☆1,255Updated this week
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,042Updated this week