neuphonic / neutts-air

On-device TTS model by Neuphonic

☆4,273

Alternatives and similar repositories for neutts-air

Users who are interested in neutts-air are comparing it to the libraries listed below.

supertone-inc / supertonic
Lightning-Fast, On-Device TTS — running natively via ONNX.
☆1,891 Updated last week
nari-labs / dia2
TTS model capable of streaming conversational audio in realtime.
☆920 Updated 3 weeks ago
kyutai-labs / delayed-streams-modeling
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
☆2,663 Updated last month
facebookresearch / omnilingual-asr
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
☆2,504 Updated last week
kyutai-labs / unmute
Make text LLMs listen and speak
☆1,044 Updated 2 weeks ago
OpenBMB / VoxCPM
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
☆3,098 Updated this week
TheStageAI / TheWhisper
Optimized Whisper models for streaming and on-device use
☆772 Updated last week
facebookresearch / sam-audio
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai…
☆2,506 Updated this week
QwenLM / Qwen3-ASR-Toolkit
Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…
☆727 Updated 2 months ago
edwko / OuteTTS
Interface for OuteTTS models.
☆1,419 Updated 6 months ago
lemonade-sdk / lemonade
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https…
☆1,882 Updated last week
Mega4alik / ollm
☆2,233 Updated 3 weeks ago
fluxions-ai / vui
☆635 Updated last month
Lex-au / Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆623 Updated 5 months ago
OHF-Voice / piper1-gpl
Fast and local neural text-to-speech engine
☆2,132 Updated last month
huggingface / aisheets
Build, enrich, and transform datasets using AI models with no code
☆1,608 Updated 2 months ago
gabber-dev / gabber
Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.
☆1,063 Updated this week
sofdog-gh / realtime-transcription-fastrtc
Real-Time Speech Transcription with FastRTC ⚡️ and Local Whisper 🤗
☆691 Updated 5 months ago
playht / PlayDiffusion
☆532 Updated 2 months ago
magenta / magenta-realtime
☆943 Updated last week
KittenML / KittenTTS
State-of-the-art TTS model under 25MB 😻
☆9,279 Updated 4 months ago
QuentinFuxa / WhisperLiveKit
Simultaneous speech-to-text model
☆9,311 Updated last week
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,345 Updated 8 months ago
nazdridoy / kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…
☆1,005 Updated last week
Blaizzy / mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…
☆3,122 Updated this week
resemble-ai / chatterbox
SoTA open-source TTS
☆16,517 Updated last week
allenai / OLMoASR
An open-source implementation of Whisper
☆469 Updated last month
CaviraOSS / OpenMemory
Local persistent memory store for LLM applications, including Claude Desktop, GitHub Copilot, Codex, Antigravity, etc.
☆2,681 Updated this week
minitap-ai / mobile-use
AI agents can now use real Android and iOS apps, just like a human.
☆2,013 Updated last week
datalab-to / chandra
OCR model that handles complex tables, forms, and handwriting with full layout.
☆4,004 Updated last week