neuphonic / neutts
On-device TTS model by Neuphonic
☆4,718Updated 3 weeks ago
Alternatives and similar repositories for neutts
Users who are interested in neutts are comparing it to the libraries listed below.
- TTS model capable of streaming conversational audio in realtime.☆1,027Updated 2 months ago
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.☆2,552Updated 2 weeks ago
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆5,715Updated last week
- Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages☆2,620Updated last month
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,822Updated last week
- A TTS that fits in your CPU (and pocket)☆2,683Updated last week
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai…☆3,256Updated last month
- PersonaPlex code.☆4,504Updated last week
- Make text LLMs listen and speak☆1,152Updated last week
- Soprano: Instant, Ultra-Realistic Text-to-Speech☆1,137Updated 3 weeks ago
- Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…☆6,204Updated last week
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.☆632Updated last week
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…☆800Updated 3 months ago
- AirLLM 70B inference with single 4GB GPU☆2,193Updated 5 months ago
- Controllable and fast Text-to-Speech for over 7000 languages!☆2,163Updated last week
- Optimized Whisper models for streaming and on-device use☆816Updated this week
- ☆2,271Updated 2 months ago
- State-of-the-art TTS model under 25MB 😻☆9,494Updated 5 months ago
- SoTA open-source TTS☆22,024Updated last month
- ☆385Updated 3 months ago
- PageLM is a community-driven version of NotebookLM & an education platform that transforms study materials into interactive resources like…☆1,253Updated 2 months ago
- OCR model that handles complex tables, forms, handwriting with full layout.☆4,733Updated 3 weeks ago
- Interface for OuteTTS models.☆1,421Updated 7 months ago
- ☆511Updated last week
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆1,148Updated last month
- Build, enrich, and transform datasets using AI models with no code☆1,620Updated 3 months ago
- Build AI applications that can see, hear, and speak using your screens, microphones, and cameras as inputs.☆1,078Updated last month
- Controllable and fast Text-to-Speech for over 7000 languages!☆322Updated 7 months ago
- Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…☆1,368Updated 9 months ago
- Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than process…☆557Updated this week