istupakov / onnx-asrLinks
A lightweight Python package for Automatic Speech Recognition using ONNX models
☆204Updated last week
Alternatives and similar repositories for onnx-asr
Users that are interested in onnx-asr are comparing it to the libraries listed below
Sorting:
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆218Updated 8 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆253Updated 6 months ago
- ☆319Updated last year
- ☆60Updated 3 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 6 months ago
- A random walk voice style cloning application for Kokoro text to speech☆196Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆331Updated this week
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆127Updated 5 months ago
- Open TTS models, built for streaming on the edge☆44Updated 9 months ago
- ☆345Updated 3 months ago
- ☆382Updated last year
- ☆294Updated 5 months ago
- G2P☆383Updated 4 months ago
- Very fast, accurate speaker diarization☆203Updated this week
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆181Updated 2 months ago
- Fast audio super resolution from 16khz to 48khz.☆167Updated this week
- ☆339Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- ☆158Updated 3 weeks ago
- Python bindings for whisper.cpp☆313Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆292Updated 7 months ago
- ☆52Updated last week
- whisper.cpp bindings for python☆108Updated 2 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆131Updated 4 months ago
- ☆100Updated last year
- Collection of Open Source Speech Data☆164Updated 3 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆196Updated 8 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆142Updated 7 months ago