istupakov / onnx-asr
Automatic Speech Recognition in Python using ONNX models
☆172Updated this week
Alternatives and similar repositories for onnx-asr
Users who are interested in onnx-asr are comparing it to the libraries listed below.
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆216Updated 7 months ago
- ☆318Updated last year
- A random walk voice style cloning application for Kokoro text to speech☆184Updated 5 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 5 months ago
- Python bindings for whisper.cpp☆303Updated 2 weeks ago
- ☆382Updated last year
- whisper.cpp bindings for python☆108Updated 2 years ago
- Streaming and Fine-tuning for Chatterbox TTS☆229Updated 5 months ago
- G2P☆368Updated 4 months ago
- Open TTS models, built for streaming on the edge☆44Updated 8 months ago
- ☆289Updated 4 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆127Updated 4 months ago
- ☆60Updated 3 weeks ago
- A simple IPA phonemizer based on ruaccent-encoder☆24Updated 7 months ago
- Text To Speech Synthesis with Vosk☆228Updated 2 weeks ago
- ☆157Updated 3 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆69Updated last month
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆289Updated 2 months ago
- ☆50Updated last week
- ☆338Updated 2 months ago
- ☆64Updated 10 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- A simple text normalizer for use before speech synthesis☆41Updated last year
- VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency☆174Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆129Updated 4 months ago
- ☆100Updated last year
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆291Updated 6 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆138Updated 6 months ago
- Open-source reproducible benchmarks from Argmax☆70Updated last week