istupakov / onnx-asr
Automatic Speech Recognition in Python using ONNX models
☆106Updated 2 weeks ago
Alternatives and similar repositories for onnx-asr
Users who are interested in onnx-asr are comparing it to the libraries listed below.
- ☆307Updated last year
- ☆123Updated this week
- ☆377Updated 11 months ago
- Finetune VITS and MMS using HuggingFace's tools☆162Updated last year
- ☆53Updated 3 weeks ago
- Speaker Diarization with Transformers☆69Updated 2 months ago
- ☆59Updated 7 months ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆122Updated 3 months ago
- A simple IPA phonemizer based on ruaccent-encoder☆23Updated 4 months ago
- Normalize Text in Russian☆27Updated last year
- Python bindings for whisper.cpp☆282Updated last week
- Text To Speech Synthesis with Vosk☆206Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆201Updated 4 months ago
- ☆38Updated 3 years ago
- Open TTS models, built for streaming on the edge☆43Updated 5 months ago
- Collection of Open Source Speech Data☆159Updated 9 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆30Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment☆249Updated 2 years ago
- A simple text normalizer for use before speech synthesis☆37Updated last year
- Open-source reproducible benchmarks from Argmax☆53Updated this week
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated 10 months ago
- whisper.cpp bindings for python☆101Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆99Updated 2 months ago
- Use quantized versions of Whisper to speed up inference☆12Updated 10 months ago
- ☆28Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆67Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆119Updated last month
- ☆359Updated last year
- A simple, hackable text-to-speech system in PyTorch and MLX☆172Updated 3 weeks ago