istupakov / onnx-asr
Automatic Speech Recognition in Python using ONNX models
☆130Updated last month
Alternatives and similar repositories for onnx-asr
Users that are interested in onnx-asr are comparing it to the libraries listed below
- ☆310Updated last year
- A random walk voice style cloning application for Kokoro text to speech☆142Updated 3 months ago
- ☆378Updated last year
- ☆61Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆213Updated 5 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆102Updated 3 months ago
- Python bindings for whisper.cpp☆292Updated 3 weeks ago
- ☆59Updated 3 weeks ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆129Updated 4 months ago
- ☆133Updated 2 weeks ago
- Text To Speech Synthesis with Vosk☆218Updated last month
- Joint speech-language model - respond directly to audio!☆372Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- A simple IPA phonemizer based on ruaccent-encoder☆24Updated 5 months ago
- Collection of Open Source Speech Data☆160Updated last week
- G2P☆323Updated 2 months ago
- A simple text normalizer for use before speech synthesis☆40Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆125Updated 2 months ago
- Finetune VITS and MMS using HuggingFace's tools☆166Updated last year
- ☆282Updated 2 months ago
- whisper.cpp bindings for python☆106Updated 2 years ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆185Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last month
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆284Updated 4 months ago
- Normalize Text in Russian☆28Updated last year
- Very fast, accurate speaker diarization☆145Updated last week
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated 11 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines☆471Updated last year
- Streaming and Fine-tuning for Chatterbox TTS☆193Updated 3 months ago