istupakov / onnx-asrLinks
Automatic Speech Recognition in Python using ONNX models
☆68Updated 2 weeks ago
Alternatives and similar repositories for onnx-asr
Users that are interested in onnx-asr are comparing it to the libraries listed below
Sorting:
- Normalize Text in Russian☆27Updated last year
- Простой IPA фонемизатор на базе ruaccent-encoder☆21Updated 3 months ago
- ☆51Updated 2 weeks ago
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- ☆300Updated last year
- ☆106Updated 3 weeks ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆90Updated last month
- Простой нормализатор текстов перед синтезом речи☆33Updated last year
- ☆370Updated 10 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆63Updated last month
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆115Updated last month
- Text To Speech Synthesis with Vosk☆197Updated this week
- Finetune VITS and MMS using HuggingFace's tools☆159Updated last year
- Use quantized versions of Whisper to speed up inference☆12Updated 9 months ago
- Official implementation of the TTS model Lina-Speech☆166Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Collection of Open Source Speech Data☆159Updated 8 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆214Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction☆97Updated last month
- whisper.cpp bindings for python☆98Updated last year
- ☆260Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆189Updated 2 months ago
- ☆157Updated 7 months ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆82Updated 8 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆175Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated 11 months ago
- ☆260Updated last week
- ☆59Updated 5 months ago
- ☆199Updated last month
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆41Updated 2 weeks ago