istupakov / onnx-asr
Automatic Speech Recognition in Python using ONNX models
☆120Updated last month
Alternatives and similar repositories for onnx-asr
Users who are interested in onnx-asr are comparing it to the libraries listed below
- ☆309Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆206Updated 4 months ago
- ☆60Updated 7 months ago
- ☆127Updated 3 weeks ago
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆127Updated 4 months ago
- Collection of Open Source Speech Data☆160Updated this week
- ☆377Updated last year
- ☆55Updated this week
- A simple IPA phonemizer based on ruaccent-encoder☆24Updated 5 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆122Updated last month
- Speaker Diarization with Transformers☆69Updated 3 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆185Updated 4 months ago
- Open-source reproducible benchmarks from Argmax☆58Updated last week
- Normalize Text in Russian☆27Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last week
- Finetune VITS and MMS using HuggingFace's tools☆163Updated last year
- ☆280Updated last month
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆218Updated 4 months ago
- whisper.cpp bindings for python☆101Updated 2 years ago
- Open TTS models, built for streaming on the edge☆42Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆123Updated last month
- Use quantized versions of Whisper to speed up inference☆12Updated 11 months ago
- Text To Speech Synthesis with Vosk☆212Updated 3 weeks ago
- ☆294Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆278Updated 4 months ago
- Official implementation of the TTS model Lina-Speech☆169Updated 8 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆218Updated 10 months ago
- A simple text normalizer for use before speech synthesis☆38Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year