istupakov / onnx-asrLinks
Automatic Speech Recognition in Python using ONNX models
☆51Updated 3 weeks ago
Alternatives and similar repositories for onnx-asr
Users that are interested in onnx-asr are comparing it to the libraries listed below
Sorting:
- Простой нормализатор текстов перед синтезом речи☆33Updated last year
- ☆43Updated last week
- Простой IPA фонемизатор на базе ruaccent-encoder☆21Updated 2 months ago
- Tools and agents for automated research.☆28Updated 2 weeks ago
- Normalize Text in Russian☆27Updated last year
- Framework for processing and filtering datasets☆27Updated 10 months ago
- ☆104Updated 3 weeks ago
- ☆365Updated 9 months ago
- ☆52Updated 4 months ago
- Russian open TTS dataset☆15Updated 5 years ago
- Text To Speech Synthesis with Vosk☆195Updated last month
- ☆26Updated 3 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆96Updated last week
- Open TTS models, built for streaming on the edge☆43Updated 3 months ago
- Use quantized versions of Whisper to speed up inference☆12Updated 8 months ago
- ☆296Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆110Updated last month
- Delayed Streams Modeling (DSM) is a flexible formulation for streaming, multimodal sequence-to-sequence learning.☆211Updated this week
- T5-based (russian) text normalization☆21Updated last year
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language☆42Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆84Updated last month
- ☆31Updated 9 months ago
- ☆54Updated 4 months ago
- Simple audio AE☆12Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- VALL-E 2 reproduction☆129Updated 11 months ago
- MERA (Multimodal Evaluation for Russian-language Architectures) is a new open benchmark for the Russian language for evaluating fundament…☆61Updated 8 months ago
- Простой расстановщик ударений с обработкой омографов☆128Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆174Updated 2 months ago
- ☆151Updated 6 months ago