avryhof / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Updated 2 years ago
Alternatives and similar repositories for speech_recognition:
Users that are interested in speech_recognition are comparing it to the libraries listed below
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- Evaluation of STT models for german language☆15Updated 3 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 5 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆13Updated 2 years ago
- Speaker diarization service☆21Updated last month
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆11Updated 2 months ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆15Updated last year
- ☆11Updated 9 years ago
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆16Updated last week
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Faster Whisper ASR transcription with CTranslate2☆19Updated 3 months ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- ☆9Updated 3 weeks ago
- StyleTTS 2 Optimized Training Fork☆18Updated this week
- zero-shot realtime TTS system, fully offline, free and open source☆25Updated 2 weeks ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated 10 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 11 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆16Updated 2 years ago
- Simple audio recorder that sends WAV from browser to server in Python (Flask).☆31Updated 2 years ago
- My guide to create an italian TTS with Coqui☆14Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆14Updated 9 months ago
- Keras(Tensorflow) implementations of Automatic Speech Recognition☆23Updated 3 years ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- ☆22Updated 3 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆16Updated 5 months ago
- phonetic similarity algorithms☆12Updated 6 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆10Updated 11 months ago