avryhof / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Updated 2 years ago
Alternatives and similar repositories for speech_recognition:
Users that are interested in speech_recognition are comparing it to the libraries listed below
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆12Updated 4 months ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆15Updated 2 years ago
- a repository for trainabale tts multi speaker☆14Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- zero-shot realtime TTS system, fully offline, free and open source☆28Updated last week
- Open source cross-platform implementation of MRCP protocol☆19Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Russian phonetical transcription☆9Updated last year
- ☆9Updated last week
- ☆11Updated 9 years ago
- ☆8Updated last year
- wake word spotting with kaldi☆19Updated 4 years ago
- StyleTTS 2 Optimized Training Fork☆24Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆22Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated last month
- VoxLingua107 recipe for SpeechBrain☆13Updated 3 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆18Updated 2 years ago
- Project of Singing Voice Conversion.☆14Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 7 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated last month
- ☆13Updated 3 years ago
- Faster Whisper ASR transcription with CTranslate2☆19Updated 4 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago