avryhof / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13 Updated 3 years ago
Alternatives and similar repositories for speech_recognition
Users who are interested in speech_recognition are comparing it to the libraries listed below:
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- Sequence to sequence model for Arabic punctuation prediction. ☆12 Updated 5 years ago
- Evaluation of STT models for the German language ☆15 Updated 3 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆18 Updated 5 months ago
- ☆13 Updated 10 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 Updated 2 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. ☆14 Updated 10 months ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆20 Updated 3 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages ☆38 Updated last week
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine) ☆16 Updated last year
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi. ☆22 Updated 6 years ago
- ☆17 Updated 4 years ago
- ☆11 Updated 3 years ago
- Project of Singing Voice Conversion. ☆15 Updated last year
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription. ☆12 Updated 11 months ago
- Using OpenVINO to speed up MeloTTS inference ☆13 Updated 10 months ago
- Wake word spotting with Kaldi ☆19 Updated 4 years ago
- Launch your speech synthesis within one minute. ☆12 Updated last year
- Coqui Inference Engine ☆41 Updated 4 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆29 Updated 2 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆21 Updated 8 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 Updated 7 months ago
- A repository for trainable multi-speaker TTS ☆14 Updated 3 years ago
- Transfer learning approach to pronunciation scoring ☆10 Updated last year
- ☆22 Updated 4 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆24 Updated 4 years ago
- Open source cross-platform implementation of MRCP protocol ☆20 Updated 3 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד) ☆11 Updated 3 years ago
- ☆12 Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 4 years ago