avryhof / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Updated 3 years ago
Alternatives and similar repositories for speech_recognition
Users who are interested in speech_recognition are comparing it to the libraries listed below.
- Evaluation of STT models for the German language☆15Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago
- IPA Phonemizer/Dephonemizer for 139 human languages☆30Updated this week
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- A repository for trainable multi-speaker TTS☆14Updated 3 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 8 months ago
- Russian phonetic transcription☆10Updated last year
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆20Updated 2 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆18Updated 3 years ago
- ☆11Updated last year
- ☆11Updated 10 years ago
- Text-to-Speech converter for Basque and Spanish. It includes linguistic processing and prebuilt voices for both languages. Its…☆17Updated last year
- Russian accentuator and IPA transcriber☆14Updated 10 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- Open source cross-platform implementation of MRCP protocol☆20Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 5 months ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- BBB plugin for automatic subtitles in conference calls☆29Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 6 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆15Updated 5 months ago
- Coqui Inference Engine☆40Updated 3 years ago
- ☆8Updated 2 years ago
- Wake word spotting with Kaldi☆19Updated 4 years ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language.☆10Updated 5 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- ☆11Updated last week