avryhof / speech_recognitionLinks
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Updated 3 years ago
Alternatives and similar repositories for speech_recognition
Users that are interested in speech_recognition are comparing it to the libraries listed below
Sorting:
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- ☆11Updated 9 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 7 months ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆16Updated last year
- ☆17Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 4 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- phonetic similarity algorithms☆13Updated 6 years ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆16Updated last year
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 5 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Updated 5 years ago
- Pronounce Arabic words☆19Updated 6 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…☆28Updated 2 years ago
- 🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.☆42Updated 3 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- ☆11Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆34Updated this week
- IPA Phonemizer/Dephonemizer for 139 human languages☆27Updated last month
- ☆22Updated 3 years ago
- VoxLingua107 recipe for SpeechBrain☆13Updated 3 years ago
- ☆8Updated 2 years ago
- a repository for trainabale tts multi speaker☆14Updated 3 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆18Updated 2 years ago