avryhof / speech_recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for speech_recognition
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- ☆11Updated 9 years ago
- Evaluation of STT models for german language☆15Updated 2 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆12Updated 2 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- Easy tool that splits given audio based on speaker.☆11Updated 10 months ago
- Creation of a multi user audio first annotation tool - GSoC 2021☆27Updated last year
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆13Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 2 years ago
- Speaker diarization service☆19Updated this week
- TTS Client for Coqui TTS server☆13Updated last year
- Tools for convert Text to IPA in python☆18Updated last year
- A very basic demonstration connecting speech recognition and text-to-speech☆19Updated 4 years ago
- phonetic similarity algorithms☆12Updated 6 years ago
- ☆9Updated last month
- ☆16Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 6 months ago
- ☆22Updated 3 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated 11 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 5 years ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆15Updated 8 months ago
- ☕🇧🇷 Scripts para o Kaldi em Português Brasileiro☆48Updated 2 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆14Updated this week
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…☆10Updated 2 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- Simple audio recorder that sends WAV from browser to server in Python (Flask).☆32Updated 2 years ago
- a repository for trainabale tts multi speaker☆14Updated 2 years ago