pguyot / zamia-speechLinks
Open tools and data for cloudless automatic speech recognition
☆11Updated 6 years ago
Alternatives and similar repositories for zamia-speech
Users that are interested in zamia-speech are comparing it to the libraries listed below
Sorting:
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
 - Open Source Wearable Microphone Array Glasses for Multi-Speaker Speech Recognition☆17Updated 3 years ago
 - Classify audio with neural nets on embedded systems like the Raspberry Pi☆87Updated last year
 - Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Updated 4 years ago
 - Evaluation of STT models for german language☆15Updated 3 years ago
 - Festvox voice building tools☆104Updated 2 months ago
 - How to create your own model for vosk☆75Updated 4 years ago
 - Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 6 months ago
 - This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated 2 years ago
 - Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Updated last year
 - Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆27Updated 4 years ago
 - TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Updated 5 years ago
 - A simple, but performant framework for mapping speech directly to categories and intents.☆22Updated last year
 - a repository for trainabale tts multi speaker☆14Updated 3 years ago
 - Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated 11 months ago
 - Framework for one-shot multispeaker system based on Deep Learning☆19Updated 4 years ago
 - phone inventory library☆17Updated 2 years ago
 - Anaouder mouezh e Brezhoneg gant Vosk☆14Updated 3 months ago
 - Coqui Inference Engine☆41Updated 4 years ago
 - ☆17Updated 4 years ago
 - steps to perform text-based speaker diarization with kaldi toolkit☆12Updated 7 years ago
 - A multilingual ASR model that can recognize ten Turkic languages—Azerbaijani, Bashkir, Chuvash, Kazakh, Kyrgyz, Sakha, Tatar, Turkish, Uy…☆74Updated 3 months ago
 - Python package for noise supression in audio based on DNN☆21Updated 2 years ago
 - Transfer learning approach to pronunciation scoring☆11Updated last year
 - Speech to text library for Rhasspy using Kaldi☆14Updated last year
 - ☆11Updated 3 years ago
 - Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
 - PolEval 2021 Task 1☆15Updated 3 years ago
 - Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 3 years ago
 - Multilingual Grapheme to Phoneme☆50Updated 9 years ago