mosave / LVTerminal
Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)
☆16Updated last year
Alternatives and similar repositories for LVTerminal:
Users that are interested in LVTerminal are comparing it to the libraries listed below
- a repository for trainabale tts multi speaker☆14Updated 3 years ago
- Russian phonetical transcription☆9Updated last year
- Evaluation of STT models for german language☆15Updated 3 years ago
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 3 years ago
- Tensorflow-based wake word detection☆11Updated 5 months ago
- Russian accentuator and IPA transcriber☆10Updated 6 months ago
- ☆10Updated last month
- Simple audio AE☆12Updated 4 months ago
- ☆8Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated last year
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Updated 3 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 3 years ago
- ☆8Updated last year
- A simple, but performant framework for mapping speech directly to categories and intents.☆19Updated 7 months ago
- Transfer learning approach to pronunciation scoring☆10Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 5 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆18Updated 2 years ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆12Updated 6 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 7 months ago
- A simple command line tool to calculate WER for ASR.☆14Updated 5 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated 3 weeks ago
- Generate samples using Piper to train wake word models☆29Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 2 months ago
- ☆19Updated last year
- ☆10Updated 5 months ago
- Simple Kaldi recipe for forced alignment☆10Updated last year
- [InterSpeech'2021] Reinforcement Learning for Emotional Text-to-Speech Synthesis with Improved Emotion Discriminability☆8Updated 6 months ago
- T5-based (russian) text normalization☆20Updated last year
- The aim of this project is to make voice assistants more responsive towards whisper to some extent.☆10Updated 5 years ago