mosave / LVTerminalLinks
Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)
☆16Updated last year
Alternatives and similar repositories for LVTerminal
Users that are interested in LVTerminal are comparing it to the libraries listed below
Sorting:
- a repository for trainabale tts multi speaker☆14Updated 3 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆10Updated 2 years ago
- Russian phonetical transcription☆10Updated last year
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- ☆12Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆16Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆13Updated 7 months ago
- Wenet speech to text for react native☆10Updated 2 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 3 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆19Updated 2 years ago
- ☆8Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆31Updated 10 months ago
- Голосовой терминал для MajorDoMo☆26Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 7 months ago
- ☆11Updated 3 years ago
- Normalize Text in Russian☆27Updated last year
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- Unofficial implementation of wavenext vocoder☆46Updated 9 months ago
- ☆13Updated 3 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 4 months ago
- ☆17Updated 2 years ago
- Russian accentuator and IPA transcriber☆13Updated 8 months ago
- BurrMill core☆21Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆24Updated last year
- Simple audio AE☆12Updated 6 months ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 3 years ago
- ☆21Updated 5 years ago