mosave / LVTerminalLinks
Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)
☆16Updated last year
Alternatives and similar repositories for LVTerminal
Users that are interested in LVTerminal are comparing it to the libraries listed below
Sorting:
- a repository for trainabale tts multi speaker☆14Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Russian phonetical transcription☆10Updated last year
- STT VOSK REST API☆9Updated last year
- Голосовой терминал для MajorDoMo☆27Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Russian accentuator and IPA transcriber☆13Updated 9 months ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Updated 3 years ago
- ☆13Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Updated 3 years ago
- Thai Grapheme to Phoneme (G2P) Wiktionary Corpus☆11Updated 2 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- ☆12Updated 3 years ago
- Transfer learning approach to pronunciation scoring☆10Updated last year
- Simple audio AE☆12Updated 7 months ago
- ☆21Updated 5 years ago
- Normalize Text in Russian☆27Updated last year
- Generate samples using Piper to train wake word models☆38Updated last year
- Simple Kaldi recipe for forced alignment☆10Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 9 months ago
- BurrMill core☆21Updated 3 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- The aim of this project is to make voice assistants more responsive towards whisper to some extent.☆10Updated 6 years ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 5 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆13Updated 6 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- ☆26Updated 3 weeks ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Updated 3 years ago
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆9Updated 7 months ago