mosave / LVTerminal
Lite Voice Terminal, an "offline smart speaker" solution powered by an on-premise ASR server (Vosk API / Kaldi engine)
☆15 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for LVTerminal
- A repository for trainable multi-speaker TTS ☆14 · Updated 2 years ago
- This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through ref… ☆4 · Updated 2 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆16 · Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by Lightning ☆13 · Updated 3 weeks ago
- Russian phonetic transcription ☆9 · Updated 11 months ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit… ☆21 · Updated 5 months ago
- Evaluation of STT models for the German language ☆15 · Updated 2 years ago
- Simple inference for VITS2 TTS using ONNX Runtime and espeak-ng in C++ ☆12 · Updated 6 months ago
- Speech-to-text library for Rhasspy using Kaldi ☆14 · Updated 11 months ago
- Proof-of-concept conversation orchestrator with a speech-language model ☆13 · Updated 3 weeks ago
- Source code of EfficientTTS 2 ☆12 · Updated 8 months ago
- WeNet speech-to-text for React Native ☆10 · Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Updated last year
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆18 · Updated this week
- Normalize text in Russian ☆22 · Updated last year
- A handy dataset of noises for ASR ☆19 · Updated 5 years ago
- Simple Kaldi recipe for forced alignment ☆10 · Updated last year
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024) ☆12 · Updated 2 months ago
- This repository contains all the code necessary for running the multilingual DistilWhisper from the Ferraz et al. 2024 IEEE ICASSP paper. ☆18 · Updated 7 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 · Updated last year
- Easy tool that splits given audio based on speaker. ☆11 · Updated 10 months ago
- Unofficial implementation of the WaveNeXt vocoder ☆31 · Updated 2 months ago