jim-schwoebel / nalaLinks
π¦ Nala is an agile open-source voice assistant framework (20+ actions).
β35Updated last year
Alternatives and similar repositories for nala
Users that are interested in nala are comparing it to the libraries listed below
Sorting:
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ26Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- πΈTTS recipes for different datasetsβ87Updated 2 years ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment.β13Updated 2 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 4 years ago
- Gentle and praatio scripts for easy forced alignmentβ18Updated 2 years ago
- β76Updated 3 years ago
- The History of Speech Recognition to the Year 2030β13Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ24Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.β25Updated 4 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, reβ¦β47Updated 2 years ago
- Conversational AI Benchmark.β68Updated 2 years ago
- Training BERT for punctuation taskβ10Updated 4 years ago
- This repository is for wake-word detection in speech using recurrent neural networksβ17Updated 6 years ago
- Experiments with Hugging Face π¬ π€β44Updated 10 months ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 10 years ago
- β11Updated 10 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.β13Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ26Updated 2 years ago
- A command line interface to combine text information from subtitles with voice data in the video. Provides a convenient way to generate tβ¦β19Updated last year
- A very basic demonstration connecting speech recognition and text-to-speechβ20Updated 5 years ago
- Code for AccentDB.β22Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languagesβ13Updated 2 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysoxβ13Updated 7 years ago
- π« check your data, before you wreck your modelβ16Updated 2 years ago