jim-schwoebel / nala
π¦ Nala is an agile open-source voice assistant framework (20+ actions).
β35Updated last year
Related projects β
Alternatives and complementary repositories for nala
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- TTS Client for Coqui TTS serverβ13Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics.β11Updated 5 years ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment.β13Updated last year
- A module for normalising text.β9Updated 5 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysoxβ13Updated 6 years ago
- This repository is for wake-word detection in speech using recurrent neural networksβ17Updated 5 years ago
- Zoom Audio Transcription offlineβ32Updated 4 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ35Updated last year
- β74Updated 3 years ago
- Python library for audio augmentationβ83Updated last year
- Gentle and praatio scripts for easy forced alignmentβ18Updated 2 years ago
- How to run GPU accelerated Signal Processing in TensorFlowβ23Updated 6 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ25Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated last year
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- The History of Speech Recognition to the Year 2030β11Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ24Updated last year
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Sβ¦β51Updated 4 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.β13Updated 5 years ago
- Experiments with Hugging Face π¬ π€β45Updated 3 months ago
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β28Updated 3 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- SoundNet, built in Keras with pre-trained 8-layer model.β29Updated 5 years ago
- Generate embedding vectors from audio filesβ56Updated last year
- Coqui Inference Engineβ38Updated 3 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β14Updated 4 years ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility.β53Updated 3 years ago