jim-schwoebel / nala
Nala is an agile open-source voice assistant framework (20+ actions).
★35 · Updated last year
Alternatives and similar repositories for nala:
Users who are interested in nala are comparing it to the libraries listed below.
- Simple text to phonemes converter for multiple languages ★20 · Updated 2 years ago
- Automatically align transcribed audio and generate a wav2letter training corpus ★36 · Updated last year
- TTS Client for Coqui TTS server ★13 · Updated 2 years ago
- 24-hour Automatic Speech Recognition ★27 · Updated 3 years ago
- ★74 · Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ★26 · Updated 2 years ago
- Zoom Audio Transcription offline ★32 · Updated 4 years ago
- Coqui's machine learning job scheduler ★32 · Updated 3 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ★17 · Updated 2 years ago
- The History of Speech Recognition to the Year 2030 ★12 · Updated 3 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ★17 · Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ★101 · Updated last year
- Demonstration of GPT-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment. ★13 · Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ★25 · Updated last year
- OCTRA is a web-application for the orthographic transcription of audio files. ★39 · Updated last week
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ★16 · Updated 6 years ago
- Dataset Release for Intent Classification from Speech ★46 · Updated 3 weeks ago
- Compute useful transcription metrics (CER, WER, SER, ...) ★27 · Updated 10 years ago
- 🐸TTS recipes for different datasets ★86 · Updated 2 years ago
- ParallelWaveGAN adaptation for Mozilla TTS ★15 · Updated 4 years ago
- Experiments with generating GPT-2 fanfiction on specified topics. ★11 · Updated 5 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ★64 · Updated 6 years ago
- ★11 · Updated 9 years ago
- Create modular, cross-browser, web audio pipelines to record and process audio in background threads. Comes with modules for VAD, ASR, re… ★47 · Updated last year
- Python library for audio augmentation ★83 · Updated last year
- Gentle and praatio scripts for easy forced alignment ★18 · Updated 2 years ago
- PyTorch implementation of DeepMind's WaveRNN model ★121 · Updated 5 years ago
- Phonetic and phonological vocoding platform ★16 · Updated 8 years ago
- Code for AccentDB. ★20 · Updated 3 years ago
- Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models" ★11 · Updated 5 years ago