jim-schwoebel / nala
Nala is an agile open-source voice assistant framework (20+ actions).
☆35 Updated last year
Alternatives and similar repositories for nala:
Users who are interested in nala are comparing it to the repositories listed below.
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 Updated 2 years ago
- ☆11 Updated 9 years ago
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated last year
- A crash course for training speech recognition models using DeepSpeech. ☆24 Updated 3 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 Updated 5 years ago
- 24-hour Automatic Speech Recognition ☆27 Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox ☆13 Updated 7 years ago
- Gentle and praatio scripts for easy forced alignment ☆18 Updated 2 years ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment. ☆13 Updated last year
- The History of Speech Recognition to the Year 2030 ☆12 Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated last year
- A very basic demonstration connecting speech recognition and text-to-speech ☆19 Updated 4 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection? ☆34 Updated 6 years ago
- Coqui's machine learning job scheduler ☆32 Updated 3 years ago
- GPT-jax based on the official huggingface library ☆13 Updated 3 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder ☆64 Updated 6 years ago
- A module for normalising text. ☆9 Updated 5 years ago
- ☆11 Updated 3 years ago
- Easily turn large sets of audio urls to an audio dataset. ☆20 Updated 2 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Updated 3 years ago
- A simple pyaudio microphone interface ☆11 Updated 6 years ago
- Evaluation of STT models for the German language ☆15 Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- Zoom Audio Transcription offline ☆32 Updated 4 years ago
- Interface for using TTS and vocoder models in the form of a text editor ☆19 Updated 2 years ago
- ☆74 Updated 3 years ago