jim-schwoebel / nalaLinks
π¦ Nala is an agile open-source voice assistant framework (20+ actions).
β35Updated 2 years ago
Alternatives and similar repositories for nala
Users that are interested in nala are comparing it to the libraries listed below
Sorting:
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated 2 years ago
- This repository is for wake-word detection in speech using recurrent neural networksβ17Updated 6 years ago
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ26Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- β12Updated 10 years ago
- speech engine training projectsβ29Updated 4 years ago
- β76Updated 3 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environmentsβ102Updated 5 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.β39Updated last week
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 4 years ago
- A very basic demonstration connecting speech recognition and text-to-speechβ20Updated 5 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.β15Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β101Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogniβ¦β24Updated 4 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generationβ21Updated 6 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Web-based tool for straight-forward class annotation of audio filesβ11Updated 5 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β128Updated 9 months ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β129Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN modelβ121Updated 6 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 10 years ago
- A crash course for training speech recognition models using DeepSpeech.β25Updated 4 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".β28Updated 3 years ago
- Zoom Audio Transcription offlineβ32Updated 4 years ago
- On-device Speech-to-Index engine powered by deep learningβ37Updated 4 months ago
- Tools for working with the CMU Pronunciation Dictionaryβ36Updated 7 years ago