jim-schwoebel / nala
π¦ Nala is an agile open-source voice assistant framework (20+ actions).
β35Updated last year
Alternatives and similar repositories for nala:
Users that are interested in nala are comparing it to the libraries listed below
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ26Updated 2 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- This repository is for wake-word detection in speech using recurrent neural networksβ17Updated 6 years ago
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- Demonstration of gpt-2 model with flask+uwsgi+nginx in web environment containerized in docker for quick deployment.β13Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 5 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated 2 years ago
- Conversational AI Benchmark.β66Updated last year
- Zoom Audio Transcription offlineβ32Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated last year
- A crash course for training speech recognition models using DeepSpeech.β24Updated 3 years ago
- A module for normalising text.β9Updated 5 years ago
- Gentle and praatio scripts for easy forced alignmentβ18Updated 2 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) paβ¦β17Updated 9 years ago
- Experiments with Hugging Face π¬ π€β44Updated 6 months ago
- ParallelWaveGAN adaptation for Mozilla TTSβ15Updated 4 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.β38Updated last week
- Training BERT for punctuation taskβ10Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsβ31Updated 3 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogniβ¦β24Updated 3 years ago
- Speaker diarization and speech to textβ14Updated 4 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a lβ¦β22Updated 7 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ23Updated 3 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated 2 years ago
- Speaker diarization via transfer learningβ27Updated 5 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Updated 10 years ago
- β11Updated 9 years ago