dscripka / openSpeechToIntent
A simple but performant framework for mapping speech directly to categories and intents.
☆16 Updated 3 months ago
Related projects
Alternatives and complementary repositories for openSpeechToIntent
- Evaluation of STT models for the German language ☆15 Updated 2 years ago
- ☆17 Updated last year
- A handy dataset of noises for ASR ☆19 Updated 5 years ago
- Kaldi-style neural network training in PyTorch for use in place of nnet3 in Kaldi. ☆26 Updated 3 months ago
- ☆16 Updated 3 years ago
- ☆11 Updated 3 years ago
- ☆10 Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆15 Updated 2 weeks ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- ☆8 Updated last year
- Goodness of Pronunciation algorithm using PyKaldi ☆13 Updated 2 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core ☆15 Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆13 Updated last year
- Keyword Spotting (KWS) API wrapper for TFLite streaming models. ☆12 Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆13 Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated 8 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆42 Updated 3 years ago
- Coqui Inference Engine ☆38 Updated 3 years ago
- ☆9 Updated last year
- Repo for the paper "Plug-and-Play Multilingual Few-shot Spoken Words Recognition" ☆16 Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for… ☆8 Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from the Ferraz et al. 2024 IEEE ICASSP paper. ☆18 Updated 7 months ago
- (Si)mply a (Re)search front-end for Text-To-Speech Synthesis. ☆10 Updated 6 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning ☆36 Updated 2 years ago
- Phone inventory library ☆15 Updated last year
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆24 Updated 3 years ago
- Wake word spotting with Kaldi ☆19 Updated 3 years ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022) ☆16 Updated last year
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge ☆14 Updated 2 years ago