dscripka / openSpeechToIntentLinks
A simple, but performant framework for mapping speech directly to categories and intents.
☆22Updated last year
Alternatives and similar repositories for openSpeechToIntent
Users that are interested in openSpeechToIntent are comparing it to the libraries listed below
Sorting:
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated 11 months ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Tensorflow-based wake word detection☆16Updated 2 weeks ago
- Wake word detection with custom phrases without model training☆21Updated 2 months ago
- wake word spotting with kaldi☆19Updated 4 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 4 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 7 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆63Updated 2 years ago
- Sisyphus recipies for ASR☆18Updated last week
- A handy dataset of noises for ASR☆22Updated 6 years ago
- On-device noise suppression powered by deep learning☆76Updated 3 months ago
- Coqui Inference Engine☆41Updated 4 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆35Updated 6 months ago
- ☆55Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆29Updated 2 years ago
- ☆11Updated 2 months ago
- ☆17Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆27Updated 8 months ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆11Updated 4 years ago
- Linguistic processing for Common Voice☆58Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- ☆21Updated 7 years ago
- ☆17Updated 4 years ago
- phone inventory library☆17Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year