dscripka / openSpeechToIntent
A simple, but performant framework for mapping speech directly to categories and intents.
☆18Updated 7 months ago
Alternatives and similar repositories for openSpeechToIntent:
Users that are interested in openSpeechToIntent are comparing it to the libraries listed below
- Evaluation of STT models for german language☆15Updated 3 years ago
- A handy dataset of noises for ASR☆20Updated 5 years ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆20Updated last week
- Generate samples using Piper to train wake word models☆28Updated last year
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- Coqui Inference Engine☆38Updated 3 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆10Updated 3 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆15Updated 5 months ago
- ☆11Updated 3 years ago
- ☆11Updated 3 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- ☆12Updated last month
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 7 months ago
- ☆17Updated last year
- A library of speech gadgets.☆13Updated 2 years ago
- ☆20Updated 6 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆30Updated 10 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- wake word spotting with kaldi☆19Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆24Updated 4 months ago
- Forced alignment decoder for Whisper.☆14Updated last year
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆35Updated 3 months ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 2 weeks ago