fquirin / speech-recognition-experimentsLinks
Experiments to test different speech recognition systems for SEPIA Framework
☆62Updated 2 years ago
Alternatives and similar repositories for speech-recognition-experiments
Users that are interested in speech-recognition-experiments are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆241Updated this week
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- ONNX and TensorRT implementation of Whisper☆66Updated 2 years ago
- On-device noise suppression powered by deep learning☆80Updated 2 weeks ago
- openvino version of openai/whisper☆180Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free.☆102Updated last year
- A simple, but performant framework for mapping speech directly to categories and intents.☆24Updated last year
- Python bindings of speexdsp noise suppression library☆46Updated 3 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆22Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- A curated list of awesome voice activity detection☆71Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆121Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆93Updated 3 months ago
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆48Updated last year
- Zero-shot Audio Classification using Whisper☆79Updated 3 years ago
- Snowboy reimplementation☆93Updated 3 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆110Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Open models for Coqui STT☆149Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- How to create your own model for vosk☆74Updated 4 years ago
- Tunable pipelines☆41Updated 4 months ago
- Port of Funasr's Paraformer model in C/C++☆39Updated last year
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆107Updated 3 years ago
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆83Updated 4 years ago
- C++ library for converting text to phonemes for Piper☆137Updated 6 months ago
- Colab notebooks for Next-gen Kaldi☆29Updated 3 months ago
- Python Wrapper of Silero VAD☆64Updated 8 months ago
- Using OpenVINO to speed up MeloTTS inference☆15Updated last year