fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆61Updated 2 years ago
Alternatives and similar repositories for speech-recognition-experiments
Users that are interested in speech-recognition-experiments are comparing it to the libraries listed below
- ONNX Inference of Pyannote Segmentation☆93Updated 9 months ago
- On-device voice activity detection (VAD) powered by deep learning☆230Updated last week
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆86Updated last year
- Using OpenVINO to speed up MeloTTS inference☆13Updated 11 months ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆113Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆81Updated last week
- Port of Funasr's Paraformer model in C/C++☆34Updated last year
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆46Updated last year
- Python bindings of speexdsp noise suppression library☆40Updated 2 years ago
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 4 years ago
- ONNX implementation of Whisper. PyTorch free.☆99Updated 10 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated 2 years ago
- OpenVINO version of openai/whisper☆175Updated last year
- On-device noise suppression powered by deep learning☆75Updated 2 months ago
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- A curated list of awesome voice activity detection☆66Updated 10 months ago
- How to create your own model for vosk☆74Updated 4 years ago
- Colab notebooks for Next-gen Kaldi☆28Updated last week
- Python Wrapper of Silero VAD☆61Updated 5 months ago
- Open models for Coqui STT☆144Updated 2 years ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆21Updated last year
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆66Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆71Updated 3 years ago
- An even smaller speech recognizer / force aligner☆36Updated 9 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆31Updated last year
- Cantonese Text to Speech with VITS implementation☆36Updated 2 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago