fquirin / speech-recognition-experimentsLinks
Experiments to test different speech recognition systems for SEPIA Framework
☆60Updated 2 years ago
Alternatives and similar repositories for speech-recognition-experiments
Users that are interested in speech-recognition-experiments are comparing it to the libraries listed below
Sorting:
- ONNX Inference of Pyannote Segmentation☆91Updated 6 months ago
- Python bindings of speexdsp noise suppression library☆39Updated 2 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- ONNX and TensorRT implementation of Whisper☆63Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆218Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆64Updated 2 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆102Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆78Updated 10 months ago
- ONNX implementation of Whisper. PyTorch free.☆99Updated 7 months ago
- Python Wrapper of Silero VAD☆55Updated last month
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- openvino version of openai/whisper☆167Updated last year
- Using OpenVINO to speed up MeloTTS inference☆11Updated 7 months ago
- Colab notebooks for Next-gen Kaldi☆27Updated 2 months ago
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆24Updated last year
- Speaker diarization service☆23Updated 2 months ago
- Snowboy reimplementation☆89Updated 3 years ago
- Utilizes ONNX Runtime for audio denoising.☆55Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 8 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆107Updated this week
- 达摩fsmn vad c++推理服务☆14Updated 2 years ago
- ☆28Updated 4 months ago
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 3 years ago
- On-device noise suppression powered by deep learning☆72Updated last week
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆67Updated 3 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆96Updated last week
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆40Updated 6 months ago