fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆58Updated last year
Alternatives and similar repositories for speech-recognition-experiments:
Users that are interested in speech-recognition-experiments are comparing it to the libraries listed below
- ONNX Inference of Pyannote Segmentation☆80Updated last month
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆198Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆107Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆78Updated 2 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆87Updated 2 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆76Updated last year
- Python bindings of speexdsp noise suppression library☆36Updated 2 years ago
- ASR client for Triton ASR Service☆26Updated 2 months ago
- Python Wrapper of Silero VAD☆47Updated last month
- openvino version of openai/whisper☆12Updated 4 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆58Updated 6 months ago
- ☆38Updated 3 years ago
- ONNX implementation of Whisper. PyTorch free.☆92Updated 3 months ago
- Utilizes ONNX Runtime for audio denoising.☆32Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆109Updated last year
- Colab notebooks for Next-gen Kaldi☆26Updated last week
- 达摩fsmn vad c++推理服务☆12Updated last year
- Onnx wrapper for espnet infrernce model☆161Updated 4 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆92Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆93Updated 4 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆42Updated last year
- whisper.cpp bindings for python☆87Updated last year
- Port of Funasr's Paraformer model in C/C++☆28Updated 8 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆88Updated 9 months ago
- Coqui Inference Engine☆38Updated 3 years ago
- On-device noise suppression powered by deep learning☆66Updated this week
- On-device speaker diarization powered by deep learning☆38Updated last week
- ☆21Updated 2 weeks ago