fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆59Updated last year
Alternatives and similar repositories for speech-recognition-experiments:
Users who are interested in speech-recognition-experiments are comparing it to the libraries listed below:
- ONNX Inference of Pyannote Segmentation☆81Updated 3 months ago
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- C++ version of pyannote audio speaker diarization pipeline☆19Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆202Updated last week
- ONNX implementation of Whisper. PyTorch free.☆92Updated 4 months ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr.☆85Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆94Updated 5 months ago
- Python Wrapper of Silero VAD☆48Updated 3 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆109Updated 2 years ago
- Kaldi-compatible online fbank extractor without external dependencies☆90Updated 2 weeks ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆60Updated last year
- C++ library for converting text to phonemes for Piper☆112Updated last year
- Snowboy reimplementation☆84Updated 2 years ago
- OpenVINO version of openai/whisper☆166Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆70Updated 7 months ago
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- On-device noise suppression powered by deep learning☆68Updated last week
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆162Updated last year
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆63Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- Tunable pipelines☆31Updated last month
- ☆38Updated 3 years ago
- Open models for Coqui STT☆134Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆170Updated 3 months ago
- On-device speaker diarization powered by deep learning☆39Updated last week
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆40Updated last year
- OpenVINO version of openai/whisper☆13Updated 5 months ago
- ONNX wrapper for espnet inference model☆161Updated 5 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year