fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆57Updated last year
Related projects ⓘ
Alternatives and complementary repositories for speech-recognition-experiments
- ONNX Inference of Pyannote Segmentation☆66Updated 2 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆18Updated 9 months ago
- On-device voice activity detection (VAD) powered by deep learning☆179Updated last week
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆66Updated last year
- Open models for Coqui STT☆122Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆44Updated 3 months ago
- ONNX and TensorRT implementation of Whisper☆59Updated last year
- Python Wrapper of Silero VAD☆42Updated 3 weeks ago
- An even smaller speech recognizer / force aligner☆32Updated last week
- openvino version of openai/whisper☆161Updated last year
- How to create your own model for vosk☆65Updated 3 years ago
- On-device noise suppression powered by deep learning☆63Updated last month
- Coqui Inference Engine☆38Updated 3 years ago
- Colab notebooks for Next-gen Kaldi☆26Updated 3 weeks ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆100Updated last year
- Port of Funasr's Paraformer model in C/C++☆25Updated 5 months ago
- Python bindings of speexdsp noise suppression library☆34Updated 2 years ago
- whisper.cpp bindings for python☆77Updated last year
- Onnx wrapper for espnet infrernce model☆156Updated last month
- ☆66Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆141Updated 6 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆80Updated last month
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 3 years ago
- C++ library for converting text to phonemes for Piper☆89Updated 8 months ago
- Zero-shot Audio Classification using Whisper☆74Updated last year
- ☆34Updated 10 months ago
- Create an LJSpeech structured voice dataset on wave input☆21Updated last month
- ONNX implementation of Whisper. PyTorch free.☆85Updated this week
- ☆38Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆84Updated last month