fquirin / speech-recognition-experimentsLinks
Experiments to test different speech recognition systems for SEPIA Framework
☆60Updated 2 years ago
Alternatives and similar repositories for speech-recognition-experiments
Users that are interested in speech-recognition-experiments are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆216Updated 3 weeks ago
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆42Updated last year
- ONNX Inference of Pyannote Segmentation☆90Updated 5 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- openvino version of openai/whisper☆166Updated last year
- ONNX and TensorRT implementation of Whisper☆63Updated 2 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++☆31Updated 11 months ago
- ONNX implementation of Whisper. PyTorch free.☆97Updated 6 months ago
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆62Updated last month
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆66Updated 3 years ago
- On-device noise suppression powered by deep learning☆70Updated 3 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆103Updated last week
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆27Updated 10 months ago
- Open models for Coqui STT☆139Updated 2 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆168Updated last year
- Python Wrapper of Silero VAD☆54Updated 3 weeks ago
- Tunable pipelines☆34Updated 3 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- A sample Android app using [whisper.cpp](https://github.com/ggerganov/whisper.cpp/) to do voice-to-text transcriptions.☆64Updated last year
- Keyword Spotting (KWS) API wrapper for TFLite streaming models.☆12Updated 3 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆173Updated 5 months ago
- ☆38Updated 3 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- C++ library for converting text to phonemes for Piper☆119Updated last year