fquirin / speech-recognition-experiments
Experiments to test different speech recognition systems for SEPIA Framework
☆62 Updated 2 years ago
Alternatives and similar repositories for speech-recognition-experiments
Users who are interested in speech-recognition-experiments are comparing it to the libraries listed below:
- On-device voice activity detection (VAD) powered by deep learning ☆235 Updated this week
- On-device noise suppression powered by deep learning ☆77 Updated last week
- ONNX Inference of Pyannote Segmentation ☆97 Updated 11 months ago
- ONNX and TensorRT implementation of Whisper ☆65 Updated 2 years ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages. ☆88 Updated 2 months ago
- Zero-shot Audio Classification using Whisper ☆79 Updated 2 years ago
- openvino version of openai/whisper ☆178 Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free. ☆102 Updated last year
- C++ version of pyannote audio speaker diarization pipeline ☆22 Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ☆31 Updated last year
- Python bindings of speexdsp noise suppression library ☆44 Updated 3 years ago
- An enterprise-grade Voice Activity Detector from modelscope and funasr. ☆120 Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆103 Updated last year
- A curated list of awesome voice activity detection ☆69 Updated last year
- Open models for Coqui STT ☆148 Updated 2 years ago
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs ☆47 Updated last year
- How to create your own model for vosk ☆75 Updated 4 years ago
- [WIP] Scripts for fine-tuning Whisper ☆222 Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ☆179 Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆121 Updated 2 years ago
- Port of Funasr's Paraformer model in C/C++ ☆39 Updated last year
- Kaldi-compatible online fbank extractor without external dependencies ☆131 Updated 2 months ago
- This repository is a collection of TTS Models in TFLite ☆201 Updated 4 years ago
- Onnx compatible styletts2 code ☆13 Updated 6 months ago
- Snowboy reimplementation ☆92 Updated 3 years ago
- A simple, but performant framework for mapping speech directly to categories and intents. ☆22 Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation ☆43 Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆153 Updated last year
- Accelerate Whisper tasks, such as transcription, by multiprocessing through parallelization ☆25 Updated 3 years ago