fquirin / speech-recognition-experimentsLinks
Experiments to test different speech recognition systems for SEPIA Framework
☆60Updated 2 years ago
Alternatives and similar repositories for speech-recognition-experiments
Users that are interested in speech-recognition-experiments are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆222Updated 2 weeks ago
- ONNX Inference of Pyannote Segmentation☆92Updated 7 months ago
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- ONNX implementation of Whisper. PyTorch free.☆101Updated 8 months ago
- A curated list of awesome voice activity detection☆59Updated 8 months ago
- openvino version of openai/whisper☆170Updated last year
- C++ library for converting text to phonemes for Piper☆128Updated 3 weeks ago
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆83Updated 11 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆70Updated last month
- An open-source, easily accessible package for training and deploying Speech-to-Intent models on microcontrollers and SBCs☆45Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- Open models for Coqui STT☆141Updated 2 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆106Updated 2 years ago
- Python bindings of speexdsp noise suppression library☆40Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆79Updated 2 years ago
- openvino version of openai/whisper☆14Updated 9 months ago
- A simple, but performant framework for mapping speech directly to categories and intents.☆21Updated last year
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆174Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆362Updated last year
- Port of Funasr's Paraformer model in C/C++☆33Updated last year
- Apply machine learning model DTLN for noise suppression and acoustic echo cancellation on Raspberry Pi☆68Updated 3 years ago
- Batch Support for OpenAI Whisper☆94Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆252Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Optimized OpenAI's Whisper TFLite Port for Efficient Offline Inference on Edge Devices☆250Updated 11 months ago
- This repository is a collection of TTS Models in TFLite☆196Updated 4 years ago