IntendedConsequence / vadc
Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech
☆12Updated 7 months ago
Alternatives and similar repositories for vadc:
Users that are interested in vadc are comparing it to the libraries listed below
- ncnn HiFi-GAN☆26Updated 6 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆16Updated last year
- Port of Funasr's Paraformer model in C/C++☆32Updated 10 months ago
- convert spleeter pretrained model to pytorch and onnx, then convert to mnn☆20Updated 4 years ago
- mnn asr demo.☆16Updated last month
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- noise reduction☆17Updated 9 months ago
- Project of Singing Voice Conversion.☆14Updated last year
- ☆15Updated 8 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆52Updated last week
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- Equal Loudness Filter☆10Updated 6 years ago
- silero-vad pytorch implement☆17Updated 5 months ago
- Codebase for "Transcription free filler word detection with Neural semi-CRFs" [ICASSP2023]☆8Updated 9 months ago
- Voice Framework☆14Updated last week
- mnn tts demo.☆14Updated last month
- C code to extract mfcc or fbank features from wav files☆16Updated 5 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆72Updated 8 months ago
- ☆20Updated last year
- The aim of this project is to make voice assistants more responsive towards whisper to some extent.☆10Updated 5 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated last year
- ☆10Updated 5 months ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Updated 6 years ago
- speaker-disentangled speech linguistic content quantizer☆11Updated last month
- audiomod is a project for audio modifications, including audio manipulators such as time-stretching, pitch-shifing, formant-changing, and…☆3Updated 9 months ago
- A library for adding punctuation into a text from ASR.☆17Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated last year
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- ☆28Updated 3 years ago
- A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆11Updated 5 months ago