daanzu / py-silero-vad-lite
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies
☆10Updated 3 months ago
Alternatives and similar repositories for py-silero-vad-lite:
Users that are interested in py-silero-vad-lite are comparing it to the libraries listed below
- ☆11Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆24Updated 5 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- ☆10Updated 4 months ago
- source code of EfficientTTS 2☆12Updated last year
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆13Updated 6 months ago
- ☆12Updated 4 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- ☆22Updated last month
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆19Updated 4 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated 10 months ago
- ☆9Updated 2 weeks ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆15Updated 4 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆16Updated 9 months ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 6 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated last year
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆12Updated 2 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆18Updated last year
- Reimplementation of Miipher☆20Updated last year
- ☆13Updated 3 years ago
- ☆28Updated last year