bigcash / awesome-vad
A curated list of awesome voice activity detection
☆39 Updated 3 months ago
Alternatives and similar repositories for awesome-vad:
Users who are interested in awesome-vad are comparing it to the libraries listed below.
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 Updated 8 months ago
- Tunable pipelines ☆31 Updated last week
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆80 Updated last year
- ☆54 Updated 8 months ago
- Singing voice synthesis developed with VITS and Opencpop; differs from VISinger. ☆35 Updated 2 years ago
- ☆21 Updated last month
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva ☆82 Updated 2 weeks ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆59 Updated 6 months ago
- Implementation of Google's USM speech model in PyTorch ☆29 Updated last month
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆97 Updated 3 weeks ago
- Putting flows on top of neural transducers for better TTS ☆62 Updated 2 weeks ago
- Official Code for ParrotTTS ☆48 Updated 4 months ago
- Python bindings of speexdsp noise suppression library ☆37 Updated 2 years ago
- ☆56 Updated 2 years ago
- a lightweight voice conversion ☆81 Updated 6 months ago
- C++ version of the pyannote audio speaker diarization pipeline ☆20 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆93 Updated 4 months ago
- ☆12 Updated 2 years ago
- ☆19 Updated last year
- ☆31 Updated 11 months ago
- Adaptive Vocoder for Custom Voice ☆59 Updated 2 years ago
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆29 Updated 10 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection ☆58 Updated 2 weeks ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning' ☆94 Updated 7 months ago
- ☆33 Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp… ☆48 Updated last year
- ☆22 Updated 3 years ago
- List of repositories relevant to VITS. ☆36 Updated 2 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated last year