rhasspy / pysilero-vad
Mike/Projects/pysilero-vad.git
☆13Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for pysilero-vad
- Clustering-based methods for overlapping diarization☆68Updated 10 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆34Updated last year
- Error correction back-end for speaker diarization☆12Updated last year
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆26Updated 2 months ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆59Updated 2 years ago
- ☆27Updated 7 months ago
- Official repository of NeXt-TDNN for speaker verification☆56Updated last month
- Went online decode demo☆29Updated 3 years ago
- Discriminative Training of VBx Diarization☆18Updated last month
- Official Repository For VoxBlink2☆49Updated 3 months ago
- Auto-KWS 2021 Challenge 1st place solution.☆9Updated 3 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆22Updated last month
- ONNX Inference of Pyannote Segmentation☆65Updated 2 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- Target Speaker Extraction Toolkit☆109Updated last week
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆29Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆43Updated 2 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆34Updated 4 months ago
- ☆43Updated 9 months ago
- Online streaming speaker change detection model in Pytorch☆36Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆35Updated 2 years ago
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 3 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆19Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago
- ☆41Updated last year
- Python Wrapper of Silero VAD☆41Updated 2 weeks ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆64Updated 5 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 4 months ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆37Updated 5 months ago