Tr-VAD: An Efficient Transformer based Voice Activity Detection Model
☆17Aug 1, 2024Updated last year
Alternatives and similar repositories for Tr-VAD
Users that are interested in Tr-VAD are comparing it to the libraries listed below
Sorting:
- This is the public repository for SALSA-Lite features for polyphonic sound event localization and detection using microphone arrays.☆12Dec 3, 2021Updated 4 years ago
- Pytorch implementation of "spectro-temporal attention-based voice activity detection"☆13Jun 4, 2024Updated last year
- ASLP Summer Inter@NPU☆12Jul 30, 2024Updated last year
- Codebase of the submitted work in ICASSP 2023☆14Nov 30, 2022Updated 3 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIV…☆21Nov 25, 2024Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- A comprehensive framework to test audio comprehension of Large Audio Language Models.☆59Jan 23, 2026Updated last month
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆76Jul 29, 2024Updated last year
- This is the official implementation of PGUSE☆34Jun 7, 2025Updated 9 months ago
- ☆22Jul 10, 2025Updated 8 months ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆79Sep 22, 2022Updated 3 years ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Landing Page for Divide and Remaster v3☆25Jul 29, 2025Updated 7 months ago
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- A toolkit dedicate for speech evaluation.☆23Sep 26, 2024Updated last year
- Universal differential equations for ecologists☆14Mar 2, 2026Updated last week
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Sep 16, 2023Updated 2 years ago
- Spherical residual vector quantization (SRVQ)☆31Aug 25, 2024Updated last year
- This is the official implementation of reverberant speech to room impulse response estimator☆41Aug 7, 2024Updated last year
- ☆28Dec 22, 2021Updated 4 years ago
- multi-scale time domain speaker extraction☆72Jun 7, 2021Updated 4 years ago
- Simple PyTorch Denoisers for Waveform Audio☆41Mar 1, 2026Updated last week
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 4 months ago
- The program ranked first in Audio-only track of DCASE2024 Challenge task3.☆20Mar 2, 2026Updated last week
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆38Oct 27, 2025Updated 4 months ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆65Sep 22, 2025Updated 5 months ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- ☆46Jan 14, 2025Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆166Dec 12, 2025Updated 2 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆151Jun 5, 2025Updated 9 months ago
- Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection☆40Jul 25, 2025Updated 7 months ago
- Open repository of simulated Room Impulse Responses (RIR) accompanying the paper "Hearing Anywhere in Any Environment"☆71Aug 11, 2025Updated 6 months ago
- Generate synthetic wind noise signals based on a wind speed profile (Python)☆48Apr 23, 2024Updated last year
- Ecoacoustic analysis platform empowering conservationists to analyze acoustic data and to derive insights about the ecosystem at scale☆17Updated this week
- Active noise controller (ANC) design: a practical primer☆13Jan 8, 2026Updated 2 months ago
- This repository will contain links to the most famous available books of ML that are online☆12Oct 15, 2024Updated last year
- ☆15Sep 16, 2024Updated last year