HolgerBovbjerg / SSL-PVADView external linksLinks
A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIVITY DETECTION IN ADVERSE CONDITIONS"
☆21Nov 25, 2024Updated last year
Alternatives and similar repositories for SSL-PVAD
Users that are interested in SSL-PVAD are comparing it to the libraries listed below
Sorting:
- ☆11Nov 7, 2024Updated last year
- Accompanying repository for the paper "Automatic Music Mixing Using a Generative Model of Effect Embeddings"☆21Jan 18, 2026Updated 3 weeks ago
- Convert a mono channel recording into binaural playback with headphones and loudspeakers☆12Dec 6, 2023Updated 2 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 5 months ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Nov 14, 2023Updated 2 years ago
- Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'☆14May 15, 2020Updated 5 years ago
- Tr-VAD: An Efficient Transformer based Voice Activity Detection Model☆17Aug 1, 2024Updated last year
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 3 years ago
- ☆23Feb 2, 2022Updated 4 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆30Jan 13, 2026Updated last month
- This is the official implementation for εar-VAE model including inference and evaluation parts, more details coming soon...☆54Jan 18, 2026Updated 3 weeks ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- ☆25Feb 28, 2023Updated 2 years ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 3 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆60Sep 19, 2024Updated last year
- ☆32Jan 9, 2024Updated 2 years ago
- Ablation study of local spectral attention (LSA) for full-band speech enhancement (SE)☆28Sep 16, 2023Updated 2 years ago
- Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.☆56Sep 1, 2025Updated 5 months ago
- An example of a speech enhancement model deployed with TensorRT.☆77Mar 24, 2025Updated 10 months ago
- NOMAD: Non-Matching Audio Distance (ICASSP 2024)☆30Jun 17, 2025Updated 7 months ago
- OpenFLAM: Framewise Language Audio Model☆88Jan 14, 2026Updated 3 weeks ago
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆81Apr 15, 2025Updated 9 months ago
- ☆43Feb 14, 2025Updated last year
- FUSION is an open-source project aimed at revolutionizing networking through the simulation of advanced SD-EONs and AI-enhanced networks,…☆13Jan 30, 2026Updated 2 weeks ago
- Landing Page for All Things Source Separation☆36Sep 12, 2025Updated 5 months ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 3 months ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆34Mar 22, 2021Updated 4 years ago
- ☆36Jan 6, 2026Updated last month
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆79Sep 22, 2022Updated 3 years ago
- misc programming languages☆11Jan 10, 2023Updated 3 years ago
- Artifact code release for paper "Uniform-Cost Multi-Path Routing for Reconfigurable Data Center Networks"☆12Sep 5, 2024Updated last year
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35May 25, 2023Updated 2 years ago
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- Text-To-Speech for NotebookLM☆37Jul 20, 2025Updated 6 months ago
- ☆11Sep 4, 2023Updated 2 years ago
- Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection☆39Jul 25, 2025Updated 6 months ago
- ☆46Jan 14, 2025Updated last year