voiceboxneurips / voicebox
☆17Updated last year
Related projects ⓘ
Alternatives and complementary repositories for voicebox
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆19Updated 11 months ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆13Updated 2 months ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…☆52Updated last year
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆24Updated 2 months ago
- ☆13Updated last month
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆35Updated 2 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- TODO☆35Updated last year
- ☆26Updated last year
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆16Updated this week
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 3 weeks ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆15Updated 2 months ago
- acnn for text-independent speaker recognition☆9Updated 2 years ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆27Updated 4 months ago
- ☆19Updated last year
- ☆15Updated 2 years ago
- ☆46Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆20Updated 3 years ago
- ☆12Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆16Updated last year
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆27Updated last year
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆14Updated 3 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆16Updated 3 weeks ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆35Updated last month
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆29Updated last month
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆45Updated last year
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆22Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆30Updated 3 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆49Updated 2 weeks ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆13Updated 3 months ago