voiceboxneurips / voicebox
☆15Updated last year
Related projects: ⓘ
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…☆49Updated 11 months ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆22Updated 11 months ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆13Updated 2 weeks ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆18Updated 9 months ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆21Updated last year
- ☆26Updated last year
- acnn for text-independent speaker recognition☆9Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆16Updated last year
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆13Updated last month
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆33Updated 9 months ago
- ☆27Updated last year
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆42Updated last year
- This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially S…☆8Updated 4 months ago
- ☆12Updated 2 years ago
- ☆19Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆18Updated last year
- Self-supervised Speaker Diarization Interspeech 2022 Implementation☆9Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆33Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆37Updated 3 months ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆12Updated 11 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆42Updated 2 years ago
- Query-conditioned target sound extraction model☆14Updated 3 months ago
- ☆27Updated this week
- SASV2 baseline, a track on ASVspoof5 phase2 challenge☆22Updated 2 months ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess…☆46Updated 7 months ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆18Updated this week
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆42Updated this week
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆15Updated 2 years ago
- Implementation of SpatialCodec.☆51Updated 11 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated last year