voiceboxneurips / voicebox
☆20Updated 2 years ago
Alternatives and similar repositories for voicebox:
Users that are interested in voicebox are comparing it to the libraries listed below
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆37Updated 4 months ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆51Updated last week
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆22Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆52Updated 2 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- ☆46Updated 9 months ago
- ☆48Updated 7 months ago
- ☆50Updated last year
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆20Updated 7 months ago
- US-based professors who work on audio. For students who would like to apply for RA, PhD, postdoc in audio research.☆25Updated 3 weeks ago
- This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially S…☆18Updated 5 months ago
- ☆17Updated last week
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆97Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- ☆30Updated 5 months ago
- Query-conditioned target sound extraction model☆21Updated last month
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆23Updated 4 months ago
- Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation☆28Updated last year
- MSP-Podcast Challenge Baseline Code☆21Updated 10 months ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…☆60Updated last year
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆40Updated 2 years ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆31Updated last month
- ☆20Updated 7 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 6 months ago
- Official data preparation scripts for the URGENT 2024 Challenge☆76Updated 3 months ago
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆20Updated 6 months ago
- It includes papers on speech&audio field. Now update: ICLR2023-2025, ICML2023-2024, NeurIPS2023-2024, ACMMM2024, AAAI2024, ACL2024, EMNLP…☆49Updated this week
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆26Updated last month
- ☆19Updated last year