MKT-Dataoceanai / CNVSRC2023Baseline

Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)

☆21

Related projects ⓘ

Alternatives and complementary repositories for CNVSRC2023Baseline

TaoRuijie / MFV-KSD
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
☆14Updated 3 months ago
TaoRuijie / AVCleanse
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆32Updated 2 years ago
zexupan / MuSE
☆32Updated 3 years ago
TakHemlata / RawBoost-antispoofing
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…
☆50Updated last year
zexupan / reentry
☆14Updated 2 years ago
xieyuankun / Codecfake
This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".
☆40Updated last month
mispchallenge / misp2022_baseline
☆26Updated last year
nii-yamagishilab / PartialSpoof
☆39Updated 3 months ago
asvspoof-challenge / asvspoof5
☆31Updated last month
KunZhou9646 / Emovox
This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
☆81Updated 2 years ago
wngh1187 / RawNeXt
Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…
☆23Updated 2 years ago
YoungSeng / SRD-VC
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆111Updated 9 months ago
mispchallenge / MISP-ICME-AVSR
☆17Updated 10 months ago
SVDDChallenge / CtrSVDD2024_Baseline
Baseline system for SVDD 2024 Challenge CtrSVDD track
☆18Updated last month
KunZhou9646 / controllable_evc_code
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆41Updated 3 years ago
cogmhear / avse_challenge
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆33Updated 7 months ago
zyzisyz / mfa_conformer
☆136Updated last year
zexupan / USEV
☆13Updated 4 months ago
Levent9 / Zero-shot-FaceVC
☆17Updated 8 months ago
kimho1wq / MR-RawNet
This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.
☆10Updated 4 months ago
lin9x / AV-Sepformer
☆45Updated last year
Hunterhuan / sphereface2_speaker_verification
Exploring Binary Classification Loss for Speaker Verification
☆14Updated last year
sinhat98 / adapter-wavlm
☆41Updated last year
shkim816 / temporal_dynamic_cnn
TDY-CNN for text-independent speaker verification
☆17Updated 2 years ago
TaoRuijie / Loss-Gated-Learning
ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'
☆87Updated last year
john852517791 / awesome-fake-audio-detection
A list of tools, papers and code related to Fake Audio Detection.
☆22Updated this week
FFSVC / FFSVC2022_Baseline_System
☆32Updated 2 years ago
lixucuhk / ASV-anti-spoofing-with-Res2Net
Implementation of the paper: Replay and Synthetic Speech Detection with Res2Net architecture (ICASSP 2021) https://arxiv.org/abs/2010.150…
☆75Updated 3 years ago
IMLHF / SpecAugmentPyTorch
A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…
☆11Updated 3 months ago
VoxBlink / ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆22Updated 6 months ago