wanglin-lw / ST-CapsLinks
☆11Updated 2 years ago
Alternatives and similar repositories for ST-Caps
Users that are interested in ST-Caps are comparing it to the libraries listed below
Sorting:
- ☆176Updated last year
- ☆117Updated 7 months ago
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".☆64Updated last year
- ☆19Updated last year
- A list of tools, papers and code related to Fake Audio Detection.☆207Updated 2 weeks ago
- This repository includes the code to reproduce our paper "Automatic speaker verification spoofing and deepfake detection using wav2vec 2.…☆155Updated 2 years ago
- This reporsitory contains metadata of WavCaps dataset and codes for downstream tasks.☆252Updated last year
- PyTorch Implementation of SimulLR☆11Updated 4 years ago
- Code for LAVSS: Location-Guided Audio-Visual Spatial Audio Separation☆16Updated 10 months ago
- [CVPR 2024] AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation☆43Updated last year
- This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”☆27Updated 7 months ago
- Accepted by TMM 2022☆18Updated 3 years ago
- ☆20Updated last year
- Deformable Speech Transformer (DST)☆35Updated last year
- ☆156Updated 2 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆78Updated last year
- A list of papers (with available code), tutorials, and surveys on recent AI for emotion recognition (AI4ER)☆30Updated last year
- Code for paper "Audio Deepfake Detection with Self-supervised XLS-R and SLS classifier☆55Updated 10 months ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆186Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆192Updated last year
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆22Updated last year
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆162Updated 2 years ago
- Voice Face Association Learning Paper List☆16Updated 2 years ago
- Research progress on speech deepfake detection: Relevant datasets aggregated from the review literature and publicly available codes☆273Updated 6 months ago
- 语音方向实验室/公司/资源/实习等,欢迎推荐或自荐☆590Updated last year
- ☆57Updated last year
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Updated 4 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Updated 3 years ago
- This package aims at simplifying the download of the AudioCaps dataset.☆36Updated 2 years ago
- ☆62Updated last year