Voice Face Association Learning Paper List
☆17May 20, 2023Updated 2 years ago
Alternatives and similar repositories for vfal_papers
Users that are interested in vfal_papers are comparing it to the libraries listed below.
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast☆21Oct 25, 2023Updated 2 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆13Aug 28, 2023Updated 2 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆18Jul 11, 2022Updated 3 years ago
- ☆11Nov 5, 2025Updated 5 months ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- ☆16Apr 27, 2025Updated 11 months ago
- ☆19Jun 8, 2021Updated 4 years ago
- ☆19Mar 2, 2024Updated 2 years ago
- Voice-Face Association Learning Evaluation☆49Feb 13, 2024Updated 2 years ago
- Tools for downloading VoxCeleb2 dataset☆34Mar 16, 2024Updated 2 years ago
- RONO: Robust Discriminative Learning with Noisy Labels for 2D-3D Cross-Modal Retrieval (CVPR 2023, PyTorch Code)☆21Mar 11, 2024Updated 2 years ago
- ☆42Nov 22, 2024Updated last year
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆92May 29, 2023Updated 2 years ago
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆32Feb 28, 2025Updated last year
- Python code to show basic sound separation using Ideal Binary Masks☆13Oct 13, 2018Updated 7 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- ☆64Jun 28, 2023Updated 2 years ago
- Data-Independent Operator: A Training-Free Artifact Representation Extractor for Generalizable Deepfake Detection☆17Mar 19, 2024Updated 2 years ago
- Download and preprocess voxceleb datasets.☆40Jun 18, 2025Updated 10 months ago
- This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".☆15Mar 7, 2022Updated 4 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- Region-Based Optimization in Continual Learning for Audio Deepfake Detection☆12Dec 17, 2024Updated last year
- Code release for ICCV 2019 paper "Spatiotemporal Feature Residual Propagation for Action Prediction"☆14Sep 20, 2021Updated 4 years ago
- Proton density fat fraction calculation for MRI☆12Jul 2, 2025Updated 9 months ago
- ☆16Feb 19, 2026Updated 2 months ago
- A Challenging Benchmark of Anime Style Recognition☆29Feb 19, 2025Updated last year
- PyTorch implementation of ECAPA-TDNN-based speaker recognition for the CN-Celeb dataset☆13Apr 3, 2023Updated 3 years ago
- ☆18Nov 22, 2024Updated last year
- ☆14Jan 7, 2023Updated 3 years ago
- Please go to https://github.com/facebookresearch/stable_signature☆13Jul 26, 2023Updated 2 years ago
- FattyRiot algorithm for separation of fat and water magnetic resonance images☆13Nov 5, 2015Updated 10 years ago
- Bimodal Adaptive Feature Fusion Network for Person Verification☆20Jul 30, 2022Updated 3 years ago
- self ensemble label correction☆17Jul 29, 2022Updated 3 years ago
- ☆20Mar 16, 2020Updated 6 years ago
- Leveraging BERT to Improve Spoken Language Identification☆17Nov 22, 2022Updated 3 years ago
- ☆13Mar 30, 2023Updated 3 years ago
- This is the official PyTorch implementation of the paper “Neural Transformation Fields for Arbitrary-Styled Font Generation”.☆25Jun 10, 2024Updated last year