a-nagrani/SVHF-Net

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/a-nagrani/SVHF-Net)

a-nagrani / SVHF-Net

SVHF-Net for Cross-modal binary matching

☆32

Alternatives and similar repositories for SVHF-Net

Users that are interested in SVHF-Net are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KID-7391 / seeking-the-shape-of-sound
View on GitHub
☆19Jun 8, 2021Updated 5 years ago
msaadsaeed / FOP
View on GitHub
Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"
☆23Dec 31, 2025Updated 6 months ago
matthijsvk / TCDTIMITprocessing
View on GitHub
processing and extracting of face and mouth image files out of the TCDTIMIT database
☆47Sep 22, 2020Updated 5 years ago
ms-dot-k / Visual-Context-Attentional-GAN
View on GitHub
PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)
☆25Mar 9, 2024Updated 2 years ago
Derpimort / VGGVox-PyTorch
View on GitHub
Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.
☆25Oct 15, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zhengyang5 / MMED400
View on GitHub
☆13Nov 19, 2024Updated last year
ms-dot-k / Image-to-Speech
View on GitHub
Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…
☆12Apr 29, 2026Updated 2 months ago
fufangze / SDR-GNN
View on GitHub
An official pytorch implementation for the paper: SDR-GNN: Spectral Domain Reconstruction Graph Neural Network for incomplete multimodal …
☆18Dec 30, 2024Updated last year
cmu-mlsp / reconstructing_faces_from_voices
View on GitHub
[NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks
☆193Jan 5, 2020Updated 6 years ago
eeskimez / Talking-Face-Landmarks-from-Speech
View on GitHub
Generating Talking Face Landmarks from Speech
☆158Dec 22, 2022Updated 3 years ago
Marvinmw / ChatVul
View on GitHub
☆13Apr 26, 2023Updated 3 years ago
choijeongsoo / utut
View on GitHub
[TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
☆31Sep 6, 2024Updated last year
penghu-cs / MAN
View on GitHub
Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code)
☆29Apr 7, 2020Updated 6 years ago
Wangt-CN / VQG-GCN
View on GitHub
A GCN based visual question generation model
☆13Aug 21, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
changil / facevoice
View on GitHub
Learning associations between human faces and voices
☆12Feb 15, 2019Updated 7 years ago
lihaod / Deep_inpainting_localization
View on GitHub
Implementation of “Localization of Deep Inpainting Using High-Pass Fully Convolutional Network”
☆30Apr 13, 2022Updated 4 years ago
JunweiLiang / FVTA_MemexQA
View on GitHub
Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19
☆33Jul 1, 2019Updated 7 years ago
PKU-ICST-MIPL / MGAH_TMM2019
View on GitHub
Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"
☆12Jun 17, 2019Updated 7 years ago
yuji-roh / fr-train
View on GitHub
FR-Train: A Mutual Information-Based Approach to Fair and Robust Training (ICML 2020)
☆13Jun 3, 2021Updated 5 years ago
huybery / GDPnet
View on GitHub
GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)
☆11Nov 21, 2021Updated 4 years ago
Gorilla-Lab-SCUT / TTAC2
View on GitHub
[TPAMI 2024] The official implementation of "Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clu…
☆13Mar 19, 2024Updated 2 years ago
a-nagrani / VGGVox
View on GitHub
VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets
☆401Feb 4, 2019Updated 7 years ago
xiangyongcao / CNN-MRF-v1
View on GitHub
This is a modified version of the code for Hyperspectral image classification using CNN (Post-processing code is written in python).
☆10Mar 3, 2018Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
devraj89 / Generalized-Semantic-Preserving-Hashing-for-N-Label-Cross-Modal-Retrieval
View on GitHub
This is the implementation for the paper "Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval"
☆14Dec 7, 2017Updated 8 years ago
saiteja-talluri / Speech2Face
View on GitHub
Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL
☆178Mar 24, 2023Updated 3 years ago
naver-ai / cgl_fairness
View on GitHub
☆14Jan 11, 2024Updated 2 years ago
Finn-Fengming / Vggvox-TensorFlow
View on GitHub
Implementation of the VGGVox network using TensorFlow.
☆17Mar 20, 2026Updated 4 months ago
usnistgov / ActEV_Scorer
View on GitHub
Scoring software for the TRECVID Activities in Extended Video (ActEV) evaluation
☆43Updated this week
v-iashin / VoxCeleb
View on GitHub
An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset
☆12Dec 11, 2019Updated 6 years ago
plnguyen2908 / UniTalk-ASD-code
View on GitHub
[Interspeech 2026] Revisiting Active Speaker Detection: An In-the-Wild Benchmark for Generalization and Robustness
☆22Jun 25, 2026Updated last month
choyingw / Cross-Modal-Perceptionist
View on GitHub
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
☆130Dec 11, 2024Updated last year
yanbeic / CCL
View on GitHub
PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
☆88Jul 7, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
joonson / voxceleb_unsupervised
View on GitHub
Augmentation adversarial training for self-supervised speaker recognition
☆77Aug 15, 2021Updated 4 years ago
shinington / facesec
View on GitHub
Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021
☆13Jun 22, 2021Updated 5 years ago
tuffr5 / CAR-GAN
View on GitHub
code for the paper "Cascade Attention Guided Residue GAN for Cross-Modal Translation"
☆17Oct 31, 2020Updated 5 years ago
artelab / Multi-modal-classification
View on GitHub
This project contains the code of the implementation of the approach proposed in I. Gallo, A. Calefati, S. Nawaz and M.K. Janjua, "Image …
☆22Apr 10, 2019Updated 7 years ago
liujianee / Pertrubation_Rectifying_Network
View on GitHub
Tensorflow implementation of "Defense against Universal Adversarial Perturbations"
☆10Apr 16, 2018Updated 8 years ago
Rongpeng-Lin / A-DA-GAN-architecture
View on GitHub
A basic architecture of "DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks"
☆15Oct 6, 2018Updated 7 years ago
AliaksandrSiarohin / face-makeup.PyTorch
View on GitHub
Lip and hair color editor using face parsing maps.
☆11Jun 10, 2019Updated 7 years ago