☆16Apr 27, 2025Updated 10 months ago
Alternatives and similar repositories for SF2F_PyTorch
Users that are interested in SF2F_PyTorch are comparing it to the libraries listed below
Sorting:
- A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019☆14Mar 25, 2023Updated 2 years ago
- [IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast☆22Oct 25, 2023Updated 2 years ago
- [AAAI 2024] XKD: Cross-modal Knowledge Distillation with Domain Alignment for Video Representation Learning.☆15Jul 9, 2024Updated last year
- Voice Face Association Learning Paper List☆17May 20, 2023Updated 2 years ago
- ☆12May 14, 2020Updated 5 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆27Dec 22, 2022Updated 3 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- Code for ISSTA'21 paper 'Attack as Defense: Characterizing Adversarial Examples using Robustness'.☆12Sep 4, 2021Updated 4 years ago
- Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"☆15Oct 25, 2024Updated last year
- Probabilistic Face Embeddings[2019-ICCV]☆23Apr 28, 2020Updated 5 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- Course - 3 of Deep Learning specialisation taught by Andrew Ng offered by Coursera☆10Nov 10, 2017Updated 8 years ago
- Interpretive Spatio-Temporal Features for Multi-Turn Responses Selection☆16Aug 13, 2019Updated 6 years ago
- ☆27Jan 17, 2024Updated 2 years ago
- [Reproduce] Code for the EMNLP2018 paper "A Visual Attention Grounding Neural Model for Multimodal Machine Translation".☆11Jan 19, 2020Updated 6 years ago
- Official implementation of "Max Pooling with Vision Transformers reconciles class and shape in weakly supervised semantic segmentation"☆19Jan 13, 2023Updated 3 years ago
- My implement of InstantBooth☆13Sep 11, 2023Updated 2 years ago
- ☆16Jul 30, 2016Updated 9 years ago
- ☆17Nov 23, 2021Updated 4 years ago
- Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL☆178Mar 24, 2023Updated 2 years ago
- Lecture materials, exercises, and solutions for Machine Learning Nanodegree Udacity Connect Intensive☆10Mar 4, 2018Updated 8 years ago
- For now, Illusion is a convenience layer on top of Vulkan. However, I plan to add more features as I progress in learning.☆14Nov 22, 2019Updated 6 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Jul 9, 2020Updated 5 years ago
- OpenGL and C++14 game engine that loads glTF 2.0☆13Apr 26, 2019Updated 6 years ago
- 中文文本近似计算☆13Jan 22, 2019Updated 7 years ago
- ICCV 2021☆34May 11, 2022Updated 3 years ago
- ☆13Feb 1, 2022Updated 4 years ago
- ☆12Mar 3, 2025Updated last year
- Latest research advances on semantic slot filling.☆25Feb 13, 2023Updated 3 years ago
- INA's library with pretrained models for gender and age prediction from faces.☆24Oct 7, 2024Updated last year
- Instance-level Facial Attributes Editing (CVIU 2021)☆15Jul 19, 2022Updated 3 years ago
- ☆16Jul 21, 2022Updated 3 years ago
- ☆14Nov 11, 2025Updated 4 months ago
- LÖVE example of an animated 2D mesh☆15Aug 18, 2022Updated 3 years ago
- Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Neural Networks☆18Nov 4, 2016Updated 9 years ago
- ☆13Sep 1, 2025Updated 6 months ago
- Pytorch implemenation of the model proposed in the paper: Double Multi-Head Attention for Speaker Verification☆20Jul 25, 2024Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Apr 22, 2024Updated last year