a-nagrani / SVHF-NetView external linksLinks
SVHF-Net for Cross-modal binary matching
☆32Aug 22, 2018Updated 7 years ago
Alternatives and similar repositories for SVHF-Net
Users that are interested in SVHF-Net are comparing it to the libraries listed below
Sorting:
- ☆19Jun 8, 2021Updated 4 years ago
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Sep 22, 2020Updated 5 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Mar 9, 2024Updated last year
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Jul 1, 2019Updated 6 years ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Sep 6, 2024Updated last year
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆34Oct 16, 2024Updated last year
- ☆30Aug 9, 2022Updated 3 years ago
- [NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks☆194Jan 5, 2020Updated 6 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆399Feb 4, 2019Updated 7 years ago
- La-O-Dan iOS study☆19Oct 11, 2011Updated 14 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- The Introduction of the OLKAVS Dataset☆37May 28, 2024Updated last year
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆83Jul 10, 2020Updated 5 years ago
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- Visual Relationship Understanding☆10Oct 2, 2021Updated 4 years ago
- CS20 (Tensorflow) 정리☆13Oct 24, 2018Updated 7 years ago
- LPC(線形予測分析)法によるホルマント周波数とピッチ周波数を推定する簡略的なプログラム。☆12Mar 7, 2019Updated 6 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆39Oct 2, 2022Updated 3 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Aug 11, 2023Updated 2 years ago
- ☆40Jul 19, 2018Updated 7 years ago
- Generating Talking Face Landmarks from Speech☆160Dec 22, 2022Updated 3 years ago
- calling GNU Octave functions from the Julia language☆11Jan 31, 2025Updated last year
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆12Sep 6, 2024Updated last year
- A Julia implementation of ▁▂▃▅▂▇ spark: simple printing of unicode trendlines☆10Nov 24, 2018Updated 7 years ago
- ☆11Mar 25, 2024Updated last year
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆10Mar 18, 2019Updated 6 years ago
- Dockerfile and instructions for human pose estimation implementation using Caffe, OpenCV 3.1.0 and Python 2.7.☆12Mar 3, 2019Updated 6 years ago
- Common Lisp like condition system for Julia☆15Oct 12, 2023Updated 2 years ago
- A CUDA powered audio decoding framework for FLAC.☆11May 22, 2018Updated 7 years ago
- Julia package for editing and displaying binary file data in hexadecimal format☆11Mar 14, 2023Updated 2 years ago
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 2 years ago
- Tools to build knowledge graphs from multi-modal extractions☆12Apr 2, 2020Updated 5 years ago
- This code can be used for 2D facial reconstruction based on a state-of-the-art deep learning technique known as SFSNet.☆13Jul 18, 2020Updated 5 years ago
- Time frequency ridge detection based on relevant ridge portions☆11Aug 17, 2023Updated 2 years ago
- Evaluation metrics and submission file creation scripts the Action Recognition challenge☆14Updated this week
- Generate compositions, supercompositions and variants for a given Hanzi / Kanji☆11Dec 22, 2018Updated 7 years ago
- GDPnet: "Geometry-guided Dense Perspective Network for Speech-Driven Facial Animation." (TVCG 2021)☆11Nov 21, 2021Updated 4 years ago
- 3D Facial Reconstruction☆14Aug 5, 2020Updated 5 years ago