SVHF-Net for Cross-modal binary matching
☆32Aug 22, 2018Updated 7 years ago
Alternatives and similar repositories for SVHF-Net
Users that are interested in SVHF-Net are comparing it to the libraries listed below
Sorting:
- processing and extracting of face and mouth image files out of the TCDTIMIT database☆46Sep 22, 2020Updated 5 years ago
- Implementing VGGVox for Speaker Identification on VoxCeleb1 dataset in PyTorch.☆25Oct 15, 2020Updated 5 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Mar 9, 2024Updated last year
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆33Jul 1, 2019Updated 6 years ago
- ☆30Aug 9, 2022Updated 3 years ago
- [TASLP 2024] Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation☆31Sep 6, 2024Updated last year
- [NeurIPS 2019] Face Reconstruction from Voice using Generative Adversarial Networks☆194Jan 5, 2020Updated 6 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆399Feb 4, 2019Updated 7 years ago
- PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)☆20Apr 11, 2022Updated 3 years ago
- La-O-Dan iOS study☆19Oct 11, 2011Updated 14 years ago
- The Introduction of the OLKAVS Dataset☆37May 28, 2024Updated last year
- Audio-Visual Speech Recognition using Sequence to Sequence Models☆83Jul 10, 2020Updated 5 years ago
- PyTorch Implementation on Paper [CVPR2021]Distilling Audio-Visual Knowledge by Compositional Contrastive Learning☆89Jul 7, 2021Updated 4 years ago
- Visual Relationship Understanding☆10Oct 2, 2021Updated 4 years ago
- CS20 (Tensorflow) 정리☆13Oct 24, 2018Updated 7 years ago
- Dota Auto Chess Picker is a utility for planning your strategy☆11Oct 1, 2020Updated 5 years ago
- LPC(線形予測分析)法によるホルマント周波数とピッチ周波数を推定する 簡略的なプログラム。☆12Mar 7, 2019Updated 7 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Original implementation of the pooling method introduced in "Speaker embeddings by modeling channel-wise correlations"☆11Sep 20, 2021Updated 4 years ago
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆43Feb 21, 2023Updated 3 years ago
- ☆12Jun 22, 2020Updated 5 years ago
- Frequency tracking in time-frequency representations☆13Jan 19, 2021Updated 5 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Aug 11, 2023Updated 2 years ago
- Official Codebase of "Localizing Visual Sounds the Easy Way" (ECCV 2022)☆40Oct 2, 2022Updated 3 years ago
- ☆40Jul 19, 2018Updated 7 years ago
- Generating Talking Face Landmarks from Speech☆160Dec 22, 2022Updated 3 years ago
- Editor for making simple bandlimited waveform SVGs☆17May 25, 2015Updated 10 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- A benchmark of programming tasks for LLMs that supports almost any programming language.☆13Jun 30, 2025Updated 8 months ago
- Julia package for transfer operator spectral methods☆11Aug 14, 2024Updated last year
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- Types and functions for working with continued fractions in Julia☆12Feb 7, 2022Updated 4 years ago
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated last month
- ☆10Jul 7, 2020Updated 5 years ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 6 years ago
- GLVisualize for the Web☆10Feb 8, 2020Updated 6 years ago
- ☆13Oct 25, 2024Updated last year