keetsky / Net_ghostVLAD-pytorch
☆21Updated 5 years ago
Alternatives and similar repositories for Net_ghostVLAD-pytorch:
Users that are interested in Net_ghostVLAD-pytorch are comparing it to the libraries listed below
- Efficient python implementation of VLAD image descriptors☆18Updated 7 years ago
- The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆25Updated 6 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 6 years ago
- Implement SphereFace in Pytorch☆36Updated 5 years ago
- (2020) Video Classification Neural Network☆30Updated 5 years ago
- Collection of works from VIPL-AVSU☆41Updated last week
- Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD☆26Updated 5 years ago
- SHN-based (Stacked Hourglass Network) methods for 2D face alignment☆31Updated 5 years ago
- convenience utilities for model validation☆23Updated 5 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆25Updated 2 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- It's a CNN + NetVLAD + CRN network☆12Updated 7 years ago
- Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆56Updated 5 years ago
- Learned Contextual Feature Reweighting for Image Geo-Localization (CVPR 2017)☆33Updated 6 years ago
- Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆70Updated 4 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆23Updated 5 years ago
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…☆21Updated 7 months ago
- PyTorch implementation of NetVLAD & Online Hardest Triplet Loss.☆449Updated 6 years ago
- mnist classify using center loss☆13Updated 6 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19Updated 4 years ago
- Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.☆85Updated 5 years ago
- Source code for models described in the paper "ESResNet: Environmental Sound Classification Based on Visual Domain Models" (https://arxiv…☆31Updated last year
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆22Updated last year
- Python function of Pytorch Grid Sample with Zero Padding☆19Updated 4 years ago
- Co-Separating Sounds of Visual Objects (ICCV 2019)☆94Updated last year
- ☆21Updated 4 years ago
- Official repository of 'Co-Attention for Conditioned Image Matching'☆37Updated 3 years ago
- CP-JKU submission to DCASE 19, performant single-model CNN☆56Updated 4 years ago
- Pytorch implementation of NetVlad for classification on UCF101☆27Updated 4 years ago