keetsky / Net_ghostVLAD-pytorchLinks
☆21Updated 6 years ago
Alternatives and similar repositories for Net_ghostVLAD-pytorch
Users that are interested in Net_ghostVLAD-pytorch are comparing it to the libraries listed below
Sorting:
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 7 years ago
 - Efficient python implementation of VLAD image descriptors☆18Updated 7 years ago
 - The proposed method in LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild☆26Updated 6 years ago
 - (2020) Video Classification Neural Network☆30Updated 5 years ago
 - convenience utilities for model validation☆23Updated 6 years ago
 - Co-Separating Sounds of Visual Objects (ICCV 2019)☆97Updated 2 years ago
 - TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Updated 3 years ago
 - Collection of works from VIPL-AVSU☆49Updated 3 months ago
 - PyTorch implementation of NetVLAD & Online Hardest Triplet Loss.☆463Updated 7 years ago
 - Paddle Implementation of DOLG (ICCV 2021)☆45Updated 2 years ago
 - Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…☆22Updated last year
 - Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆32Updated 6 years ago
 - Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD☆27Updated 6 years ago
 - DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.069…☆118Updated 4 years ago
 - The stastics information of top conference realted to information area including AI, ML, CV, etc☆26Updated 5 years ago
 - Script for converting the pretrained VGGish model provided with AudioSet from TensorFlow to PyTorch, along with a basic smoke test.☆87Updated 6 years ago
 - A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19Updated 5 years ago
 - Implement SphereFace in Pytorch☆36Updated 6 years ago
 - Torch code for using Residual Networks with LSTMs for Lipreading☆99Updated 7 years ago
 - Learned Contextual Feature Reweighting for Image Geo-Localization (CVPR 2017)☆33Updated 7 years ago
 - mnist classify using center loss☆13Updated 7 years ago
 - Video Retrieval, 3D ResNet, Triplet Loss, UCF101, Pytorch☆23Updated 6 years ago
 - Codebase for the paper "Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation" (ECCV2020)☆72Updated 5 years ago
 - Simple Tensorflow implementation of "GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond"☆39Updated 6 years ago
 - Official codes of the paper: Deep Center-Based Dual-Constrained Hashing for Discriminative Face Image Retrieval (DCDH)☆29Updated last year
 - pytorch lmdb dataset with protobuf☆52Updated 6 years ago
 - "LipNet: End-to-End Sentence-level Lipreading" in PyTorch☆69Updated 6 years ago
 - Code for Vision-Infused Deep Audio Inpainting (ICCV 2019)☆57Updated 6 years ago
 - Code for the Active Speakers in Context Paper (CVPR2020)☆55Updated 4 years ago
 - Code for "Speaker Clustering using Dominant Sets", ICPR 2018☆11Updated 4 years ago