Sid2697 / Word-recognition-EmbedNet-CAB
Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
☆21Updated 3 years ago
Alternatives and similar repositories for Word-recognition-EmbedNet-CAB:
Users that are interested in Word-recognition-EmbedNet-CAB are comparing it to the libraries listed below
- Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval"☆15Updated 9 months ago
- Labeled Movie Trailer Dataset☆16Updated 7 years ago
- Model submitted for the ICMI 2018 EmotiW Group-Level Emotion Recognition Challenge☆79Updated 6 years ago
- An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller." presented at the Storytelling Workshop, co-lo…☆33Updated 6 years ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo☆35Updated 2 years ago
- Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating a…☆85Updated 2 years ago
- [CVPR 2019] Pytorch code for Audio Visual Scene-Aware Dialog☆34Updated 4 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention".☆15Updated 3 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆106Updated 10 months ago
- A unified framework to jointly model images, text, and human attention traces.☆78Updated 3 years ago
- PyTorch implementation of DRAW: A Recurrent Neural Network For Image Generation trained on Devanagari dataset.☆89Updated 4 years ago
- menovideo: pytorch library for video action recognition and video understanding☆29Updated 3 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Updated 2 years ago
- Pytorch Code for S2IGAN☆41Updated 4 years ago
- Code for our ICCC'19 paper - "Trick or TReAT : Thematic Reinforcement for Artistic Typography"☆19Updated 3 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound`☆83Updated 6 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow.☆20Updated 3 years ago
- 5th Place Solution to 3rd YouTube-8M Video Understanding Challenge (Last Top GB Model)☆13Updated 5 years ago
- LSTM/BOF model to encode Videos. Implementation of our BMVC paper "Story Understanding in Video Advertisements".☆14Updated 4 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆56Updated 4 years ago
- ☆17Updated 3 years ago
- Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19☆32Updated 5 years ago
- Video classification tools using 3D ResNet☆23Updated 7 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning☆15Updated 3 years ago
- Here we describe a new approach to train a video captioning neural network , that is not only based on the normal cross entropy loss for …☆7Updated 5 years ago
- ☆23Updated 3 years ago
- ☆37Updated 7 years ago
- ADVISE model for the Automatic Understanding of Visual Advertisements challenge. Please refer to our ECCV paper "ADVISE: Symbolism and Ex…☆26Updated 6 years ago
- Using an LSTM and 4d convolutional network for lip reading☆12Updated 6 years ago
- Identifying Visible Actions in Lifestyle Vlogs☆15Updated last year