Sid2697 / Word-recognition-EmbedNet-CABLinks
Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
☆21Updated 4 years ago
Alternatives and similar repositories for Word-recognition-EmbedNet-CAB
Users that are interested in Word-recognition-EmbedNet-CAB are comparing it to the libraries listed below
Sorting:
- Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval"☆15Updated last year
- Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating a…☆85Updated 3 years ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo☆35Updated 2 years ago
- ☆38Updated 3 years ago
- ☆17Updated 4 years ago
- Unofficial Implementation of Google Deepmind's paper `Objects that Sound`☆83Updated 7 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch☆65Updated 7 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset☆58Updated 5 years ago
- A Keras Implementation of Supervised Video Summarization using Attention Based Encoder-Decoder Networks☆29Updated 3 years ago
- Model submitted for the ICMI 2018 EmotiW Group-Level Emotion Recognition Challenge☆79Updated 7 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)☆229Updated 2 years ago
- Labeled Movie Trailer Dataset☆16Updated 7 years ago
- Video Captioning is an encoder decoder mode based on sequence to sequence learning☆138Updated last year
- An easy-to-use app to visualise attentions of various VQA models.☆41Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Updated 3 years ago
- Official code for the paper "Visual Speech Enhancement Without A Real Visual Stream" published at WACV 2021☆108Updated last year
- Explores jigsaw puzzles solvinig as pre-text task for fine grained classification for bird species identification (Implemented with pyTor…☆22Updated 5 years ago
- Code for our ICCC'19 paper - "Trick or TReAT : Thematic Reinforcement for Artistic Typography"☆19Updated 4 years ago
- Shapley values for assessing the importance of each frame in a video☆17Updated 4 years ago
- Neural Machine Translator for translating from english to hindi text. Used Pytorch framework with seq2seq architecture having Attention f…☆13Updated 6 years ago
- Code repo for the EMOTIC dataset☆127Updated 6 months ago
- An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller." presented at the Storytelling Workshop, co-lo…☆34Updated 6 years ago
- Google Summer of Code 2018 Project: Multilingual Neural Machine Translation System for TV News☆27Updated last year
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆129Updated 4 years ago
- A Simple but Powerful CNN Trainer For PyTorch☆26Updated 4 years ago
- Pytorch Code for S2IGAN☆41Updated 5 years ago
- State of the Art Language models and Classifier for Hindi language (spoken in Indian sub-continent)☆124Updated 5 years ago
- [EMNLP 2018] PyTorch code for TVQA: Localized, Compositional Video Question Answering☆181Updated 3 years ago
- PyTorch 3D video classification models pre-trained on 65 million Instagram videos☆265Updated 5 years ago
- Image Captioning with Keras☆64Updated 5 years ago