Sid2697 / Word-recognition-EmbedNet-CAB
Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
☆21 Updated 4 years ago
Alternatives and similar repositories for Word-recognition-EmbedNet-CAB
Users who are interested in Word-recognition-EmbedNet-CAB are comparing it to the repositories listed below
- Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval" ☆15 Updated 11 months ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo ☆35 Updated 2 years ago
- Labeled Movie Trailer Dataset ☆16 Updated 7 years ago
- An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller." presented at the Storytelling Workshop, co-lo… ☆34 Updated 6 years ago
- ☆17 Updated 3 years ago
- ☆37 Updated 3 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch ☆59 Updated 4 years ago
- Tooling to play around with multilingual machine translation for Indian Languages. ☆22 Updated 3 years ago
- ☆44 Updated 3 years ago
- A unified framework to jointly model images, text, and human attention traces. ☆78 Updated 4 years ago
- Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating a… ☆85 Updated 2 years ago
- Identifying Visible Actions in Lifestyle Vlogs ☆15 Updated last year
- A non-JIT implementation / replication of OpenAI's CLIP in PyTorch ☆34 Updated 4 years ago
- A repository containing the details of a natural language inference dataset in Hindi ☆11 Updated 4 years ago
- Connective Cognition Network for Directional Visual Commonsense Reasoning ☆15 Updated 4 years ago
- Code for running the trained model from Visual Reasoning by Progressive Module Networks (ICLR19) ☆15 Updated 6 years ago
- Website for TextVQA dataset. ☆28 Updated 2 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow. ☆20 Updated 3 years ago
- PyTorch implementation of HUSE: Hierarchical Universal Semantic Embeddings ☆14 Updated 5 years ago
- Shapley values for assessing the importance of each frame in a video ☆17 Updated 4 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset ☆57 Updated 4 years ago
- Image Captioning: Implementing the Neural Image Caption Generator with Python ☆64 Updated 7 years ago
- A one-stop shop for YouCook2 info such as leaderboard and recent advances on (cooking) video retrieval and captioning. ☆40 Updated 2 years ago
- Model submitted for the ICMI 2018 EmotiW Group-Level Emotion Recognition Challenge ☆79 Updated 6 years ago
- In-the-wild Question Answering ☆15 Updated 2 years ago
- Code for "Weakly-supervised Fingerspelling Recognition in British Sign Language Videos", BMVC 2022. ☆12 Updated 2 years ago
- Google Summer of Code 2018 Project: Multilingual Neural Machine Translation System for TV News ☆27 Updated last year
- Video classification tools using 3D ResNet ☆23 Updated 7 years ago
- ☆23 Updated 3 years ago
- COMIC: This is the code repo of our TMM2019 work titled "COMIC: Towards a Compact Image Captioning Model with Attention". ☆15 Updated 4 years ago