Sid2697 / Word-recognition-EmbedNet-CAB
Code implementation for our ICPR, 2020 paper titled "Improving Word Recognition using Multiple Hypotheses and Deep Embeddings"
☆21 Updated 4 years ago
Alternatives and similar repositories for Word-recognition-EmbedNet-CAB
Users that are interested in Word-recognition-EmbedNet-CAB are comparing it to the libraries listed below
- Code implementation for our DAS, 2020 paper titled "Fused Text Recogniser and Deep Embeddings Improve Word Recognition and Retrieval" ☆15 Updated last year
- Code to train and evaluate the GeNeVA-GAN model for the GeNeVA task proposed in our ICCV 2019 paper "Tell, Draw, and Repeat: Generating a… ☆85 Updated 3 years ago
- EgoCom: A Multi-person Multi-modal Egocentric Communications Dataset ☆59 Updated 5 years ago
- A Keras Implementation of Supervised Video Summarization using Attention Based Encoder-Decoder Networks ☆30 Updated 3 years ago
- An easy-to-use app to visualise attentions of various VQA models. ☆41 Updated 3 years ago
- Model submitted for the ICMI 2018 EmotiW Group-Level Emotion Recognition Challenge ☆79 Updated 7 years ago
- Labeled Movie Trailer Dataset ☆16 Updated 7 years ago
- Unofficial Implementation of Google DeepMind's paper `Objects that Sound` ☆83 Updated 7 years ago
- A neural network architecture (CNN+LSTM) that automatically generates captions from images. The model uses ResNet architecture to trai… ☆25 Updated 5 years ago
- 12-in-1: Multi-Task Vision and Language Representation Learning Web Demo ☆35 Updated 3 years ago
- An implementation of the paper "Contextualize, Show and Tell: A Neural Visual Storyteller." presented at the Storytelling Workshop, co-lo… ☆34 Updated 6 years ago
- 5th Place Solution to 3rd YouTube-8M Video Understanding Challenge (Last Top GB Model) ☆13 Updated 6 years ago
- Collection of useful FFMPEG commands for processing audio and video files. ☆44 Updated 6 years ago
- This repository is the main Food Recognition Benchmark template and Starter kit. Clone the repository to compete now! ☆69 Updated 2 years ago
- State-of-the-art language models and classifier for the Hindi language (spoken in the Indian subcontinent) ☆124 Updated 5 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020) ☆230 Updated 2 years ago
- Generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset ☆81 Updated 7 years ago
- PyTorch Tutorial on Google Colaboratory. ☆77 Updated 6 years ago
- Deep Audio-Visual Embedding network (DAVEnet) implementation in PyTorch ☆65 Updated 7 years ago
- ☆38 Updated 4 years ago
- Image Captioning: Implementing the Neural Image Caption Generator with Python ☆64 Updated 8 years ago
- Image captioning using attention models ☆39 Updated 5 years ago
- A unified framework to jointly model images, text, and human attention traces. ☆79 Updated 4 years ago
- Easy-to-use video deep features extractor ☆322 Updated 5 years ago
- ☆37 Updated 8 years ago
- Explores jigsaw puzzle solving as a pretext task for fine-grained classification for bird species identification (Implemented with pyTor… ☆22 Updated 5 years ago
- [CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations ☆564 Updated 4 months ago
- Making use of (language model + image model) to generate captions on Flickr images. CNN + LSTM + Transfer learning ☆20 Updated 7 years ago
- Code for our paper: *Shamsian, *Kleinfeld, Globerson & Chechik, "Learning Object Permanence from Video" ☆68 Updated last year
- Datasets, transforms and samplers for video in PyTorch ☆88 Updated 2 years ago