gombru / LearnFromWebData
Code used in the paper "Learning to Learn from Web Data through Deep Semantic Embeddings" ECCV 2018 MULA Workshop
☆11Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for LearnFromWebData
- We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…☆45Updated 3 years ago
- ☆74Updated 2 years ago
- ☆11Updated 4 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Updated 3 years ago
- ☆31Updated 3 years ago
- Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017☆17Updated 7 years ago
- UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning☆53Updated 3 years ago
- sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-StorytellingThe implement of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling☆11Updated 3 years ago
- ☆51Updated 3 years ago
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 2 years ago
- Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.☆50Updated 2 years ago
- ☆24Updated 3 years ago
- Research code for "Training Vision-Language Transformers from Captions Alone"☆33Updated 2 years ago
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆26Updated 2 years ago
- Implementation of our PR 2020 paper:Unsupervised Text-to-Image Synthesis☆13Updated 4 years ago
- ☆31Updated 6 years ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019☆51Updated 4 years ago
- Use CLIP to represent video for Retrieval Task☆69Updated 3 years ago
- CoCon: Cooperative Contrastive Learning☆20Updated 2 years ago
- A library of transformer models for computer vision and multi-modality research☆49Updated 3 years ago
- Video action classification benchmark for common CNN architectures, implemented in PyTorch☆11Updated 2 years ago
- (ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv…☆41Updated 4 years ago
- Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contex…☆35Updated 5 years ago
- official pytorch implementation of "Deep Metric Learning with Spherical Embedding", NeurIPS 2020☆41Updated 3 years ago
- Implementation of OmniNet, Omnidirectional Representations from Transformers, in Pytorch☆56Updated 3 years ago
- RareAct: A video dataset of unusual interactions☆32Updated 4 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆17Updated 3 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆48Updated 3 years ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Updated last year
- a pytorch implementation for MoCo V3☆32Updated 3 years ago