gombru / LearnFromWebData

Code used in the paper "Learning to Learn from Web Data through Deep Semantic Embeddings" ECCV 2018 MULA Workshop

☆11

Related projects

Alternatives and complementary repositories for LearnFromWebData

facebookresearch / GDT
We present a framework for training multi-modal deep learning models on unlabelled video data by forcing the network to learn invariances…
☆45 · Updated 3 years ago
airsplay / vimpac
☆74 · Updated 2 years ago
princetonvisualai / SPICE-U
☆11 · Updated 4 years ago
papermsucode / mdmmt
MDMMT: Multidomain Multimodal Transformer for Video Retrieval
☆26 · Updated 3 years ago
sangho-vision / avbert
☆31 · Updated 3 years ago
kittenish / Frame-Transformer-Network
Released code and data for "Frame-Transformer Emotion Classification Network." ICMR 2017
☆17 · Updated 7 years ago
dddzg / unimoco
UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning
☆53 · Updated 3 years ago
sairin1202 / Commonsense-Knowledge-Aware-Concept-Selection-For-Diverse-and-Informative-Visual-Storytelling
The implementation of Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling
☆11 · Updated 3 years ago
littleredxh / HardNegative
☆51 · Updated 3 years ago
showlab / DemoVLP
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆21 · Updated 2 years ago
IBM / AdaMML
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
☆50 · Updated 2 years ago
google-research / trecs_image_generation
☆24 · Updated 3 years ago
guilk / VLC
Research code for "Training Vision-Language Transformers from Captions Alone"
☆33 · Updated 2 years ago
jayleicn / mTVRetrieval
[ACL 2021] mTVR: Multilingual Video Moment Retrieval
☆26 · Updated 2 years ago
yldcs / Unsupervised_Text-to-Image_Synthesis
Implementation of our PR 2020 paper: Unsupervised Text-to-Image Synthesis
☆13 · Updated 4 years ago
yj-yu / lsmdc
☆31 · Updated 6 years ago
ycxioooong / MovieSynopsisAssociation
Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019
☆51 · Updated 4 years ago
Deferf / CLIP_Video_Representation
Use CLIP to represent video for retrieval tasks
☆69 · Updated 3 years ago
nishantrai18 / cocon
CoCon: Cooperative Contrastive Learning
☆20 · Updated 2 years ago
amazon-science / gluonmm
A library of transformer models for computer vision and multi-modality research
☆49 · Updated 3 years ago
TengdaHan / ActionClassification
Video action classification benchmark for common CNN architectures, implemented in PyTorch
☆11 · Updated 2 years ago
MLforHealth / S2SD
(ICML 2021) Implementation for S2SD - Simultaneous Similarity-based Self-Distillation for Deep Metric Learning. Paper Link: https://arxiv…
☆41 · Updated 4 years ago
KaihuaTang / VCTree-Visual-Question-Answering
Code for the Visual Question Answering (VQA) part of CVPR 2019 oral paper: "Learning to Compose Dynamic Tree Structures for Visual Contex…
☆35 · Updated 5 years ago
Dyfine / SphericalEmbedding
Official PyTorch implementation of "Deep Metric Learning with Spherical Embedding", NeurIPS 2020
☆41 · Updated 3 years ago
lucidrains / omninet-pytorch
Implementation of OmniNet, Omnidirectional Representations from Transformers, in PyTorch
☆56 · Updated 3 years ago
antoine77340 / RareAct
RareAct: A video dataset of unusual interactions
☆32 · Updated 4 years ago
yonatanbitton / data_efficient_masked_language_modeling_for_vision_and_language
Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".
☆17 · Updated 3 years ago
sjenni / temporal-ssl
Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
☆48 · Updated 3 years ago
StanLei52 / TQVSR
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆23 · Updated last year
searobbersduck / MoCo_v3_pytorch
A PyTorch implementation of MoCo v3
☆32 · Updated 3 years ago