jacobswan1 / MTG-pytorch
Gender/Age attribute grounding using weak supervised manner.
☆12Updated 5 years ago
Alternatives and similar repositories for MTG-pytorch:
Users that are interested in MTG-pytorch are comparing it to the libraries listed below
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Updated 3 years ago
- The HC-STVG Dataset☆56Updated 2 years ago
- Knowledge-guided Pairwise Reconstruction Network for Weakly Supervised Referring Expression Grounding☆4Updated 4 years ago
- Placeholder for code of BSP.☆11Updated 3 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆42Updated 2 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Updated 2 years ago
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021☆27Updated 3 years ago
- Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)☆33Updated 3 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆34Updated 5 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Updated 4 years ago
- ☆29Updated last year
- [BMVC 2021]: Official PyTorch implementation of : "Few Shot Temporal Action Localization using Query Adaptive Transformers"☆21Updated 2 years ago
- [ECCV'22 Poster] Explicit Image Caption Editing☆22Updated 2 years ago
- This is an implementation of "Grounding of Textual Phrases in Images by Reconstruction" in PyTorch☆17Updated 5 years ago
- Code for CVPR 19 Paper "Improving Referring Expression Grounding with Cross-modal Attention-guided Erasing"☆33Updated 5 years ago
- ☆16Updated 4 years ago
- PIC Challenge Baseline☆19Updated 6 years ago
- Code for ECCV 2020 paper "Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language"☆17Updated 4 years ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…☆46Updated last year
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learning☆15Updated 3 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆24Updated 3 years ago
- [ICCV2021] Generic Event Boundary Detection: A Benchmark for Event Segmentation☆68Updated 3 years ago
- ☆22Updated 3 years ago
- 🚴♂️ ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection (MM 2020)☆32Updated last year
- AAAI2020-The official implementation of "Learning Cross-modal Context Graph for Visual Grounding"☆58Updated 3 years ago
- ☆16Updated 3 months ago
- Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)☆20Updated 3 years ago
- ☆19Updated 2 years ago
- ☆26Updated 3 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆73Updated 4 years ago