jhuang81 / weak-sup-visual-grounding
The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.
☆12 · Oct 15, 2021 · Updated 4 years ago
Alternatives and similar repositories for weak-sup-visual-grounding
Users interested in weak-sup-visual-grounding are comparing it to the repositories listed below.
- Official code for "Rethinking Chain-of-Thought Reasoning for Videos" ☆20 · Dec 14, 2025 · Updated 2 months ago
- The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021 ☆27 · Oct 9, 2021 · Updated 4 years ago
- The implementation of "A Simple Baseline for Weakly-Supervised Scene Graph Generation" for ICCV2021 ☆15 · Aug 17, 2021 · Updated 4 years ago
- This repo holds the implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR2025) ☆26 · Sep 6, 2025 · Updated 5 months ago
- Implementation for MAF: Multimodal Alignment Framework ☆46 · Nov 25, 2020 · Updated 5 years ago
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati… ☆33 · Apr 23, 2023 · Updated 2 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los… ☆30 · Jun 29, 2020 · Updated 5 years ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021) ☆11 · Sep 17, 2023 · Updated 2 years ago
- Automatically generates captions for an image using Image processing and NLP. Model was trained on Flickr30K dataset. ☆11 · Jun 11, 2020 · Updated 5 years ago
- ☆10 · Jun 21, 2024 · Updated last year
- Course review and timetable planning platform used by thousands of CUHK students ☆13 · Aug 19, 2024 · Updated last year
- A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus ☆10 · Jun 26, 2024 · Updated last year
- ☆12 · Sep 19, 2021 · Updated 4 years ago
- Pytorch code for NODIS: Neural Ordinary Differential Scene Understanding, ECCV2020 ☆11 · Aug 28, 2020 · Updated 5 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD ☆10 · Mar 31, 2021 · Updated 4 years ago
- Latex template for CUHK PhD Thesis ☆11 · Jun 29, 2025 · Updated 7 months ago
- Dataset of measurements from a low-cost single-photon camera used in our CVPR 2024 paper "Towards 3D Vision with Low-Cost Single-Photon C… ☆11 · Nov 24, 2025 · Updated 2 months ago
- ☆11 · Mar 8, 2023 · Updated 2 years ago
- [TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions" ☆11 · Apr 19, 2022 · Updated 3 years ago
- Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions' ☆10 · Mar 11, 2024 · Updated last year
- Code for paper titled, "Learning to Predict Task Progress by Self-Supervised Video Alignment" by Gerard Donahue and Ehsan Elhamifar, publ… ☆16 · Jul 26, 2024 · Updated last year
- [ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision" ☆101 · Apr 4, 2023 · Updated 2 years ago
- Interactive application to verify multiple LLMs ☆13 · Feb 20, 2024 · Updated last year
- ☆17 · Mar 14, 2024 · Updated last year
- ☆14 · Dec 25, 2020 · Updated 5 years ago
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding. ☆49 · Aug 31, 2021 · Updated 4 years ago
- ☆10 · Jun 1, 2019 · Updated 6 years ago
- Code release for RICA^2: Rubric-Informed, Calibrated Assessment of Actions (ECCV 2024) ☆13 · Nov 9, 2025 · Updated 3 months ago
- [ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning" ☆54 · Oct 9, 2025 · Updated 4 months ago
- Rethinking the Trust Region in LLM Reinforcement Learning ☆34 · Feb 5, 2026 · Updated last week
- ☆14 · Jul 13, 2021 · Updated 4 years ago
- [ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction ☆10 · Aug 17, 2023 · Updated 2 years ago
- fork from https://github.com/jwyang/faster-rcnn.pytorch ☆10 · Aug 6, 2018 · Updated 7 years ago
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations" ☆56 · Aug 8, 2023 · Updated 2 years ago
- Code release for DeepEDM (ICML 2025) ☆27 · Jan 20, 2026 · Updated 3 weeks ago
- ☆13 · Mar 22, 2018 · Updated 7 years ago
- Teeth Mold Point Cloud Completion Via Data Augmentation and Hybrid RL-GAN (Paper Code) ☆13 · May 23, 2023 · Updated 2 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis ☆14 · Feb 25, 2020 · Updated 5 years ago
- pre-trained vision and language model summary ☆12 · Apr 20, 2021 · Updated 4 years ago