jhuang81/weak-sup-visual-grounding

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jhuang81/weak-sup-visual-grounding)

jhuang81 / weak-sup-visual-grounding

The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation.

☆12

Alternatives and similar repositories for weak-sup-visual-grounding

Users that are interested in weak-sup-visual-grounding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LaVi-Lab / Rethink_CoT_Video
View on GitHub
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21Dec 14, 2025Updated 7 months ago
PKU-VaLuE-Lab / m3eval
View on GitHub
Official code for M3Eval: Multi-Modal Memory Evaluation through Cognitively-Grounded Video Tasks
☆21Jun 4, 2026Updated last month
youngfly11 / ReIR-WeaklyGrounding.pytorch
View on GitHub
The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021
☆28Oct 9, 2021Updated 4 years ago
dragonlzm / PAVE
View on GitHub
This repo holds the implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR2025)
☆27Sep 6, 2025Updated 10 months ago
qinzzz / Multimodal-Alignment-Framework
View on GitHub
Implementation for MAF: Multimodal Alignment Framework
☆45Nov 25, 2020Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
dabeschte / DeltaCNN
View on GitHub
☆13Jun 26, 2022Updated 4 years ago
PeihaoChen / WS-MGMap
View on GitHub
Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…
☆35Apr 23, 2023Updated 3 years ago
facebookresearch / ProcedureVRL
View on GitHub
[CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"
☆56Aug 8, 2023Updated 2 years ago
STaoZWT / JiZhangApp
View on GitHub
An Offline Account Book App made by HITSZ (Harbin Institute of Technology, Shenzhen) Software Engineering (Fall 2020) Group 19
☆19Nov 4, 2020Updated 5 years ago
baoqianyue / DFC2021-Track-MSD
View on GitHub
Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD
☆10Mar 31, 2021Updated 5 years ago
YiwuZhong / SGG_from_NLS
View on GitHub
[ICCV 2021] Official code for "Learning to Generate Scene Graph from Natural Language Supervision"
☆100Apr 4, 2023Updated 3 years ago
abrarmajeedi / rica2_aqa
View on GitHub
Code release for RICA^2: Rubric-Informed, Calibrated Assessment of Actions (ECCV 2024)
☆15Nov 9, 2025Updated 8 months ago
StarsThu2016 / ApproxDet
View on GitHub
☆12Nov 16, 2020Updated 5 years ago
zjuchenlong / faster-rcnn.pytorch
View on GitHub
fork from https://github.com/jwyang/faster-rcnn.pytorch
☆10Aug 6, 2018Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
zhuoyan-xu / Foundation-Model_Multitask
View on GitHub
☆17Mar 14, 2024Updated 2 years ago
SijieSong / CVPR21-Cogrounding_semantic_attention
View on GitHub
☆14Jul 13, 2021Updated 5 years ago
svip-lab / LBYLNet
View on GitHub
[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.
☆50Aug 31, 2021Updated 4 years ago
claws-lab / multimodal-robustness
View on GitHub
Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'
☆10Mar 11, 2024Updated 2 years ago
uwgraphics / LCSPCData
View on GitHub
Dataset of measurements from a low-cost single-photon camera used in our CVPR 2024 paper "Towards 3D Vision with Low-Cost Single-Photon C…
☆15Nov 24, 2025Updated 8 months ago
nithintata / image-caption-generator-using-deep-learning
View on GitHub
Automatically generates captions for an image using Image processing and NLP. Model was trained on Flickr30K dataset.
☆11Jun 11, 2020Updated 6 years ago
zhangzhao156 / Human-Activity-Recognition-Codes-Datasets
View on GitHub
The comparsion methods code
☆12Mar 7, 2022Updated 4 years ago
llm-jp / llm-jp-model-playground
View on GitHub
Interactive application to verify multiple LLMs
☆14Feb 20, 2024Updated 2 years ago
jshi31 / NAFAE
View on GitHub
Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…
☆30Jun 29, 2020Updated 6 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
LaVi-Lab / AIM
View on GitHub
[ICCV 2025] Official code for "AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning"
☆65Oct 9, 2025Updated 9 months ago
zhujiagang / st-gcn-data-len
View on GitHub
☆13Mar 22, 2018Updated 8 years ago
zhoujuncc1 / shenjingcat
View on GitHub
☆14Mar 8, 2023Updated 3 years ago
RuipingL / OpenSU
View on GitHub
IEEE/CVF International Conference on Computer Vision Workshop (2023)
☆17Feb 7, 2024Updated 2 years ago
BinWang28 / Sentence-Embedding-S3E
View on GitHub
Efficient Sentence Embedding via Semantic Subspace Analysis
☆14Feb 25, 2020Updated 6 years ago
fmu2 / nlos3d
View on GitHub
☆18Dec 23, 2022Updated 3 years ago
ku-nlp / kyoto-reader
View on GitHub
A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus
☆10Jun 26, 2024Updated 2 years ago
AlenUbuntu / Awesome-Vision-and-Language-PreTrain-Papers
View on GitHub
☆14Dec 25, 2020Updated 5 years ago
zeeshank95 / GVSR
View on GitHub
☆14Dec 9, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ImperialNLP / BertGen
View on GitHub
Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)
☆11Sep 17, 2023Updated 2 years ago
CYVincent / Scene-Graph-Transformer-CogTree
View on GitHub
☆15Jun 11, 2021Updated 5 years ago
jdtoscano94 / Hybrid-RL-GAN-Point_Cloud_Completion
View on GitHub
Teeth Mold Point Cloud Completion Via Data Augmentation and Hybrid RL-GAN (Paper Code)
☆13May 23, 2023Updated 3 years ago
ht014 / SG2HOI
View on GitHub
☆12Sep 19, 2021Updated 4 years ago
baoqianyue / ImageProcessSamples
View on GitHub
Some examples of image processing based on Opencv
☆17Feb 22, 2019Updated 7 years ago
SkrighYZ / FGVE
View on GitHub
Code accompanying paper "Fine-Grained Visual Entailment" [ECCV 2022].
☆11Oct 31, 2022Updated 3 years ago
Graph-COM / HEPT
View on GitHub
[ICML 2024 Oral] LSH-Based Efficient Point Transformer (HEPT)
☆27Jan 24, 2025Updated last year