zjh31/CPL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zjh31/CPL)

zjh31 / CPL

☆21

Alternatives and similar repositories for CPL

Users that are interested in CPL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kingthreestones / RefCLIP
View on GitHub
☆39Jun 28, 2023Updated 3 years ago
baopj / E3M
View on GitHub
[ECCV 2024] The first zero-shot setting for spatio-temporal video grounding.
☆11Jul 16, 2024Updated 2 years ago
ZzZZCHS / WS-3DVG
View on GitHub
[ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
☆14Oct 2, 2024Updated last year
qinzzz / Multimodal-Alignment-Framework
View on GitHub
Implementation for MAF: Multimodal Alignment Framework
☆45Nov 25, 2020Updated 5 years ago
liuting20 / DARA
View on GitHub
[ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
☆22Feb 26, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
LeapLabTHU / Pseudo-Q
View on GitHub
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153Jul 13, 2024Updated 2 years ago
xulingjing88 / WSMA
View on GitHub
[AAAI 2024]Weakly Supervised Multimodal Affordance Grounding for Egocentric Images
☆13Nov 10, 2024Updated last year
insomnia94 / DTWREG
View on GitHub
Preliminary code for reviewers
☆12Mar 30, 2021Updated 5 years ago
HengLan / CGSTVG
View on GitHub
[CVPR 2024] Context-Guided Spatio-Temporal Video Grounding
☆66Jun 28, 2024Updated 2 years ago
baopj / Vid-Morp
View on GitHub
☆12Dec 6, 2024Updated last year
LouChao98 / VLGAE
View on GitHub
Official Implementation for CVPR 2022 paper "Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language …
☆24Oct 19, 2022Updated 3 years ago
ErikStammes / EADER
View on GitHub
End-to-End Adversarial Erasing for Weakly Supervised Semantic Segmentation
☆15Nov 15, 2020Updated 5 years ago
mengcaopku / DCNet
View on GitHub
[ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension
☆15Sep 4, 2022Updated 3 years ago
dieuroi / SimAesthetics
View on GitHub
☆10May 18, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
linhuixiao / CLIP-VG
View on GitHub
[TMM 2023] Self-paced Curriculum Adapting of CLIP for Visual Grounding.
☆135Nov 10, 2025Updated 8 months ago
W-Wu / DEER
View on GitHub
☆12Aug 25, 2023Updated 2 years ago
luo3300612 / LaRE
View on GitHub
Official code for LaRE2: Latent Reconstruction Error Based Method for Diffusion-Generated Image Detection. (CVPR 2024)
☆54Dec 3, 2024Updated last year
AI9Stars / MM-UAVBench
View on GitHub
Code for "MM-UAVBench: How Well Do Multimodal Large Language Models See, Think, and Plan in Low-Altitude UAV Scenarios?"
☆20Jan 18, 2026Updated 6 months ago
li-jl16 / LORS
View on GitHub
CVPR2024 highlight.
☆13Oct 10, 2024Updated last year
iQua / M-DGT
View on GitHub
The source code of the CVPR22 paper titled "Multi-Modal Dynamic Graph Transformer for Visual Grounding".
☆22Mar 26, 2022Updated 4 years ago
liunian-harold-li / DesCo
View on GitHub
☆30Mar 13, 2024Updated 2 years ago
eslambakr / LAR-Look-Around-and-Refer
View on GitHub
This is the official implementation for our paper;"LAR:Look Around and Refer".
☆30Dec 1, 2022Updated 3 years ago
ycWang9725 / WSTAN
View on GitHub
☆16Dec 21, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rmcong / TNet_TMM2022
View on GitHub
☆15Jan 17, 2023Updated 3 years ago
CASIA-IVA-Lab / SC-Tune
View on GitHub
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
☆16Apr 22, 2024Updated 2 years ago
rmcong / GLNet_TCYB2022
View on GitHub
The results and code of our IEEE TCYB 2022 paper, titled "Global-and-Local Collaborative Learning for Co-Salient Object Detection"
☆13May 2, 2022Updated 4 years ago
THU-MIG / Consolidator
View on GitHub
Official implementation for ICLR 2023 paper Consolidator: Mergeable Adapter with Grouped Connections for Visual Adaptation
☆16Jan 23, 2024Updated 2 years ago
yuanzhoulvpi2017 / yuanzhoulvpi2017
View on GitHub
personal info
☆11Mar 23, 2024Updated 2 years ago
leolyj / 3D-VLP
View on GitHub
This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).
☆29Jun 15, 2023Updated 3 years ago
liupeng0606 / clip4caption
View on GitHub
The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)
☆16Jan 2, 2023Updated 3 years ago
PPjmchen / HAM
View on GitHub
☆17Jul 8, 2023Updated 3 years ago
toheart / cocursor
View on GitHub
☆18Feb 9, 2026Updated 5 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
luogen1996 / SimREC
View on GitHub
A lightweight codebase for referring expression comprehension and segmentation
☆57May 21, 2022Updated 4 years ago
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
longmalongma / TW-GRPO
View on GitHub
The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"
☆36Jun 12, 2025Updated last year
youngfly11 / ReIR-WeaklyGrounding.pytorch
View on GitHub
The official PyTorch code for "Relation-aware Instance Refinement for Weakly Supervised Visual Grounding" accepted by CVPR2021
☆28Oct 9, 2021Updated 4 years ago
dfki-av / MiKASA-3DVG
View on GitHub
[CVPR'24] MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding
☆18Dec 13, 2024Updated last year
LukeForeverYoung / QRNet
View on GitHub
☆41Jun 3, 2022Updated 4 years ago
zlccccc / 3DVG-Transformer
View on GitHub
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
☆43Jul 6, 2022Updated 4 years ago