WayneTomas/TransCP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WayneTomas/TransCP)

WayneTomas / TransCP

[TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding".

☆28

Alternatives and similar repositories for TransCP

Users that are interested in TransCP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jcwang0602 / PLVL
View on GitHub
Progressive Language-guided Visual Learning for Multi-Task Visual Grounding
☆13May 9, 2025Updated last year
WayneTomas / Balance-Constraint-KMeans
View on GitHub
[Symmetry 2019] This is the Matlab code for our paper "Optimizing MSE for Clustering with Balanced Size Constraints".
☆20Mar 25, 2025Updated last year
Mr-Bigworth / MMCA
View on GitHub
Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)
☆26Jun 11, 2025Updated last year
kevendai / fandp-ijcai2025-issues
View on GitHub
☆17Oct 13, 2025Updated 9 months ago
May2333 / FDCA
View on GitHub
[ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…
☆23Jul 28, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
WayneTomas / VPP-LLaVA
View on GitHub
[TMM 2025] This is the official Pytorch code for our paper "Visual Position Prompt for MLLM based Visual Grounding".
☆31Jul 23, 2025Updated 11 months ago
Dmmm1997 / SimVG
View on GitHub
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103Oct 29, 2025Updated 8 months ago
uvavision / AMC-grounding
View on GitHub
[CVPR 2023] Code for "Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations"
☆19Oct 10, 2023Updated 2 years ago
GalaxyCong / HPMDubbing_Vocoder
View on GitHub
16k Hz Vocoder (HiFiGAN Codes and Pretrained Models)
☆18Apr 3, 2023Updated 3 years ago
chenwei746 / EEVG
View on GitHub
☆23Aug 20, 2024Updated last year
ictnlp / LSG
View on GitHub
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15Jan 3, 2025Updated last year
fawazsammani / awesome-self-supervised-vision
View on GitHub
Awesome Self-Supervised Vision Learning
☆11Mar 27, 2024Updated 2 years ago
tuyunbin / Video-Description-with-Spatial-Temporal-Attention
View on GitHub
[ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"
☆61Oct 20, 2020Updated 5 years ago
yxchng / mask-grounding
View on GitHub
[CVPR2024] Mask Grounding for Referring Image Segmentation
☆29Jul 22, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
NeverMoreLCH / Awesome-Video-Grounding
View on GitHub
A reading list of papers about Visual Grounding.
☆31Aug 24, 2022Updated 3 years ago
tuyunbin / SCORER
View on GitHub
[ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".
☆20Sep 25, 2025Updated 9 months ago
GalaxyCong / StyleDubber
View on GitHub
[ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"
☆98Nov 14, 2024Updated last year
baoqianyue / DFC2021-Track-MSD
View on GitHub
Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD
☆10Mar 31, 2021Updated 5 years ago
lijun2005 / ICML26-Holmes
View on GitHub
[ICML 2026] Revisiting Uncertainty: On Evidential Learning for Partially Relevant Video Retrieval
☆24Jul 10, 2026Updated last week
AntXinyuan / sph2pob
View on GitHub
(IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods
☆14Aug 23, 2023Updated 2 years ago
wonchulSon / DGKD
View on GitHub
Densely Guided Knowledge Distillation using Multiple Teacher Assistants
☆11Oct 10, 2021Updated 4 years ago
GeWu-Lab / Patch-Matters
View on GitHub
[CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception
☆25Jun 17, 2025Updated last year
zdaiot / wiznote2hexo2csdn
View on GitHub
为知笔记markdown转为hexo博客markdown，hexo博客markdown转外链图片的markdown(可直接复制到csdn、简书等)
☆10Oct 29, 2019Updated 6 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
SijieSong / CVPR21-Cogrounding_semantic_attention
View on GitHub
☆14Jul 13, 2021Updated 5 years ago
HDUyiming / SOCCER
View on GitHub
We are very happy that our work has been accepted by ACM Multimedia 2024！🥰
☆12Jan 8, 2025Updated last year
lx709 / VRSBench
View on GitHub
☆69Jun 11, 2026Updated last month
GalaxyCong / HPMDubbing
View on GitHub
[CVPR 2023] Official code for paper: Learning to Dub Movies via Hierarchical Prosody Models.
☆111Jun 21, 2024Updated 2 years ago
dfki-av / MiKASA-3DVG
View on GitHub
[CVPR'24] MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding
☆18Dec 13, 2024Updated last year
CruiserProject / Cruiser-OnboardROS
View on GitHub
Onboard ROS programs for Cruiser project, implemented on an intelligent drone for security purpose.
☆10Apr 29, 2018Updated 8 years ago
AIGNLAI / GDDSG
View on GitHub
☆22Oct 16, 2025Updated 9 months ago
ajhamdi / vointcloud
View on GitHub
Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding (ICLR 2023)
☆22May 2, 2023Updated 3 years ago
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
RaidenIV / 3D-Spectrogram
View on GitHub
Audio Processing & Visualization Concepts
☆12Jun 20, 2023Updated 3 years ago
WeitaiKang / SegVG
View on GitHub
[ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding
☆63Oct 22, 2024Updated last year
JLUtangchuan / Parts2Words
View on GitHub
This is the source code of Part2Word: Learning Joint Embedding of Point Clouds and Text by Bidirectional Matching between Parts and Words
☆16Mar 22, 2023Updated 3 years ago
yangli18 / VLTVG
View on GitHub
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97Dec 2, 2022Updated 3 years ago
zhujiagang / st-gcn-data-len
View on GitHub
☆13Mar 22, 2018Updated 8 years ago
RuipingL / OpenSU
View on GitHub
IEEE/CVF International Conference on Computer Vision Workshop (2023)
☆17Feb 7, 2024Updated 2 years ago
liuting20 / SwimVG
View on GitHub
Transactions on Multimedia (TMM25)
☆21Apr 8, 2025Updated last year