Mr-Bigworth/MMCA

Mr-Bigworth / MMCA

Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)

☆26

Alternatives and similar repositories for MMCA

Users who are interested in MMCA are comparing it to the libraries listed below.

linhuixiao / OneRef
[NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.
☆32 · Nov 13, 2025 · Updated 8 months ago
Lens4MLLMs / LENS
☆29 · Feb 13, 2026 · Updated 5 months ago
WayneTomas / TransCP
[TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…
☆28 · May 8, 2025 · Updated last year
LukeForeverYoung / QRNet
☆41 · Jun 3, 2022 · Updated 4 years ago
Dmmm1997 / SimVG
[NeurIPS2024] - SimVG: A Simple Framework for Visual Grounding with Decoupled Multi-modal Fusion
☆103 · Oct 29, 2025 · Updated 9 months ago
top-yun / SPARK
A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.
☆19 · Dec 27, 2024 · Updated last year
linhuixiao / HiVG
[ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.
☆65 · Nov 10, 2025 · Updated 8 months ago
zjh31 / CPL
☆21 · Apr 2, 2024 · Updated 2 years ago
swagger-coder / ASDA
This is an official PyTorch implementation of ASDA (accepted by ACMMM 2024).
☆26 · Oct 22, 2024 · Updated last year
cv516Buaa / OV-VG
☆31 · Mar 25, 2024 · Updated 2 years ago
qzp2018 / MCLN
This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…
☆27 · Oct 10, 2024 · Updated last year
heitorrapela / ModPrompt
[ICCV2025] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors
☆28 · Jul 10, 2025 · Updated last year
TencentYoutuResearch / ImageColorization-ColorFormer
Code for ECCV 2022 paper "ColorFormer: Image Colorization via Color Memory assisted Hybrid-attention Transformer"
☆12 · Jan 30, 2023 · Updated 3 years ago
yuanzhoulvpi2017 / yuanzhoulvpi2017
personal info
☆11 · Mar 23, 2024 · Updated 2 years ago
YijinHuang / FPT
[TNNLS'25] [MICCAI'24] A Parameter and Memory Efficient Transfer Learning Method
☆35 · Oct 29, 2025 · Updated 8 months ago
zdaiot / wiznote2hexo2csdn
Converts WizNote markdown to Hexo blog markdown, and Hexo blog markdown to markdown with externally hosted images (can be copied directly to CSDN, Jianshu, etc.)
☆10 · Oct 29, 2019 · Updated 6 years ago
Pbihao / HDMNet
☆118 · Jun 7, 2024 · Updated 2 years ago
LRQ577 / FAITH
Code release of paper "FAITH: Frequency-domain Attention In Two Horizons for Long-term time series forecasting"
☆19 · Jun 20, 2025 · Updated last year
toheart / cocursor
☆18 · Feb 9, 2026 · Updated 5 months ago
yeppp27 / VisualScore
☆21 · May 28, 2026 · Updated 2 months ago
henghuiding / gRefCOCO
A benchmark dataset for GREx: GRES, GREC, and GREG [CVPR 2023 & IJCV 2026]
☆241 · Nov 14, 2025 · Updated 8 months ago
luxiaolili / HunyuanOCR_Train
☆18 · Jan 14, 2026 · Updated 6 months ago
ajhamdi / vointcloud
Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding (ICLR 2023)
☆22 · May 2, 2023 · Updated 3 years ago
tonyzyl / Semisupervised-VAE-for-Regression-Application-on-Soft-Sensor
An semi-supervised extension based on VAE for Regression, demonstrate its performance on two soft sensor benchmark problems.
☆27 · Aug 15, 2023 · Updated 2 years ago
JLUtangchuan / Parts2Words
This is the source code of Part2Word: Learning Joint Embedding of Point Clouds and Text by Bidirectional Matching between Parts and Words
☆16 · Mar 22, 2023 · Updated 3 years ago
AntXinyuan / SSP
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection
☆13 · Jul 7, 2026 · Updated 3 weeks ago
pumpkin805 / FALIP
[ECCV2024]FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
☆18 · Sep 11, 2024 · Updated last year
Ivan-Tang-3D / ViewRefer3D
(ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'…
☆60 · Apr 18, 2024 · Updated 2 years ago
tiangeluo / RegionFocus
A simple visual test-time scaling method for GUI agent grounding
☆26 · Dec 7, 2025 · Updated 7 months ago
RenshengJi / C-CoTTA
The official repository of C-CoTTA: Controllable Continual Test-Time Adaptation
☆10 · Jun 17, 2024 · Updated 2 years ago
dengandong / GroundMoRe
☆18 · May 18, 2026 · Updated 2 months ago
tomguluson92 / EraseAnything
EraseAnything, ICML 2025
☆43 · Sep 28, 2025 · Updated 10 months ago
xxyzll / UMB
UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)
☆12 · May 26, 2024 · Updated 2 years ago
Gorilla-Lab-SCUT / TTAC2
[TPAMI 2024] The official implementation of "Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clu…
☆13 · Mar 19, 2024 · Updated 2 years ago
sunwei925 / UIQA
Official Code for Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency
☆24 · Jun 10, 2025 · Updated last year
yangli18 / VLTVG
Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022
☆97 · Dec 2, 2022 · Updated 3 years ago
DYZhang09 / ViTWSS3D
[ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection
☆13 · Apr 12, 2024 · Updated 2 years ago
RobertLuo1 / CoHD
The official implementation of A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
☆27 · Aug 17, 2025 · Updated 11 months ago
EndoluminalSurgicalVision-IMR / PASS
[IEEE TMI 2024] PASS: Prompt tuning for both styles and semantic shapes
☆20 · Feb 12, 2025 · Updated last year