[EMNLP 2024 Main] MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension
☆16Jan 6, 2025Updated last year
Alternatives and similar repositories for MaPPER
Users that are interested in MaPPER are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Transactions on Multimedia (TMM25)☆19Apr 8, 2025Updated last year
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- [AAAI-2025] The official code of Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation☆72May 21, 2025Updated 11 months ago
- Explicit Context Reasoning with Supervision for Visual Tracking (ACM MM 25)☆18Jul 20, 2025Updated 9 months ago
- DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ ent…☆210Apr 9, 2026Updated 3 weeks ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆284Mar 14, 2026Updated last month
- ☆23Aug 20, 2024Updated last year
- H2ASeg: Hierarchical Adaptive Interaction and Weighting Network for Tumor Segmentation in PET/CT Images☆20May 29, 2025Updated 11 months ago
- ☆16Apr 4, 2022Updated 4 years ago
- ☆10Nov 12, 2024Updated last year
- Diverse Demonstrations Improve In-context Compositional Generalization☆12Jul 7, 2023Updated 2 years ago
- SAM4SS: Tailoring SAM and SAM2 for Semantic Segmentation☆11Jul 31, 2024Updated last year
- ☆12Aug 7, 2022Updated 3 years ago
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆14Dec 6, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official implementation of dLLM-Var☆32Nov 6, 2025Updated 5 months ago
- The official implementation for the CVPR'2025 paper Dynamic Updates for Language Adaptation in Visual-Language Tracking☆39Mar 27, 2025Updated last year
- ☆10Jun 13, 2023Updated 2 years ago
- [IJCV] Progressive Visual Prompt Learning with Contrastive Feature Re-formation☆15Aug 10, 2024Updated last year
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- 东北大学2019届[2015级]本科毕设Latex模版☆11Jun 16, 2019Updated 6 years ago
- Implementation of Weakly Supervised Deep Detection Networks with PyTorch☆12Dec 7, 2022Updated 3 years ago
- ☆12Jul 24, 2023Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Jul 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [TMM 2025] This is the official Pytorch code for our paper "Visual Position Prompt for MLLM based Visual Grounding".☆29Jul 23, 2025Updated 9 months ago
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆216Mar 14, 2025Updated last year
- [AAAI 2026] Global Compression Commander: Plug-and-Play Inference Acceleration for High-Resolution Large Vision-Language Models☆42Jan 27, 2026Updated 3 months ago
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Feb 28, 2023Updated 3 years ago
- ☆14Jul 8, 2024Updated last year
- An implementation of the Distance-Vector Routing Protocol using the Bellman Ford Algorithm☆10Feb 6, 2018Updated 8 years ago
- Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"☆20Dec 11, 2023Updated 2 years ago
- ☆19Jul 15, 2022Updated 3 years ago
- ☆10Mar 31, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A vision-language model with bidirectional progressive fusion and global-local alignment for enhanced medical image segmentation.☆18Dec 25, 2025Updated 4 months ago
- [WWW24-UrbanCLIP] A comprehensive toolkit designed to facilitate the collection, processing, and integration of satellite imagery and ass…☆18Oct 6, 2024Updated last year
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated 2 years ago
- Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recog…☆20Mar 13, 2024Updated 2 years ago
- [ACM MM2024] The code for HMLLM.☆11Oct 27, 2024Updated last year
- [CVPR2024] GSVA: Generalized Segmentation via Multimodal Large Language Models☆167Sep 12, 2024Updated last year
- Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image Analysis☆18Jan 7, 2026Updated 3 months ago