[TMM 2025] This is the official Pytorch code for our paper "Visual Position Prompt for MLLM based Visual Grounding".
☆27Jul 23, 2025Updated 7 months ago
Alternatives and similar repositories for VPP-LLaVA
Users that are interested in VPP-LLaVA are comparing it to the libraries listed below
Sorting:
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆14Dec 4, 2025Updated 3 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆18Feb 14, 2026Updated 2 weeks ago
- The detector of wire's breakage in power system. (Python, Opencv...)☆12Oct 29, 2018Updated 7 years ago
- Image-processing filters implemented on GPU with OpenCL☆12Jun 7, 2022Updated 3 years ago
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- ☆13May 21, 2023Updated 2 years ago
- [ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"☆10Sep 28, 2024Updated last year
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos☆16Aug 17, 2023Updated 2 years ago
- IntelligetnScissor implemented by C++.☆12Apr 20, 2018Updated 7 years ago
- ☆12Aug 19, 2023Updated 2 years ago
- ☆17Dec 23, 2025Updated 2 months ago
- Implementation example of Distributed Tensorflow☆10Jul 22, 2017Updated 8 years ago
- Official PyTorch Implementation of Exploring Stochastic Autoregressive Image Modeling for Visual Representation, Accepted by AAAI 2023.☆16Jul 3, 2023Updated 2 years ago
- My DirectX 11 MLAA & SMAA implementation☆12Jun 9, 2020Updated 5 years ago
- ☆13Aug 9, 2022Updated 3 years ago
- This project reproduces the application of the paper SVLRM on depth image sampling, depth image restoration, natural image denoising, etc…☆12Jun 6, 2020Updated 5 years ago
- 用python和c++实现了原始的LBP、圆形LBP、旋转不变的圆形LBP、等价模式的圆形LBP、旋转不变的等价模式的圆形LBP。python版本LBP用于模型训练,c++版本LBP用于模型部署。☆11Aug 27, 2018Updated 7 years ago
- ☆16Oct 11, 2025Updated 4 months ago
- official code for Dynamic Smooth Label Assignment☆11Oct 5, 2022Updated 3 years ago
- 2020年秋国科大模式识别(刘成林、向世明、张煦尧)课后作业☆10Feb 3, 2021Updated 5 years ago
- Re-implementation of Progressive Neural Networks with PyTorch☆15Jul 25, 2024Updated last year
- [ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".☆13Jan 16, 2022Updated 4 years ago
- ☆11Mar 11, 2025Updated 11 months ago
- ☆12Jan 28, 2021Updated 5 years ago
- RPIfield dataset for Person Re-identification☆13Aug 17, 2020Updated 5 years ago
- ☆46Dec 30, 2024Updated last year
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆18Feb 29, 2024Updated 2 years ago
- ☆13Dec 12, 2024Updated last year
- Unofficial version of LaneExtraction☆13Oct 12, 2022Updated 3 years ago
- [CVPR2016] Laplacian Patch-Based Image Synthesis☆13May 8, 2017Updated 8 years ago
- Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations.☆11Dec 20, 2018Updated 7 years ago
- Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks☆12Apr 13, 2020Updated 5 years ago
- ☆14Oct 10, 2022Updated 3 years ago
- [CVPR2025] Code Release of Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception☆21Jun 17, 2025Updated 8 months ago
- Exploring CoT-Decoding from Google DeepMind's paper, "Chain-of-Thought Reasoning Without Prompting".☆13Feb 22, 2024Updated 2 years ago
- Training scripts and Python modules for the ECCV 2018 paper "Interpolating Convolutional Networks Using Batch Normalization"☆11Jun 13, 2020Updated 5 years ago
- NeurIPS 2020 paper: UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging☆10Oct 24, 2021Updated 4 years ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated last year