WayneTomas / VPP-LLaVAView external linksLinks
[TMM 2025] This is the official Pytorch code for our paper "Visual Position Prompt for MLLM based Visual Grounding".
☆27Jul 23, 2025Updated 6 months ago
Alternatives and similar repositories for VPP-LLaVA
Users that are interested in VPP-LLaVA are comparing it to the libraries listed below
Sorting:
- This is the official Pytorch code for our paper "Artemis: Structured Visual Reasoning for Perception Policy Learning".☆14Dec 4, 2025Updated 2 months ago
- Implementation of the paper Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs☆13Jun 7, 2025Updated 8 months ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated 8 months ago
- The detector of wire's breakage in power system. (Python, Opencv...)☆12Oct 29, 2018Updated 7 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆24Jul 21, 2025Updated 6 months ago
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…☆10Feb 9, 2025Updated last year
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- ☆13May 21, 2024Updated last year
- ACM MM 2022 paper_AVQA: A Dataset for Audio-Visual Question Answering on Videos☆15Aug 17, 2023Updated 2 years ago
- Image-processing filters implemented on GPU with OpenCL☆12Jun 7, 2022Updated 3 years ago
- ☆10Jan 28, 2024Updated 2 years ago
- IntelligetnScissor implemented by C++.☆12Apr 20, 2018Updated 7 years ago
- Implementation example of Distributed Tensorflow☆10Jul 22, 2017Updated 8 years ago
- ☆12Aug 19, 2023Updated 2 years ago
- My DirectX 11 MLAA & SMAA implementation☆12Jun 9, 2020Updated 5 years ago
- End-to-end implementation of the Social Graph Network (SGN), described in the Structural Reasoning for Image-based Social Relation Recogn…☆13Apr 3, 2024Updated last year
- Computer Vision Paper Reading☆11Nov 21, 2019Updated 6 years ago
- ☆17Dec 23, 2025Updated last month
- ☆20Jun 12, 2025Updated 8 months ago
- ☆13May 21, 2023Updated 2 years ago
- Official PyTorch Implementation of Exploring Stochastic Autoregressive Image Modeling for Visual Representation, Accepted by AAAI 2023.☆16Jul 3, 2023Updated 2 years ago
- [ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"☆10Sep 28, 2024Updated last year
- RPIfield dataset for Person Re-identification☆13Aug 17, 2020Updated 5 years ago
- 2020年秋国科大模式识别(刘成林、向世明、张煦尧)课后作业☆10Feb 3, 2021Updated 5 years ago
- 用python和c++实现了原始的LBP、圆形LBP、旋转不变的圆形LBP、等价模式的圆形LBP、旋转不变的等价模式的圆形LBP。python版本LBP用于模型训练,c++版本LBP用于模型部署。☆11Aug 27, 2018Updated 7 years ago
- Image inpainting using Gaussian Mixture Models☆10Feb 1, 2024Updated 2 years ago
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free …☆17Aug 26, 2025Updated 5 months ago
- ☆16Oct 11, 2025Updated 4 months ago
- ☆11Mar 11, 2025Updated 11 months ago
- [ACL 2021] This is the Pytorch code for our paper "Semantic Relation-aware Difference Representation Learning for Change Captioning".☆13Jan 16, 2022Updated 4 years ago
- This project reproduces the application of the paper SVLRM on depth image sampling, depth image restoration, natural image denoising, etc…☆12Jun 6, 2020Updated 5 years ago
- ☆46Dec 30, 2024Updated last year
- ☆12May 17, 2019Updated 6 years ago
- Hierarchical Group Sparse Regularization for Deep Convolutional Neural Networks☆12Apr 13, 2020Updated 5 years ago
- ☆13Dec 12, 2024Updated last year
- Perspective Transformation for Indoor Image Aesthetic Enhancement☆12Jan 8, 2020Updated 6 years ago
- [ICCV 2023] The official code for "SILT: Shadow-aware Iterative Label Tuning for Learning to Detect Shadows from Noisy Labels"☆15May 1, 2024Updated last year
- Training scripts and Python modules for the ECCV 2018 paper "Interpolating Convolutional Networks Using Batch Normalization"☆11Jun 13, 2020Updated 5 years ago