Terry-Xu-666 / visual_inference_chain
This repository contains the official code for our paper: Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination
☆25 · Nov 15, 2024 · Updated last year
Alternatives and similar repositories for visual_inference_chain
Users who are interested in visual_inference_chain are comparing it to the libraries listed below.
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language… ☆13 · Dec 16, 2024 · Updated last year
- t-vMF Similarity for Regularizing Intra-Class Feature Distribution ☆21 · Jun 11, 2021 · Updated 4 years ago
- [ECCV’24] Official repository for "BEAF: Observing Before-AFter Changes to Evaluate Hallucination in Vision-language Models" ☆21 · Mar 26, 2025 · Updated 10 months ago
- ☆43 · Jul 18, 2024 · Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models ☆45 · Jun 14, 2024 · Updated last year
- pytorch ☆10 · Apr 13, 2022 · Updated 3 years ago
- Optimized MDNet for fast object tracking ☆10 · Apr 10, 2019 · Updated 6 years ago
- Normalized Wasserstein for Mixture Distributions ☆11 · Mar 24, 2023 · Updated 2 years ago
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. ☆10 · Dec 12, 2024 · Updated last year
- A large-scale dataset composed of high-quality synthetic images aimed at evaluating social biases in LVLMs ☆13 · Oct 6, 2025 · Updated 4 months ago
- [WACV 2025-Oral Presentation] Test-Time Adaptation in Point Clouds: Leveraging Sampling Variation with Weight Averaging ☆12 · Mar 31, 2025 · Updated 10 months ago
- EraseDiff: Erasing Data Influence in Diffusion Models ☆14 · Nov 20, 2024 · Updated last year
- ICCV 2021 papers and code focus on adversarial attacks and defense ☆11 · Nov 5, 2021 · Updated 4 years ago
- Official repository for "Stylized Adversarial Training" (TPAMI 2022) ☆11 · Dec 30, 2022 · Updated 3 years ago
- Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding (CVPR 2025 Oral) ☆36 · Nov 28, 2025 · Updated 2 months ago
- Pytorch implementation of Detective ☆12 · Jul 11, 2024 · Updated last year
- Official Implementation of the CVPR'23 paper 'Regularization of polynomial networks for image recognition'. ☆10 · Jun 8, 2023 · Updated 2 years ago
- Code for Modeling Annotator Preference and Stochastic Annotation Error for Medical Image Segmentation (MedIA 2023). ☆11 · Nov 17, 2023 · Updated 2 years ago
- ☆14 · Jan 5, 2022 · Updated 4 years ago
- Python codes for mathematical modeling. ☆12 · Sep 5, 2021 · Updated 4 years ago
- Implementation of the Paper Scene-Graph ViT ☆10 · Dec 20, 2024 · Updated last year
- This is the original matlab version of MKCFup ☆10 · Jan 23, 2019 · Updated 7 years ago
- Unexplored Faces of Robustness and Out-of-Distribution: Covariate Shifts in Environment and Sensor Domains (CVPR 2024) ☆10 · Jan 17, 2026 · Updated 3 weeks ago
- Data for evaluating GPT-4V ☆11 · Oct 26, 2023 · Updated 2 years ago
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology ☆10 · Nov 19, 2020 · Updated 5 years ago
- [COLM 2024] LITE: Modeling Environmental Ecosystems with Multimodal Large Language Models ☆14 · Jan 4, 2025 · Updated last year
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202… ☆29 · Jan 18, 2026 · Updated 3 weeks ago
- 2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261 ☆13 · Aug 22, 2021 · Updated 4 years ago
- [ICPR 2024] Official repository of robustness and generalization benchmark collection introduced in the paper "GenFormer - Generated Imag… ☆13 · Aug 29, 2024 · Updated last year
- The official implementation of "Routing Experts: Learning to Route Dynamic Experts in Existing Multi-modal Large Language Models" ☆17 · Mar 24, 2025 · Updated 10 months ago
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur… ☆11 · Aug 24, 2022 · Updated 3 years ago
- This is the official implementation of our PrOmpt cLass lEarning (POLE). ☆12 · Jan 21, 2024 · Updated 2 years ago
- (TIP'18) An Embarrassingly Simple Approach to Visual Domain Adaptation ☆12 · Aug 7, 2018 · Updated 7 years ago
- [ICML 2024] Learning with Complementary Labels Revisited: The Selected-Completely-at-Random Setting Is More Practical ☆12 · May 12, 2024 · Updated last year
- [ECCV 2022] Contrastive Prototypical Network with Wasserstein Confidence Penalty ☆11 · Oct 20, 2022 · Updated 3 years ago
- A re-implementation of T-GNN, a framework for pedestrian trajectory prediction that addresses the performance decrease caused by distribu… ☆15 · Jul 4, 2023 · Updated 2 years ago
- ☆12 · May 27, 2022 · Updated 3 years ago
- ☆12 · Mar 1, 2018 · Updated 7 years ago
- JoPano: Unified Panorama Generation via Joint Modeling ☆23 · Dec 12, 2025 · Updated 2 months ago