xlyu0106/ViF

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xlyu0106/ViF)

xlyu0106 / ViF

[ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow

☆44

Alternatives and similar repositories for ViF

Users that are interested in ViF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xlyu0106 / MACT
View on GitHub
☆19Jul 31, 2025Updated 11 months ago
lyrig / TokenAR
View on GitHub
TokenAR: Multiple Subject Generation via Autoregressive Token-level enhancement
☆22Mar 4, 2026Updated 4 months ago
latentcraft / replay
View on GitHub
[CVPR 2026] Boosting Reasoning in Large Multimodal Models via Activation Replay
☆24May 7, 2026Updated 2 months ago
LsmnBmnc / Med-CMR
View on GitHub
Official code repository for Med-CMR : "A Fine-Grained Benchmark Integrating Visual Evidence and Clinical Logic for Medical Complex Multi…
☆26Dec 10, 2025Updated 7 months ago
Yuan-Hou / Human-MME
View on GitHub
Official repository for "Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models"
☆22Dec 2, 2025Updated 7 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
YinBo0927 / FeRA
View on GitHub
[ICML 2026] The official code of FeRA: Frequency–Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
☆29Dec 27, 2025Updated 6 months ago
rain152 / LFA-Video-Generation
View on GitHub
From Large Angles to Consistent Faces: Identity-Preserving Video Generation via Mixture of Facial Experts
☆27Jan 12, 2026Updated 6 months ago
HaoxuanXU1024 / IRPO
View on GitHub
☆30Nov 28, 2025Updated 7 months ago
xlyu0106 / VisMem
View on GitHub
☆91Feb 5, 2026Updated 5 months ago
NUS-Project / Landmark-of-medical-agent
View on GitHub
☆181Jun 8, 2026Updated last month
zhangzjn / T3-Video
View on GitHub
[ICML 2026] Transform Trained Transformer for Accelerating Native 4K Video Generation
☆41Dec 16, 2025Updated 7 months ago
ustc-hyin / ClearSight
View on GitHub
Code for paper: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models
☆60Dec 18, 2024Updated last year
zhangzjn / Soul
View on GitHub
[CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation
☆64Dec 16, 2025Updated 7 months ago
silent-commit / CLEAR
View on GitHub
CLEAR: Context-Aware Learning with End-to-End Mask-Free Inference for Adaptive Video Subtitle Removal
☆20May 25, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ZhengyaoFang / PruneSID
View on GitHub
Official code for **Prune Redundancy, Preserve Essence: Vision Token Compression in VLMs via Synergistic Importance-Diversity** (PruneSI…
☆14Mar 25, 2026Updated 3 months ago
pspdada / SENTINEL
View on GitHub
[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".
☆31Jul 2, 2026Updated 2 weeks ago
NUS-Project / MedMASLab
View on GitHub
☆30Mar 22, 2026Updated 4 months ago
linjh1118 / AwesomeRM
View on GitHub
☆32Jan 11, 2026Updated 6 months ago
zhangbaijin / LVLMs-Saliency
View on GitHub
[ICLR 2026 Oral] 🎉Hallucination Begins Where Saliency Drops
☆65Feb 12, 2026Updated 5 months ago
diaoquesang / GL-LCM
View on GitHub
[MICCAI 2025] GL-LCM: Global-Local Latent Consistency Models for Fast High-Resolution Bone Suppression in Chest X-Ray Images
☆17Mar 12, 2026Updated 4 months ago
YinBo0927 / RePro
View on GitHub
The official code of Refinement Provenance Inference: Detecting LLM-Refined Training Prompts from Model Behavior
☆22Jan 6, 2026Updated 6 months ago
wzhwzhwzh0921 / Awesome_LRM_with_Entropy
View on GitHub
Introduction about AWESOME_ENTROPY+LRM_PAPERS
☆32Dec 16, 2025Updated 7 months ago
cgao-comp / GAMC
View on GitHub
☆15Feb 24, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
XiaoyuXu-Vincent / step-saliency
View on GitHub
Official code for paper "Reasoning Fails Where Step Flow Breaks" (ACL 2026)
☆18Apr 19, 2026Updated 3 months ago
CodeDance-VL / CodeDance
View on GitHub
☆32Mar 17, 2026Updated 4 months ago
Linxi-ZHAO / MARINE
View on GitHub
☆19Jun 6, 2025Updated last year
PKU-YuanGroup / Look-Back
View on GitHub
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
☆99Jul 10, 2025Updated last year
LiangThree / MCMA
View on GitHub
☆15Jan 12, 2026Updated 6 months ago
iLearn-Lab / ACL25-PTQ1.61
View on GitHub
☆15Apr 6, 2026Updated 3 months ago
UCSB-AI / DMLR
View on GitHub
[CVPR2026] Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
☆84May 12, 2026Updated 2 months ago
zhangquanchen / SIFThinker
View on GitHub
[AAAI 2026] SIFThinker: Spatially-Aware Image Focus for Visual Reasoning
☆22Dec 2, 2025Updated 7 months ago
HUuxiaobin / DiffuMatting
View on GitHub
☆18Jul 14, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bingreeky / opd-evolver
View on GitHub
☆37Jun 17, 2026Updated last month
MikeWangWZHL / PAPO
View on GitHub
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
☆151Feb 4, 2026Updated 5 months ago
MILVLG / videoarm
View on GitHub
☆27Apr 9, 2026Updated 3 months ago
jinghan1he / VHR
View on GitHub
[ACL 2025] Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence
☆21Jun 10, 2025Updated last year
zhangquanchen / 4DThinker
View on GitHub
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding
☆77May 26, 2026Updated last month
VTON-HandFit / VTON-HandFit
View on GitHub
☆42Nov 12, 2025Updated 8 months ago
GuangtaoLyu / awesome-hallucination-mllm
View on GitHub
MLLM hallucination, LVLM, LLM, Hallucination Mitigation, Training-free hallucination mitigation
☆30Jul 13, 2026Updated last week