YU-deep/VisMem

YU-deep / VisMem

☆75

Alternatives and similar repositories for VisMem

Users who are interested in VisMem are comparing it to the repositories listed below.

YU-deep / MACT
☆18 · Jul 31, 2025 · Updated 7 months ago
YU-deep / ViF
[ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
☆35 · Oct 3, 2025 · Updated 4 months ago
Yu-xm / ReVision
Modality Gap–Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models
☆51 · Feb 23, 2026 · Updated last week
christian42mmreason / ActivationReplay
☆20 · Dec 3, 2025 · Updated 2 months ago
YinBo0927 / FeRA
The official code of FeRA: Frequency–Energy Constrained Routing for Effective Diffusion Adaptation Fine-Tuning
☆28 · Dec 27, 2025 · Updated 2 months ago
marinero4972 / CyberV
☆18 · Jun 10, 2025 · Updated 8 months ago
ZJU-REAL / cooper
☆25 · Aug 19, 2025 · Updated 6 months ago
KejiaZhang-Robust / TARS
TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs
☆23 · Sep 21, 2025 · Updated 5 months ago
Vision-CAIR / Infinibench
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆19 · Nov 4, 2025 · Updated 3 months ago
zjq0455 / PTQ1.61
☆15 · Jan 12, 2026 · Updated last month
GATECH-EIC / LaCache
[ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models
☆17 · Nov 4, 2025 · Updated 3 months ago
HUuxiaobin / VTBench
☆22 · May 26, 2025 · Updated 9 months ago
inclusionAI / MoBE
Mixture-of-Basis-Experts for Compressing MoE-based LLMs
☆29 · Dec 24, 2025 · Updated 2 months ago
chanchimin / AgentMonitor
Code for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"
☆13 · Dec 13, 2024 · Updated last year
huaixuheqing / VPPO-RL
[ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"
☆49 · Jan 30, 2026 · Updated last month
ziplab / CoV
CoV: Chain-of-View Prompting for Spatial Reasoning
☆51 · Jan 23, 2026 · Updated last month
XieZilongAI / E2E-AFG
An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation
☆16 · Oct 27, 2024 · Updated last year
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Jul 6, 2024 · Updated last year
XMUDeepLIT / Faithful-RAG
Code and Data for "FaithfulRAG: Fact-Level Conflict Modeling for Context-Faithful Retrieval-Augmented Generation" (ACL25)
☆29 · Oct 26, 2025 · Updated 4 months ago
PolyU-ChenLab / ETBench
👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)
☆74 · Jan 20, 2025 · Updated last year
jiyt17 / IDA-VLM
[ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
☆37 · Nov 27, 2024 · Updated last year
shenao-zhang / reward-augmented-preference
The official implementation of Preference Data Reward-Augmentation.
☆18 · May 1, 2025 · Updated 10 months ago
ML-GSAI / ReFusion
[ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"
☆57 · Dec 26, 2025 · Updated 2 months ago
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆43 · Feb 27, 2025 · Updated last year
Liuziyu77 / MIA-DPO
Official implementation of MIA-DPO
☆70 · Jan 23, 2025 · Updated last year
patrick-tssn / VideoHallucer
VideoHallucer: the first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
☆42 · Dec 16, 2025 · Updated 2 months ago
MiuLab / FactAlign
Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"
☆19 · Oct 3, 2024 · Updated last year
qhjqhj00 / MetaAgent
MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning
☆42 · Sep 3, 2025 · Updated 5 months ago
EnVision-Research / RectifiedHR
Official PyTorch/Diffusers implementation of "RectifiedHR: Enable Efficient High Resolution Image Generation via Energy Rectification"
☆30 · Oct 11, 2025 · Updated 4 months ago
GeJulia / flc_pooling
Code for FrequencyLowCut Pooling (FLC pooling)
☆20 · Apr 22, 2025 · Updated 10 months ago
enyac-group / Quamba
The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]
☆67 · Jun 19, 2025 · Updated 8 months ago
markywg / transagent
[NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
☆26 · Oct 17, 2024 · Updated last year
yuelinan / Awesome-Efficient-R1-style-LRMs
☆49 · Aug 14, 2025 · Updated 6 months ago
lime-RL / DCPO
DCPO: Dynamic Adaptive Clipping for RL
☆45 · Dec 20, 2025 · Updated 2 months ago
fqhank / Align-KD
☆35 · Mar 8, 2025 · Updated 11 months ago
WangYuxuan93 / CVLUE
Chinese Vision-Language Understanding Evaluation
☆23 · Dec 26, 2024 · Updated last year
AV-Odyssey / AV-Odyssey
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31 · Dec 23, 2024 · Updated last year
GuanghaoYe / Emergence-of-Thinking
☆53 · Feb 11, 2025 · Updated last year
multimodal-art-projection / TreePO
☆60 · Jan 12, 2026 · Updated last month