weihao-bo/ViLoMem

ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory

☆66

Alternatives and similar repositories for ViLoMem

Users who are interested in ViLoMem are comparing it to the repositories listed below.

zechao-li / SVF-few-shot-segmentation
☆22 · May 16, 2023 · Updated 3 years ago
AI4Math-ShanZhang / SVE-Math
Implementation of the paper Open Eyes, Then Reason: Fine-grained Visual Mathematical Understanding in MLLMs
☆12 · Jun 7, 2025 · Updated last year
zechao-li / SVF-pytorch
☆27 · Apr 5, 2024 · Updated 2 years ago
r2llab / GTTA
This codebase is to reproduce the results of the paper "Grounded Test-Time Adaptation for LLM Agents".
☆17 · Mar 4, 2026 · Updated 4 months ago
syp2ysy / prompt-SelF
[TIP] Exploring Effective Factors for Improving Visual In-Context Learning
☆21 · Jul 2, 2025 · Updated last year
ZhouZJ-DL / Multi-turn_Consistent_Image_Editing
User-friendly multi-turn image editing
☆19 · Aug 25, 2025 · Updated 10 months ago
ForJadeForest / Lever-LM
The Code for Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
☆18 · Oct 4, 2024 · Updated last year
syp2ysy / SVF
[NeurIPS 2022] Singular Value Fine-tuning: Few-shot Segmentation requires Few-parameters Fine-tuning
☆74 · Jan 31, 2024 · Updated 2 years ago
CSSLab / ThinkTwice
Jointly Optimizing Large Language Models for Reasoning and Self-Refinement
☆15 · Apr 22, 2026 · Updated 3 months ago
Yui010206 / Adaptive-Visual-Imagination-Control
When and How Much to Imagine: Adaptive Test-Time Scaling with World Models for Visual Spatial Reasoning
☆18 · Jun 2, 2026 · Updated last month
ByungKwanLee / Distill-R1
Open-source RL Framework with Online Teacher-Student Distillation
☆22 · Mar 5, 2026 · Updated 4 months ago
luckybird1994 / IPSeg
Towards Training-free Open-world Segmentation via Image Prompt Foundation Models
☆18 · Nov 22, 2024 · Updated last year
MasterVito / DAC-RL
Official Repo for DAC-RL: Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
☆16 · Feb 26, 2026 · Updated 4 months ago
jeon185 / LaViC
Implementation of LaViC (KDD 2025)
☆13 · Jun 1, 2025 · Updated last year
VITA-Group / Nabla-Reasoner
[ICLR'26] "Nabla-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space" by Peihao Wang*, Ruisi Cai*, Zhen Wang, Hongyuan…
☆35 · Mar 10, 2026 · Updated 4 months ago
ChoS3nE11ven / Agentic-MME
☆36 · Apr 13, 2026 · Updated 3 months ago
GaryJiajia / OFv2_ICL_VQA
[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
☆21 · May 28, 2025 · Updated last year
w-yibo / VTC-R1
VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning.
☆26 · Updated this week
Dtc7w3PQ / PRCO
Official implementation of Seeing with You: Perception-Reasoning Co-evolution for Multimodal Reasoning.
☆30 · Jul 2, 2026 · Updated 3 weeks ago
sming256 / BOLT
[CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
☆55 · Feb 5, 2026 · Updated 5 months ago
shiweijiezero / R3L
☆23 · Apr 5, 2026 · Updated 3 months ago
xrenaf / MEMLENS
☆23 · Updated this week
ShawnTan86 / TokenCarve
This is the open-source code for TokenCarve.
☆25 · Jan 23, 2026 · Updated 6 months ago
DreamMr / RAP
Code for Retrieval-Augmented Perception (ICML 2025)
☆74 · Apr 22, 2026 · Updated 3 months ago
ejhshen / SLIM
Implementation of SLIM, a framework of dynamics skill lifecycle management for agentic reinforcement learning
☆22 · May 12, 2026 · Updated 2 months ago
AIGeeksGroup / MMA
MMA: Multimodal Memory Agent
☆23 · Mar 30, 2026 · Updated 3 months ago
hithqd / ReasonBrain
【ICML2026】Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
☆27 · May 18, 2026 · Updated 2 months ago
Namkyeong / AMOLE
The official source code for "Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Model".
☆14 · Jul 23, 2024 · Updated 2 years ago
WenyiWU0111 / CoMEM-Agent
Official repository for paper Auto-scaling Continuous Memory for GUI Agent
☆29 · Feb 2, 2026 · Updated 5 months ago
rohit901 / cooperative-foundational-models
[WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"
☆84 · Jan 2, 2026 · Updated 6 months ago
nabk89 / NAS-with-Proxy-data
Official code of "NAS acceleration via proxy data", IJCAI21
☆10 · May 29, 2022 · Updated 4 years ago
XSkill-Agent / XSkill
[ICML 2026] XSkill: Continual Learning from Experience and Skills in Multimodal Agents
☆239 · May 13, 2026 · Updated 2 months ago
haowei-freesky / HERMES
Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]
☆92 · May 8, 2026 · Updated 2 months ago
cmkang / CTSN
☆12 · Dec 19, 2016 · Updated 9 years ago
Feng-Hong / WINO-DLLM
ICLR 2026
☆44 · May 29, 2026 · Updated last month
mrwu-mac / R-Bench
[ICML2024] Repo for the paper "Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models"
☆24 · Jan 1, 2025 · Updated last year
aba122 / Q-Hawkeye
☆61 · Feb 9, 2026 · Updated 5 months ago
RCHI-Lab / voicepilot
☆19 · May 26, 2026 · Updated last month
HHHLF / LoDA_ICML2026
Code for "Task-Driven Subspace Decomposition for Knowledge Sharing and Isolation in LoRA-based Continual Learning (ICML 2026)".
☆20 · Jun 9, 2026 · Updated last month