Agentic-Learning-AI-Lab / lifelong-memoryLinks

Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos

☆28

Alternatives and similar repositories for lifelong-memory

Users that are interested in lifelong-memory are comparing it to the libraries listed below

Sorting:

kkahatapitiya / LangRepo
Code for our ACL 2025 paper "Language Repository for Long Video Understanding"
☆34Updated last year
orrzohar / Video-STaR
[ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
☆72Updated last year
CeeZh / LLoVi
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
☆106Updated last year
egoschema / EgoSchema
☆109Updated last year
wxh1996 / VideoAgent
☆134Updated 9 months ago
Ziyang412 / VideoTree
Code for CVPR25 paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
☆154Updated 7 months ago
facebookresearch / ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆54Updated last year
longvideobench / LongVideoBench
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
☆113Updated last year
jongwoopark7978 / LVNet
☆41Updated 10 months ago
ChenYi99 / EgoPlan
[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
☆79Updated last year
Gabesarch / grounded-rl
☆116Updated 6 months ago
lbaermann / qaego4d
Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"
☆29Updated 2 years ago
Ahnsun / merlin
[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds
☆96Updated last year
bigai-nlco / VideoLLaMB
[ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
☆83Updated 11 months ago
Becomebright / GroundVQA
Official PyTorch code of GroundVQA (CVPR'24)
☆64Updated last year
CASIA-IVA-Lab / VideoNIAH
VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs
☆54Updated 10 months ago
imagegridworth / IG-VLM
☆138Updated last year
ziqipang / MR-Video
MR. Video: MapReduce is the Principle for Long Video Understanding
☆29Updated 9 months ago
appletea233 / Temporal-R1
Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency
☆60Updated 8 months ago
mll-lab-nu / TStar
TStar is a unified temporal search framework for long-form video question answering
☆86Updated 5 months ago
ruili33 / TPO
☆41Updated 4 months ago
bigai-nlco / VideoTGB
[EMNLP 2024] A Video Chat Agent with Temporal Prior
☆32Updated 11 months ago
alanaai / EVUD
Egocentric Video Understanding Dataset (EVUD)
☆32Updated last year
mlvlab / Flipped-VQA
Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)
☆77Updated 10 months ago
RifleZhang / LLaVA-Hound-DPO
☆155Updated last year
llyx97 / TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …
☆127Updated 10 months ago
DoubtedSteam / MM-GCoT
The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"
☆21Updated 6 months ago
mu-cai / TemporalBench
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
☆37Updated last year
showlab / videogui
[NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
☆48Updated 7 months ago
Open-Reasoner-Zero / Open-Vision-Reasoner
[NeurIPS 2025] The official repository for our paper, "Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reason…
☆153Updated 4 months ago