agentic-learning-ai-lab / lifelong-memoryLinks

Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos

☆27

Alternatives and similar repositories for lifelong-memory

Users that are interested in lifelong-memory are comparing it to the libraries listed below

Sorting:

kkahatapitiya / LangRepo
Language Repository for Long Video Understanding
☆32Updated last year
CeeZh / LLoVi
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
☆101Updated 11 months ago
egoschema / EgoSchema
☆101Updated 9 months ago
orrzohar / Video-STaR
[ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
☆70Updated last year
facebookresearch / ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆48Updated last year
longvideobench / LongVideoBench
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
☆110Updated last year
Ziyang412 / VideoTree
Code for CVPR25 paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
☆142Updated 4 months ago
wxh1996 / VideoAgent
☆117Updated 6 months ago
mll-lab-nu / TStar
TStar is a unified temporal search framework for long-form video question answering
☆69Updated last month
bigai-nlco / VideoLLaMB
[ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges
☆77Updated 7 months ago
lbaermann / qaego4d
Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"
☆28Updated 2 years ago
Becomebright / GroundVQA
Official PyTorch code of GroundVQA (CVPR'24)
☆64Updated last year
llyx97 / TempCompass
[ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, …
☆124Updated 6 months ago
Ahnsun / merlin
[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds
☆94Updated last year
showlab / VideoGUI
[NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
☆45Updated 4 months ago
jongwoopark7978 / LVNet
☆36Updated 7 months ago
tychen-SJTU / MECD-Benchmark
[NeurIPS'24 spotlight] MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning
☆41Updated 3 months ago
imagegridworth / IG-VLM
☆138Updated last year
ruili33 / TPO
☆38Updated last month
MikeWangWZHL / Paxion
Repo for paper: "Paxion: Patching Action Knowledge in Video-Language Foundation Models" Neurips 23 Spotlight
☆37Updated 2 years ago
kahnchana / mvu
🤖 [ICLR'25] Multimodal Video Understanding Framework (MVU)
☆49Updated 8 months ago
alanaai / EVUD
Egocentric Video Understanding Dataset (EVUD)
☆31Updated last year
Becomebright / ReKV
Official PyTorch Code of ReKV (ICLR'25)
☆61Updated 7 months ago
StanfordVL / atp-video-language
Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (…
☆51Updated last year
EvolvingLMMs-Lab / VideoMMMU
☆60Updated last month
Gabesarch / grounded-rl
☆97Updated 3 months ago
mu-cai / TemporalBench
TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models
☆37Updated 11 months ago
mlvlab / Flipped-VQA
Large Language Models are Temporal and Causal Reasoners for Video Question Answering (EMNLP 2023)
☆76Updated 7 months ago
bigai-nlco / VideoTGB
[EMNLP 2024] A Video Chat Agent with Temporal Prior
☆32Updated 7 months ago
ChenYi99 / EgoPlan
[IJCV] EgoPlan-Bench: Benchmarking Multimodal Large Language Models for Human-Level Planning
☆74Updated 10 months ago