ADL-X / LLAVIDALLinks

This is the offical repository of LLAVIDAL

☆20

Alternatives and similar repositories for LLAVIDAL

Users that are interested in LLAVIDAL are comparing it to the libraries listed below

Sorting:

facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆100Updated last year
jongwoopark7978 / LVNet
☆35Updated 6 months ago
facebookresearch / VidOSC
Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)
☆35Updated last year
PolyU-ChenLab / ETBench
👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)
☆65Updated 8 months ago
lucas-ventura / CoVR
Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".
☆114Updated 6 months ago
ExplainableML / EgoCVR
[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
☆41Updated 5 months ago
WHB139426 / Grounded-Video-LLM
[EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
☆125Updated last month
OpenGVLab / EgoExoLearn
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
☆70Updated last month
aleflabo / PREGO
The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…
☆24Updated 4 months ago
jh-yi / Video-Panda
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models [CVPR 2025]
☆73Updated 3 months ago
fmu2 / snag_release
Official Implementation of SnAG (CVPR 2024)
☆54Updated 5 months ago
facebookresearch / ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆46Updated last year
benedettaliberatori / T3AL
Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024
☆66Updated last year
ziplab / LongVLM
☆103Updated last year
Ziyang412 / VideoTree
Code for CVPR25 paper "VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos"
☆138Updated 3 months ago
facebookresearch / htstep
HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos
☆21Updated last year
KyleHuang9 / SeFAR
[AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
☆26Updated 9 months ago
franciszzj / OpenPSG
[ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models
☆49Updated 9 months ago
sudo-Boris / mr-Blip
Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"
☆91Updated 7 months ago
wlin-at / MAXI
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)
☆30Updated 2 years ago
OpenGVLab / EgoVideo
[CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024
☆129Updated 4 months ago
LilyDaytoy / OpenPVSG
Benchmarking Panoptic Video Scene Graph Generation (PVSG), CVPR'23
☆97Updated last year
fpv-iplab / EASG
Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)
☆44Updated 6 months ago
showlab / VideoLISA
[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆135Updated 9 months ago
shiyi-zh0408 / LOGO
Accepted by CVPR 2023
☆44Updated last year
ut-vision / ActionVOS
[ECCV 2024 Oral] ActionVOS: Actions as Prompts for Video Object Segmentation
☆31Updated 10 months ago
Becomebright / GroundVQA
Official PyTorch code of GroundVQA (CVPR'24)
☆62Updated last year
gyxxyg / TRACE
[ICLR 2025] TRACE: Temporal Grounding Video LLM via Casual Event Modeling
☆126Updated last month
BolinLai / LEGO
[ECCV2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation…
☆38Updated 7 months ago
naver-ai / tc-clip
[ECCV 2024] Official PyTorch implementation of TC-CLIP "Leveraging Temporal Contextualization for Video Action Recognition"
☆74Updated 7 months ago