chaoyuaw/lvu

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chaoyuaw/lvu)

chaoyuaw / lvu

☆87

Alternatives and similar repositories for lvu

Users that are interested in lvu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ycxioooong / MovieSynopsisAssociation
View on GitHub
Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019
☆52Aug 9, 2020Updated 5 years ago
frostinassiky / bsp
View on GitHub
Placeholder for code of BSP.
☆11Aug 13, 2021Updated 4 years ago
Annusha / LIReC
View on GitHub
Learning Interactions and Relationships between Movie Characters (CVPR'20)
☆22Apr 12, 2023Updated 3 years ago
showlab / Q2A
View on GitHub
[ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant
☆23Jan 30, 2026Updated 5 months ago
facebookresearch / LaViLa
View on GitHub
Code release for "Learning Video Representations from Large Language Models"
☆534Oct 1, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
bradyz / task-distillation
View on GitHub
Code for Domain Adaptation Through Task Distillation (ECCV 20)
☆47Dec 8, 2022Updated 3 years ago
jimmy646 / violin
View on GitHub
Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference"
☆161Apr 29, 2020Updated 6 years ago
MikeWangWZHL / VidIL
View on GitHub
Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
☆117Sep 15, 2022Updated 3 years ago
aimagelab / mvad-names-dataset
View on GitHub
M-VAD Names Dataset. Multimedia Tools and Applications (2019)
☆24Jul 9, 2019Updated 7 years ago
showlab / Show-Anything-3D
View on GitHub
Edit and Generate Anything in 3D world!
☆13Apr 15, 2023Updated 3 years ago
Chuhanxx / Temporal_Query_Networks
View on GitHub
The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆64Mar 9, 2022Updated 4 years ago
TheShadow29 / VidSitu
View on GitHub
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
☆61Aug 17, 2021Updated 4 years ago
HCPLab-SYSU / SR
View on GitHub
☆48Jul 8, 2018Updated 8 years ago
facebookresearch / MeMViT
View on GitHub
Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
☆155Nov 30, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zhaoyue-zephyrus / TeSTra
View on GitHub
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
☆119Aug 23, 2025Updated 11 months ago
MCG-NJU / CPD-Video
View on GitHub
Learning Spatiotemporal Features via Video and Text Pair Discrimination
☆60Jan 20, 2021Updated 5 years ago
Li-Wanhua / GR2N
View on GitHub
PyTorch implementation of Graph-Based Social Relation Reasoning (ECCV 2020)
☆17Jan 11, 2024Updated 2 years ago
showlab / EgoVLP
View on GitHub
[NeurIPS 2022] Egocentric Video-Language Pretraining
☆261May 9, 2024Updated 2 years ago
piergiaj / AViD
View on GitHub
AViD Dataset: Anonymized Videos from Diverse Countries
☆54Mar 30, 2023Updated 3 years ago
siliu-group / pic-challenge-baseline
View on GitHub
PIC Challenge Baseline
☆18Dec 27, 2018Updated 7 years ago
hazeld / action-modifiers
View on GitHub
Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos'
☆23May 17, 2021Updated 5 years ago
movienet / movienet-tools
View on GitHub
Tools for movie and video research
☆313Jun 20, 2022Updated 4 years ago
Tushar-N / pytorch-resnet3d
View on GitHub
I3D Nonlocal ResNets in Pytorch
☆259Mar 26, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
TheShadow29 / vognet-pytorch
View on GitHub
[CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606)
☆69Jun 10, 2020Updated 6 years ago
renjie-liang / HUAL
View on GitHub
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning
☆15Dec 12, 2023Updated 2 years ago
isl-org / vision-for-action
View on GitHub
Code to accompany "Does computer vision matter for action?"
☆44Sep 2, 2024Updated last year
kennymckormick / ARAS-Dataset
View on GitHub
☆11Nov 5, 2024Updated last year
roeiherz / ORViT
View on GitHub
Object-Region Video Transformers
☆24Mar 24, 2022Updated 4 years ago
antoyang / VidChapters
View on GitHub
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
☆213Nov 13, 2023Updated 2 years ago
antoine77340 / MIL-NCE_HowTo100M
View on GitHub
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221Jul 5, 2022Updated 4 years ago
LuoweiZhou / densecap
View on GitHub
Dense video captioning in PyTorch
☆41Aug 30, 2019Updated 6 years ago
chenjoya / manogcn
View on GitHub
Code for MANO-GCN —— "Capturing Implicit Spatial Cues for Monocular 3D Hand Reconstruction" (ICME2021 Oral)
☆13Jun 24, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
HaozhiQi / RPIN
View on GitHub
Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021)
☆113Jul 5, 2026Updated 3 weeks ago
Siyu-C / ACAR-Net
View on GitHub
[CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization
☆215Oct 8, 2021Updated 4 years ago
antoyang / TubeDETR
View on GitHub
[CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers
☆194Sep 24, 2023Updated 2 years ago
Yui010206 / Ego2Web
View on GitHub
[CVPR 2026] Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos
☆29Mar 25, 2026Updated 4 months ago
jayleicn / recurrent-transformer
View on GitHub
[ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
☆170Dec 4, 2020Updated 5 years ago
mauriciolp / inter-rel-net
View on GitHub
Code used at paper "Interaction Relational Network for Mutual Action Recognition" TMM 2021.
☆16Apr 5, 2021Updated 5 years ago
microsoft / UniTAB
View on GitHub
UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)
☆90Jun 12, 2023Updated 3 years ago