☆87 · Mar 4, 2024 · Updated 2 years ago
Alternatives and similar repositories for lvu
Users that are interested in lvu are comparing it to the libraries listed below.
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019 · ☆52 · Aug 9, 2020 · Updated 5 years ago
- Placeholder for code of BSP. · ☆11 · Aug 13, 2021 · Updated 4 years ago
- Learning Interactions and Relationships between Movie Characters (CVPR'20) · ☆22 · Apr 12, 2023 · Updated 2 years ago
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant · ☆23 · Jan 30, 2026 · Updated last month
- Code release for "Learning Video Representations from Large Language Models" · ☆534 · Oct 1, 2023 · Updated 2 years ago
- Code for Domain Adaptation Through Task Distillation (ECCV 20) · ☆48 · Dec 8, 2022 · Updated 3 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners · ☆117 · Sep 15, 2022 · Updated 3 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference" · ☆161 · Apr 29, 2020 · Updated 5 years ago
- M-VAD Names Dataset. Multimedia Tools and Applications (2019) · ☆24 · Jul 9, 2019 · Updated 6 years ago
- Edit and Generate Anything in 3D world! · ☆14 · Apr 15, 2023 · Updated 2 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding · ☆64 · Mar 9, 2022 · Updated 4 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990) · ☆61 · Aug 17, 2021 · Updated 4 years ago
- ☆48 · Jul 8, 2018 · Updated 7 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022 · ☆153 · Nov 30, 2022 · Updated 3 years ago
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers" · ☆116 · Aug 23, 2025 · Updated 7 months ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination · ☆60 · Jan 20, 2021 · Updated 5 years ago
- PyTorch implementation of Graph-Based Social Relation Reasoning (ECCV 2020) · ☆17 · Jan 11, 2024 · Updated 2 years ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining · ☆258 · May 9, 2024 · Updated last year
- AViD Dataset: Anonymized Videos from Diverse Countries · ☆56 · Mar 30, 2023 · Updated 2 years ago
- PIC Challenge Baseline · ☆18 · Dec 27, 2018 · Updated 7 years ago
- Code for the CVPR 2020 paper 'Action Modifiers: Learning from Adverbs in Instructional Videos' · ☆23 · May 17, 2021 · Updated 4 years ago
- [CVPR20] Video Object Grounding using Semantic Roles in Language Description (https://arxiv.org/abs/2003.10606) · ☆69 · Jun 10, 2020 · Updated 5 years ago
- Tools for movie and video research · ☆306 · Jun 20, 2022 · Updated 3 years ago
- I3D Nonlocal ResNets in Pytorch · ☆257 · Mar 26, 2022 · Updated 4 years ago
- Code to accompany "Does computer vision matter for action?" · ☆44 · Sep 2, 2024 · Updated last year
- ☆11 · Nov 5, 2024 · Updated last year
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning · ☆15 · Dec 12, 2023 · Updated 2 years ago
- ☆35 · Mar 22, 2022 · Updated 4 years ago
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale · ☆207 · Nov 13, 2023 · Updated 2 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M · ☆219 · Jul 5, 2022 · Updated 3 years ago
- Code for MANO-GCN: "Capturing Implicit Spatial Cues for Monocular 3D Hand Reconstruction" (ICME2021 Oral) · ☆13 · Jun 24, 2021 · Updated 4 years ago
- Dense video captioning in PyTorch · ☆41 · Aug 30, 2019 · Updated 6 years ago
- Learning Long-term Visual Dynamics with Region Proposal Interaction Networks (ICLR 2021) · ☆113 · May 29, 2022 · Updated 3 years ago
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers · ☆194 · Sep 24, 2023 · Updated 2 years ago
- [ACL 2020] PyTorch code for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning · ☆171 · Dec 4, 2020 · Updated 5 years ago
- [CVPR 2021] Actor-Context-Actor Relation Network for Spatio-temporal Action Localization · ☆214 · Oct 8, 2021 · Updated 4 years ago
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation) · ☆90 · Jun 12, 2023 · Updated 2 years ago
- Code used at paper "Interaction Relational Network for Mutual Action Recognition" TMM 2021. · ☆16 · Apr 5, 2021 · Updated 4 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA · ☆24 · Oct 15, 2020 · Updated 5 years ago