Graph learning framework for long-term video understanding
☆72Jul 13, 2025Updated 7 months ago
Alternatives and similar repositories for GraVi-T
Users that are interested in GraVi-T are comparing it to the libraries listed below
Sorting:
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆68Oct 29, 2023Updated 2 years ago
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- Multi-modal transformer approach for natural language query based joint video summarization and highlight detection☆17May 23, 2024Updated last year
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"☆20Jun 16, 2023Updated 2 years ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆166Mar 23, 2025Updated 11 months ago
- ☆20Dec 29, 2024Updated last year
- Code for "ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" (NeurIPS 23)☆14Apr 12, 2024Updated last year
- Code & Weights for “Learning Robust Anymodal Segmentor with Unimodal and Cross-modal Distillation”☆14Dec 6, 2024Updated last year
- This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…☆11Aug 4, 2023Updated 2 years ago
- ☆10Oct 18, 2021Updated 4 years ago
- ☆12Jan 5, 2024Updated 2 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- ☆12Apr 6, 2023Updated 2 years ago
- Neural Reflectance Field from Shading and Shadow under a Fixed Viewpoint☆16Aug 8, 2022Updated 3 years ago
- ☆32May 3, 2024Updated last year
- Referring expression comprehension on ReferIt(RefClef)☆10Nov 28, 2016Updated 9 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆450Oct 23, 2023Updated 2 years ago
- An unofficial implementation of NeRF-Supervised Deep Stereo (CVPR 2023)☆18Dec 20, 2023Updated 2 years ago
- Use MHFormer [CVPR 2022] to do pose estimation and use Unity to control rig of model. (not real-time)☆18Sep 14, 2022Updated 3 years ago
- ☆22Mar 7, 2025Updated 11 months ago
- Towards Long Form Audio-visual Video Understanding☆15Jan 16, 2026Updated last month
- ☆16Jun 25, 2022Updated 3 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆71Jan 29, 2024Updated 2 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Jan 17, 2026Updated last month
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆21Feb 22, 2024Updated 2 years ago
- Official code release for "D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video".☆25Feb 28, 2025Updated last year
- ☆22Mar 20, 2024Updated last year
- Unity project for nerf_pl (Neural Radiance Fields) for VR☆18Oct 15, 2023Updated 2 years ago
- 4D face reconstruction and analysis☆23Aug 28, 2024Updated last year
- rmp data ranking☆13Nov 4, 2025Updated 4 months ago
- Time-Optimal Path Following with Bounded Acceleration and Velocity☆27May 25, 2023Updated 2 years ago
- Code release for ECCV 2022 paper "RFNet-4D: Joint Object Reconstruction and Flow Estimation from 4D Point Clouds"☆26Mar 24, 2023Updated 2 years ago
- IntelliGraphs is a collection of graph datasets for benchmarking generative models for knowledge graphs.☆21Feb 25, 2025Updated last year
- ☆23Jun 27, 2022Updated 3 years ago
- Code related to the paper "Boundary-Aware Superpixel Segmentation", sent to ICPR 2016.☆27Aug 31, 2019Updated 6 years ago
- This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)☆25Dec 7, 2023Updated 2 years ago
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M…☆28Jan 28, 2025Updated last year