A dataset for multi-object multi-actor activity parsing
☆41Sep 29, 2023Updated 2 years ago
Alternatives and similar repositories for moma
Users that are interested in moma are comparing it to the libraries listed below
Sorting:
- A video database bridging human actions and human-object relationships☆157Jun 30, 2020Updated 5 years ago
- Learning Spatial Common Sense with Geometry-Aware Recurrent Networks☆56Dec 16, 2019Updated 6 years ago
- Video Graph Transformer for Video Question Answering (ECCV'22)☆49Jun 8, 2023Updated 2 years ago
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".☆27Jan 3, 2023Updated 3 years ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- ☆33Sep 22, 2024Updated last year
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆43Apr 17, 2023Updated 2 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆34Sep 17, 2022Updated 3 years ago
- Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold (AAAI 2022)☆11Sep 27, 2022Updated 3 years ago
- Spatial-Temporal Transformer for Dynamic Scene Graph Generation, ICCV2021☆212Aug 22, 2022Updated 3 years ago
- CVPR'25 official code for O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models☆16Sep 19, 2025Updated 6 months ago
- [CVPR 2022] Official PyTorch implementation of "Detector-Free Weakly Supervised Group Activity Recognition"☆27Jan 3, 2023Updated 3 years ago
- ☆19May 19, 2024Updated last year
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time☆46Jun 11, 2024Updated last year
- ☆41Jan 26, 2026Updated last month
- The source code for the paper: Yirong Mao, Ruiping Wang, Shiguang Shan, Xilin Chen. COSONet: Compact Second-Order Network for Video Face …☆12Dec 27, 2018Updated 7 years ago
- ☆16Apr 4, 2025Updated 11 months ago
- 비디오 기반 인공지능 대화시스템☆14Dec 23, 2023Updated 2 years ago
- ☆16Jan 6, 2025Updated last year
- [ICML'25] Official code for "ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization"☆18Dec 22, 2025Updated 2 months ago
- [ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion"☆22Jun 9, 2024Updated last year
- ☆10Aug 23, 2022Updated 3 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- A Unified Framework for Video-Language Understanding☆61Jun 17, 2023Updated 2 years ago
- Face recognition☆11Jun 20, 2019Updated 6 years ago
- Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality☆34Jan 19, 2025Updated last year
- ☆10Dec 23, 2018Updated 7 years ago
- [IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation☆28Sep 24, 2024Updated last year
- Model-Agnostic Meta-Learning for HDR Image Reconstruction. By learning the common structure between all LDR-to-HDR conversion tasks, our …☆11May 10, 2021Updated 4 years ago
- Commonsense Scene Graph-based Target Localization for Object Search☆15Apr 2, 2024Updated last year
- [ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training☆135May 28, 2024Updated last year
- Repository for the paper "Metadata Normalization"☆23May 1, 2025Updated 10 months ago
- A PyTorch implementation of VIOLET☆140Dec 17, 2023Updated 2 years ago
- code for downloading videos from HowTo100M dataset☆17May 13, 2021Updated 4 years ago
- Awesome Self-Supervised Vision Learning☆11Mar 27, 2024Updated last year
- CVPR2025☆21Aug 16, 2025Updated 7 months ago
- Neuron Activation☆26Nov 21, 2024Updated last year
- ☆13Jun 26, 2022Updated 3 years ago