A dataset for multi-object multi-actor activity parsing
☆44Sep 29, 2023Updated 2 years ago
Alternatives and similar repositories for moma
Users that are interested in moma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- From scratch Python implementation of a genetic algorithm that recreates a target image☆11Jan 30, 2023Updated 3 years ago
- A video database bridging human actions and human-object relationships☆165Jun 30, 2020Updated 5 years ago
- Learning Spatial Common Sense with Geometry-Aware Recurrent Networks☆56Dec 16, 2019Updated 6 years ago
- ☆37Dec 20, 2023Updated 2 years ago
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".☆28Jan 3, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Scripts for pushing models to huggingface repos☆15Mar 13, 2026Updated 3 months ago
- Repo for Predicting Livelihood Indicators from Community-Generated Street-Level Imagery (AAAI21).☆21Dec 8, 2022Updated 3 years ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆46Apr 9, 2025Updated last year
- ☆33Sep 22, 2024Updated last year
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆43Apr 17, 2023Updated 3 years ago
- Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)☆35Sep 17, 2022Updated 3 years ago
- Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold (AAAI 2022)☆11Sep 27, 2022Updated 3 years ago
- CVPR'25 official code for O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models☆16Sep 19, 2025Updated 8 months ago
- [CVPR 2022] Official PyTorch implementation of "Detector-Free Weakly Supervised Group Activity Recognition"☆27Jan 3, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆19May 19, 2024Updated 2 years ago
- LEAP is a novel tool for discovering latent temporal causal relations.☆17Oct 18, 2021Updated 4 years ago
- Official code for our CVPR 2023 paper: Test of Time: Instilling Video-Language Models with a Sense of Time☆46Jun 11, 2024Updated 2 years ago
- Pytorch implementation of our paper Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs, which i…☆47Jul 11, 2023Updated 2 years ago
- ☆16Jan 6, 2025Updated last year
- Official code repository for "Video-Mined Task Graphs for Keystep Recognition in Instructional Videos" arXiv, 2023☆14Apr 1, 2024Updated 2 years ago
- ☆13Jul 8, 2023Updated 2 years ago
- [ICML'25] Official code for "ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization"☆18Mar 15, 2026Updated 2 months ago
- [ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion"☆23Jun 9, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Learning about objects and their properties by interacting with them☆12Oct 21, 2020Updated 5 years ago
- ☆16Jun 4, 2023Updated 3 years ago
- Face recognition☆11Jun 20, 2019Updated 6 years ago
- Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality☆34Jan 19, 2025Updated last year
- Model-Agnostic Meta-Learning for HDR Image Reconstruction. By learning the common structure between all LDR-to-HDR conversion tasks, our …☆11May 10, 2021Updated 5 years ago
- [IJCV 2025] VLPrompt-PSG: Vision-Language Prompting for Panoptic Scene Graph Generation☆28Sep 24, 2024Updated last year
- Official repo for ECCV 2020 paper - RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition☆99Sep 6, 2020Updated 5 years ago
- ☆13Jul 10, 2024Updated last year
- Commonsense Scene Graph-based Target Localization for Object Search☆15Apr 2, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training☆137May 28, 2024Updated 2 years ago
- ☆10Oct 7, 2023Updated 2 years ago
- A PyTorch implementation of VIOLET☆138Dec 17, 2023Updated 2 years ago
- 简单的pagerank基础上加上稀疏化矩阵化并行化等处理☆12Oct 8, 2019Updated 6 years ago
- A PyTorch implementation of visual interaction networks☆12Jul 1, 2019Updated 6 years ago
- code for downloading videos from HowTo100M dataset☆18May 13, 2021Updated 5 years ago
- Awesome Self-Supervised Vision Learning☆11Mar 27, 2024Updated 2 years ago