[CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos
☆24Feb 24, 2023Updated 3 years ago
Alternatives and similar repositories for SVIP-Sequence-VerIfication-for-Procedures-in-Videos
Users that are interested in SVIP-Sequence-VerIfication-for-Procedures-in-Videos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition☆17Mar 26, 2024Updated 2 years ago
- (CVPR 2023) Official implemention of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos…☆31Apr 2, 2024Updated 2 years ago
- PyTorch port of RepNet: Counting Out Time - Class Agnostic Video Repetition Counting in the Wild☆12Oct 27, 2021Updated 4 years ago
- FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment☆149Aug 26, 2024Updated last year
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆142Oct 10, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [arXiv prepreint] Deep Learning Assisted Optimization for 3D Reconstruction from Single 2D Line Drawings☆20May 14, 2025Updated 11 months ago
- Rank-aware Attention Network from 'The Pros and Cons: Rank-aware Temporal Attention for Skill Determination in Long Videos'☆30Apr 16, 2021Updated 5 years ago
- Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.☆95May 19, 2023Updated 2 years ago
- ☆17May 14, 2025Updated 11 months ago
- Generative Deformable Radiance Fields for Disentangled Image Synthesis of Topology-Varying Objects☆21Oct 7, 2022Updated 3 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆31Aug 3, 2022Updated 3 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆64Mar 9, 2022Updated 4 years ago
- Bidirectional Mapping between Action Physical-Semantic Space☆34Sep 7, 2025Updated 7 months ago
- [CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.☆50Aug 31, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is an official pytorch implementation of Learning To Recognize Procedural Activities with Distant Supervision. In this repository, w…☆43Feb 21, 2023Updated 3 years ago
- Code for Semantic-Aware Dynamic Generation Networks for Few-Shot Human-Object Interaction Recognition☆10May 26, 2021Updated 4 years ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆16Feb 22, 2025Updated last year
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection☆10Jun 19, 2022Updated 3 years ago
- ☆12Jul 8, 2023Updated 2 years ago
- [EMNLP 2020] What is More Likely to Happen Next? Video-and-Language Future Event Prediction☆51Aug 20, 2022Updated 3 years ago
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition".☆27Jan 3, 2023Updated 3 years ago
- ☆18Oct 3, 2024Updated last year
- Layout-Guided Novel View Synthesis from a Single Indoor Panorama (CVPR 2021)☆36Oct 19, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Music-Aligned Holistic 3D Dance Generation via Hierarchical Motion Modeling [ICCV 2025] Official PyTorch implementation☆36Nov 11, 2025Updated 5 months ago
- ☆18Jul 26, 2023Updated 2 years ago
- [ACM MM 2021] TSA-Net: Tube Self-Attention Network for Action Quality Assessment☆40Jun 6, 2023Updated 2 years ago
- Combining "segment-anything" with MOT, it create the era of "MOTS"☆155May 29, 2023Updated 2 years ago
- Python transparent bindings for LSD (Line Segment Detector)☆15Mar 29, 2023Updated 3 years ago
- TESGNN: 3D Temporal Equivariant Scene Graph Neural Networks (published at TMLR)☆14Nov 2, 2025Updated 5 months ago
- ☆19Apr 5, 2024Updated 2 years ago
- Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023☆12Oct 3, 2023Updated 2 years ago
- Offical implementation of "MetaLA: Unified Optimal Linear Approximation to Softmax Attention Map" (NeurIPS2024 Oral)☆36Jan 18, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for paper "RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text"☆18May 30, 2024Updated last year
- [NeurIPS'24 spotlight] MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning. [TPAMI'25] MECD+☆47Feb 11, 2026Updated 2 months ago
- The website of Matterport3D-Layout.☆18Sep 9, 2020Updated 5 years ago
- Custom layers for pytorch☆15Mar 16, 2024Updated 2 years ago
- Beyond Universal Saliency: Personalized Saliency Prediction with Multi-task CNN (IJCAI 2017 and TPAMI)☆11Jan 17, 2019Updated 7 years ago
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆14Jan 10, 2024Updated 2 years ago
- [CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment☆45Apr 9, 2024Updated 2 years ago