Official implementation of the paper "Hierarchical Vector Quantization for Unsupervised Action Segmentation"
☆26Feb 6, 2026Updated last month
Alternatives and similar repositories for HVQ
Users that are interested in HVQ are comparing it to the libraries listed below.
- [ECCV2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation☆23May 29, 2025Updated 9 months ago
- Official implementation of the CVPR2022 paper "Learning of Global Objective for Network Flow in Multi-Object Tracking"☆18Dec 30, 2025Updated 2 months ago
- [3DV2026] Official repository for "CamC2V: Context-aware Controllable Video Generation"☆14Nov 11, 2025Updated 4 months ago
- [CVPR 2025] MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation☆24Jun 13, 2025Updated 9 months ago
- ☆25Feb 5, 2026Updated last month
- Simple template for quick prototyping and standardization of deep learning projects☆11Dec 29, 2023Updated 2 years ago
- Official implementation of: "PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning" by…☆18Jun 2, 2025Updated 9 months ago
- [NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts☆13Jan 30, 2024Updated 2 years ago
- [ICCV 2023] Latent Action Composition for Skeleton-based Action Segmentation☆21Oct 25, 2023Updated 2 years ago
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 5 months ago
- Official Implementation for "Fast Weakly Supervised Action Segmentation Using Mutual Consistency" - TPAMI 2021☆21Aug 30, 2021Updated 4 years ago
- 👆PyTorch Implementation of JEDi Metric described in "Beyond FVD: Enhanced Evaluation Metrics for Video Generation Quality"☆30Dec 8, 2024Updated last year
- Efficient Video Prediction via Sparsely Conditioned Flow Matching. In ICCV, 2023.☆24Jun 5, 2024Updated last year
- ☆30Aug 6, 2025Updated 7 months ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆49Jun 21, 2024Updated last year
- [ICRA 2024] SLCF-Net: Sequential LiDAR-Camera Fusion for Semantic Scene Completion using a 3D Recurrent U-Net☆29Jul 23, 2024Updated last year
- Humans-in-Kitchens Dataset API ([NeurIPS 2023 Dataset and Benchmark Track])☆41Sep 29, 2024Updated last year
- ☆41Jan 26, 2026Updated last month
- A simple semi-automatic labelling tool for semantic segmentation masks using SAM as support.☆15Apr 17, 2024Updated last year
- Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models [CVPR 2025]☆79Jun 24, 2025Updated 9 months ago
- D-Robotics Robotic Manipulation☆37Mar 14, 2026Updated last week
- Personal reading list for learning-based long-horizon goal reaching methods☆17Nov 26, 2020Updated 5 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- Independent PyTorch Implementation of Object Scene Representation Transformer☆49May 25, 2023Updated 2 years ago
- Universal Visual Decomposer: Long-Horizon Manipulation Made Easy☆69Jan 20, 2025Updated last year
- [ICIP2023] Code for the paper 'Action Anticipation with Goal Consistency'☆12Apr 5, 2024Updated last year
- Python package for calculating Mahalanobis distances from NumPy arrays☆15Jun 22, 2022Updated 3 years ago
- The official implementation of Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation.☆17Sep 19, 2022Updated 3 years ago
- MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments☆13Jul 8, 2024Updated last year
- Pytorch Implementation of "HandNeRF: Learning to Reconstruct Hand-Object Interaction Scene from a Single RGB Image", In ICRA 2024☆26Mar 27, 2024Updated last year
- [CVPR2025] Hand-held Object Reconstruction from RGB Video with Dynamic Interaction☆33Sep 1, 2025Updated 6 months ago
- This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning mo…☆96Oct 21, 2025Updated 5 months ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated last year
- ☆25Mar 14, 2026Updated last week
- The code for the paper "Embracing Collaboration Over Competition: Condensing Multiple Prompts for Visual In-Context Learning" (CVPR'25).☆15Sep 25, 2025Updated 5 months ago
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆19May 20, 2023Updated 2 years ago
- Menagerie of video models trained on various video datasets☆10Oct 13, 2024Updated last year
- Multi-Pedestrian Tracking in Monocular Calibrated Cameras☆11Feb 21, 2014Updated 12 years ago
- ☆20Jan 17, 2026Updated 2 months ago