[CVPR 2023] HierVL: Learning Hierarchical Video-Language Embeddings
☆46Aug 14, 2023Updated 2 years ago
Alternatives and similar repositories for HierVL
Users that are interested in HierVL are comparing it to the libraries listed below.
- Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge☆16Feb 10, 2026Updated 2 months ago
- The official implementation of Error Detection in Egocentric Procedural Task Videos☆24Sep 20, 2025Updated 7 months ago
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆55Aug 8, 2023Updated 2 years ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Apr 18, 2026Updated 2 weeks ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- Video narrator written in Python/GTK using vlc-lib☆25Jun 22, 2022Updated 3 years ago
- Data release for Step Differences in Instructional Video (CVPR24)☆14Jun 19, 2024Updated last year
- Annotations for the Mistake Detection benchmark of Assembly101☆12Aug 3, 2023Updated 2 years ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆57Apr 15, 2024Updated 2 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.☆35Jan 12, 2024Updated 2 years ago
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆32Jun 9, 2025Updated 10 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Sep 9, 2024Updated last year
- ☆12Apr 6, 2023Updated 3 years ago
- ☆51Sep 13, 2024Updated last year
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆108Jul 2, 2024Updated last year
- Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…☆19Apr 5, 2024Updated 2 years ago
- A python library to find differences between audio and transcriptions☆20Nov 14, 2023Updated 2 years ago
- Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024☆21Dec 30, 2023Updated 2 years ago
- Code release of our NeurIPS 18 paper "A flexible model for training action localization with varying levels of supervision"☆16Dec 28, 2018Updated 7 years ago
- Code for the paper "Differentiable Task Graph Learning: Procedural Activity Representation and Online Mistake Detection from Egocentric V…☆23Jan 9, 2025Updated last year
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos☆25Mar 20, 2024Updated 2 years ago
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆83Oct 6, 2023Updated 2 years ago
- ☆21Jan 17, 2025Updated last year
- Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Gr…☆153Aug 21, 2024Updated last year
- PyTorch implementation of R-MAE https://arxiv.org/abs/2306.05411☆111Jun 9, 2023Updated 2 years ago
- ☆26Aug 4, 2020Updated 5 years ago
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆34Jun 26, 2024Updated last year
- [ICML 2024] Official repository of ICML 2024 - RoboMP2: A Robotic Multimodal Perception-Planning Framework with Multimodal Large Language…☆11Apr 4, 2026Updated last month
- Official PyTorch implementation Source code for Adaptive Self-Training Framework for Fine-grained Scene Graph generation (ST-SGG), accept…☆22Jan 30, 2024Updated 2 years ago
- UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or …☆238Apr 15, 2024Updated 2 years ago
- [ICCV'25] HERMES: temporal-coHERent long-forM understanding with Episodes and Semantics☆37Sep 10, 2025Updated 7 months ago
- ☆53Jan 3, 2023Updated 3 years ago
- Code release for the CVPR'23 paper titled "PartDistillation Learning part from Instance Segmentation"☆60Dec 17, 2023Updated 2 years ago
- ☆19May 27, 2023Updated 2 years ago
- Official repository for the paper "End-to-End Visual Editing with a Generatively Pre-Trained Artist", which is accepted at ECCV 2022. Her…☆29Dec 28, 2022Updated 3 years ago
- Deficiency-Aware Masked Transformer for Video Inpainting☆55Dec 11, 2023Updated 2 years ago
- ☆13May 21, 2024Updated last year