This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.
☆41Mar 7, 2023Updated 3 years ago
Alternatives and similar repositories for vq2d_cvpr
Users that are interested in vq2d_cvpr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆139May 30, 2024Updated 2 years ago
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Jun 21, 2022Updated 3 years ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jun 2, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 4 months ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆260May 9, 2024Updated 2 years ago
- Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge☆16Feb 10, 2026Updated 4 months ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024☆136May 11, 2025Updated last year
- This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".☆24May 6, 2022Updated 4 years ago
- NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.☆17Jan 26, 2024Updated 2 years ago
- Environment Predictive Coding for Visual Navigation. ICLR 2022.☆15Dec 10, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks☆17Dec 8, 2022Updated 3 years ago
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆55Aug 8, 2023Updated 2 years ago
- ☆80Sep 4, 2022Updated 3 years ago
- ☆21Mar 22, 2023Updated 3 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆32Aug 8, 2023Updated 2 years ago
- The official implementation of paper: Estimating Egocentric 3D Human Pose in Global Space.☆12Sep 23, 2023Updated 2 years ago
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Sep 5, 2022Updated 3 years ago
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆54Mar 3, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆23Jan 30, 2026Updated 4 months ago
- Inference code for "StylePeople: A Generative Model of Fullbody Human Avatars" paper☆25Mar 2, 2023Updated 3 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆27Jan 6, 2024Updated 2 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- For Ego4D VQ3D Task☆22Jan 9, 2024Updated 2 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 5 years ago
- Official PyTorch code of GroundVQA (CVPR'24)☆63Sep 13, 2024Updated last year
- Convolutional Neural Networks☆12Oct 5, 2017Updated 8 years ago
- Code release for "Learning Video Representations from Large Language Models"☆533Oct 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆52Jun 10, 2023Updated 3 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆63Jun 12, 2023Updated 3 years ago
- ☆31Dec 8, 2023Updated 2 years ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 3 years ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆58Apr 15, 2024Updated 2 years ago
- some useful info for apply PhD in top graduate school☆27Aug 6, 2017Updated 8 years ago
- Home Action Genome: Cooperative Contrastive Action Understanding☆22Nov 8, 2021Updated 4 years ago