This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.
☆41Mar 7, 2023Updated 3 years ago
Alternatives and similar repositories for vq2d_cvpr
Users that are interested in vq2d_cvpr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆137May 30, 2024Updated last year
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Jun 21, 2022Updated 3 years ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 3 months ago
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆260May 9, 2024Updated last year
- Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge☆16Feb 10, 2026Updated 2 months ago
- Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…☆13Oct 23, 2022Updated 3 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".☆24May 6, 2022Updated 3 years ago
- NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.☆17Jan 26, 2024Updated 2 years ago
- Environment Predictive Coding for Visual Navigation. ICLR 2022.☆15Dec 10, 2022Updated 3 years ago
- Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks☆17Dec 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆55Aug 8, 2023Updated 2 years ago
- ☆80Sep 4, 2022Updated 3 years ago
- ☆21Mar 22, 2023Updated 3 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆32Aug 8, 2023Updated 2 years ago
- ☆29Oct 26, 2015Updated 10 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆39Aug 29, 2023Updated 2 years ago
- The official implementation of paper: Estimating Egocentric 3D Human Pose in Global Space.☆12Sep 23, 2023Updated 2 years ago
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated 2 years ago
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆54Mar 3, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Target Transformed Regression for Accurate Tracking☆21Dec 5, 2021Updated 4 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆27Jan 6, 2024Updated 2 years ago
- For Ego4D VQ3D Task☆22Jan 9, 2024Updated 2 years ago
- Sparse Fourier Backpropagation in Cryo-EM Reconstruction☆12Dec 3, 2023Updated 2 years ago
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- Convolutional Neural Networks☆12Oct 5, 2017Updated 8 years ago
- Code release for "Learning Video Representations from Large Language Models"☆533Oct 1, 2023Updated 2 years ago
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆52Jun 10, 2023Updated 2 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆62Jun 12, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Nov 7, 2024Updated last year
- ☆31Dec 8, 2023Updated 2 years ago
- ROS node for triggering cameras using GPIO on Jetson (targeting ROSCubeX, but easily adaptable to other platforms)☆13Apr 27, 2026Updated last week
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆13Apr 11, 2023Updated 3 years ago
- some useful info for apply PhD in top graduate school☆27Aug 6, 2017Updated 8 years ago
- source code of our RaNet in EMNLP 2021☆30May 31, 2022Updated 3 years ago
- Test-Time Training on Video Streams☆70Jul 24, 2023Updated 2 years ago