This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.
☆41Mar 7, 2023Updated 3 years ago
Alternatives and similar repositories for vq2d_cvpr
Users that are interested in vq2d_cvpr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆132May 30, 2024Updated last year
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Jun 21, 2022Updated 3 years ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"☆50Jan 27, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆260May 9, 2024Updated last year
- Evaluation measures for the EPIC-KITCHENS-100 Action Detection challenge☆16Feb 10, 2026Updated 2 months ago
- Self-supervised algorithm for learning representations from ego-centric video data. Code is tested on EPIC-Kitchens-100 and Ego4D in PyTo…☆13Oct 23, 2022Updated 3 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024☆133May 11, 2025Updated 11 months ago
- This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".☆24May 6, 2022Updated 3 years ago
- NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.☆17Jan 26, 2024Updated 2 years ago
- Environment Predictive Coding for Visual Navigation. ICLR 2022.☆15Dec 10, 2022Updated 3 years ago
- Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks☆17Dec 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆56Aug 8, 2023Updated 2 years ago
- ☆80Sep 4, 2022Updated 3 years ago
- ☆21Mar 22, 2023Updated 3 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆32Aug 8, 2023Updated 2 years ago
- ☆29Oct 26, 2015Updated 10 years ago
- The official implementation of paper: Estimating Egocentric 3D Human Pose in Global Space.☆12Sep 23, 2023Updated 2 years ago
- PyTorch implementation of the CVPR 2018 (oral) paper "MapNet: An Allocentric Spatial Memory for Mapping Environments" (Henriques and Veda…☆25Dec 5, 2019Updated 6 years ago
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Sep 5, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆23Jan 30, 2026Updated 2 months ago
- Target Transformed Regression for Accurate Tracking☆21Dec 5, 2021Updated 4 years ago
- [CVPR 2023] Egocentric Audio-Visual Object Localization☆27Jan 6, 2024Updated 2 years ago
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- For Ego4D VQ3D Task☆22Jan 9, 2024Updated 2 years ago
- Gated Pretrained Transformer model for robust denoised sequence-to-sequence modelling☆10May 29, 2021Updated 4 years ago
- Sparse Fourier Backpropagation in Cryo-EM Reconstruction☆12Dec 3, 2023Updated 2 years ago
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- Code release for "Learning Video Representations from Large Language Models"☆533Oct 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Procedural city generation.☆13Oct 15, 2022Updated 3 years ago
- [KDD 2023] code for "Test accuracy vs. generalization gap: model selection in NLP without accessing training or testing data" https://arx…☆12Oct 17, 2022Updated 3 years ago
- BasicTAD: an Astounding RGB-Only Baselinefor Temporal Action Detection☆52Jun 10, 2023Updated 2 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆62Jun 12, 2023Updated 2 years ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024☆11Nov 7, 2024Updated last year
- ☆31Dec 8, 2023Updated 2 years ago
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025☆13Aug 1, 2025Updated 8 months ago