This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.
☆41Mar 7, 2023Updated 3 years ago
Alternatives and similar repositories for vq2d_cvpr
Users that are interested in vq2d_cvpr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆138May 30, 2024Updated last year
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- [CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling☆17Jun 21, 2022Updated 3 years ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- Human-centric environment representations from egocentric video☆15Feb 5, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆260May 9, 2024Updated 2 years ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024☆134May 11, 2025Updated last year
- This is a repository for my work on the paper "Oracle Guided Image Synthesis with Relative Queries".☆24May 6, 2022Updated 4 years ago
- Environment Predictive Coding for Visual Navigation. ICLR 2022.☆15Dec 10, 2022Updated 3 years ago
- Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks☆17Dec 8, 2022Updated 3 years ago
- [CVPR 2023] Official code for "Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations"☆55Aug 8, 2023Updated 2 years ago
- ☆80Sep 4, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆21Mar 22, 2023Updated 3 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆32Aug 8, 2023Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆39Aug 29, 2023Updated 2 years ago
- PyTorch implementation of the CVPR 2018 (oral) paper "MapNet: An Allocentric Spatial Memory for Mapping Environments" (Henriques and Veda…☆25Dec 5, 2019Updated 6 years ago
- [SIGGRAPH 2017] Realistic diffraction is the official implementation of the paper Practical Acquisition and Rendering of Diffraction Effe…☆16Jun 13, 2021Updated 4 years ago
- Combining GroundingDINO, Segment Anything, ZoeDepth and Multiview Compressive Coding for 3D reconstruction to reconstruct 3D model of the…☆13May 3, 2023Updated 3 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Sep 5, 2022Updated 3 years ago
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆23Jan 30, 2026Updated 3 months ago
- Target Transformed Regression for Accurate Tracking☆21Dec 5, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- My implementation (PyTorch) for the paper SST: Single-Stream Temporal Action Proposals (http://vision.stanford.edu/pdf/buch2017cvpr.pdf).☆10Dec 8, 2022Updated 3 years ago
- Official PyTorch code of GroundVQA (CVPR'24)☆63Sep 13, 2024Updated last year
- Convolutional Neural Networks☆12Oct 5, 2017Updated 8 years ago
- ☆25Feb 6, 2023Updated 3 years ago
- Code release for "Learning Video Representations from Large Language Models"☆533Oct 1, 2023Updated 2 years ago
- Code for “Pretrained Language Models as Visual Planners for Human Assistance”☆62Jun 12, 2023Updated 2 years ago
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025☆12Aug 1, 2025Updated 9 months ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆57Apr 15, 2024Updated 2 years ago
- A2B Neural Rendering of Ambisonic Recordings to Binaural☆18Aug 5, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Home Action Genome: Cooperative Contrastive Action Understanding☆22Nov 8, 2021Updated 4 years ago
- ☆11Feb 16, 2026Updated 3 months ago
- source code of our RaNet in EMNLP 2021☆30May 31, 2022Updated 3 years ago
- Test-Time Training on Video Streams☆70Jul 24, 2023Updated 2 years ago
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval".☆24Jun 9, 2025Updated 11 months ago
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆31Sep 5, 2023Updated 2 years ago
- Implementation for Bottom-Up Temporal Action Localization with Mutual Regularization (ECCV2020)☆47Dec 2, 2020Updated 5 years ago