NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.
☆17Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for NaQ
Users that are interested in NaQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2022] Egocentric Video-Language Pretraining☆258May 9, 2024Updated last year
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆34May 27, 2025Updated 10 months ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 11 months ago
- Tracking Multiple Deformable Objects in Egocentric Videos (CVPR 2023)☆13Apr 10, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Low-Computation Egocentric Barcode Detector for the Blind☆10Jun 9, 2017Updated 8 years ago
- CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition☆12Apr 21, 2020Updated 5 years ago
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos☆102Oct 30, 2022Updated 3 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- Trans4Map: Revisiting Holistic Top-down Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers☆17Oct 14, 2022Updated 3 years ago
- This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.☆41Mar 7, 2023Updated 3 years ago
- Vue component for Plaid Link☆10Sep 30, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆71Jan 29, 2024Updated 2 years ago
- Code for "Distributed, Egocentric Representations of Graphs for Detecting Critical Structures" (ICML 2019)☆20Aug 24, 2021Updated 4 years ago
- Reimplementation of NeRF (Neural Radiance Fields) (ECCV2020)☆10May 4, 2023Updated 2 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…☆13Feb 18, 2023Updated 3 years ago
- Code and models for the Action Recognition benchmark of Assembly101☆13Mar 26, 2023Updated 3 years ago
- Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"☆10Dec 20, 2023Updated 2 years ago
- Annotations for the Mistake Detection benchmark of Assembly101☆11Aug 3, 2023Updated 2 years ago
- an X-ray image dataset for prohibited item segmentation☆10May 24, 2021Updated 4 years ago
- One-Shot Unsupervised Cross Domain Detection☆13Nov 22, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆12Nov 16, 2020Updated 5 years ago
- Interface to stable-baselines3 APIs for training RL policies on gym-registered environments☆12Jan 24, 2024Updated 2 years ago
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- Python package for egocentric network analysis☆14Feb 6, 2018Updated 8 years ago
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 5 months ago
- Noise Contrastive Test-Time Training☆13Mar 11, 2024Updated 2 years ago
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)☆108Jan 23, 2025Updated last year
- A Unified Framework for Video-Language Understanding☆61Jun 17, 2023Updated 2 years ago
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"☆11Aug 10, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Unofficial faiss wheel builder for NVIDIA GPU☆34Mar 8, 2026Updated 3 weeks ago
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆14Aug 22, 2025Updated 7 months ago
- ☆16Dec 11, 2024Updated last year
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020)☆112Oct 15, 2021Updated 4 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Apr 16, 2024Updated last year
- ☆132May 30, 2024Updated last year
- Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning☆15Dec 12, 2023Updated 2 years ago