NaQ: Leveraging Narrations as Queries to Supervise Episodic Memory. CVPR 2023.
☆17Jan 26, 2024Updated 2 years ago
Alternatives and similar repositories for NaQ
Users that are interested in NaQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and data for "Does Spatial Cognition Emerge in Frontier Models?"☆30Apr 18, 2025Updated last year
- [ECCV 2022] AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant☆23Jan 30, 2026Updated 3 months ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆37May 27, 2025Updated last year
- The official implementation of paper: Estimating Egocentric 3D Human Pose in Global Space.☆12Sep 23, 2023Updated 2 years ago
- Tracking Multiple Deformable Objects in Egocentric Videos (CVPR 2023)☆13Apr 10, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆13Jul 6, 2022Updated 3 years ago
- Low-Computation Egocentric Barcode Detector for the Blind☆10Jun 9, 2017Updated 8 years ago
- CatNet: Class Incremental 3D ConvNets for Lifelong Egocentric Gesture Recognition☆12Apr 21, 2020Updated 6 years ago
- ☆12Apr 6, 2023Updated 3 years ago
- Code release for the paper "Egocentric Video Task Translation" (CVPR 2023 Highlight)☆34Jun 12, 2023Updated 2 years ago
- Collection of gym environments with support for domain randomization☆10Dec 11, 2024Updated last year
- VisualEchoes Dataset (ECCV 2020)☆35Aug 31, 2021Updated 4 years ago
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- Automated Segmentation of Prohibited Items in X-ray Baggage Images Using Dense De-overlap Attention Snake, TMM 2022☆13Dec 28, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Trans4Map: Revisiting Holistic Top-down Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers☆17Oct 14, 2022Updated 3 years ago
- This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022.☆41Mar 7, 2023Updated 3 years ago
- Code for "Distributed, Egocentric Representations of Graphs for Detecting Critical Structures" (ICML 2019)☆20Aug 24, 2021Updated 4 years ago
- Reimplementation of NeRF (Neural Radiance Fields) (ECCV2020)☆10May 4, 2023Updated 3 years ago
- MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…☆13Feb 18, 2023Updated 3 years ago
- Code and models for the Action Recognition benchmark of Assembly101☆14Mar 26, 2023Updated 3 years ago
- Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"☆11Dec 20, 2023Updated 2 years ago
- One-Shot Unsupervised Cross Domain Detection☆13Nov 22, 2022Updated 3 years ago
- an X-ray image dataset for prohibited item segmentation☆10May 24, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆12Nov 16, 2020Updated 5 years ago
- Interface to stable-baselines3 APIs for training RL policies on gym-registered environments☆12Jan 24, 2024Updated 2 years ago
- Placeholder for code of BSP.☆11Aug 13, 2021Updated 4 years ago
- A simple and effective feature extractor for untrimmed videos☆13Sep 1, 2022Updated 3 years ago
- Annotations for the Mistake Detection benchmark of Assembly101☆12Aug 3, 2023Updated 2 years ago
- PONI: Potential Functions for ObjectGoal Navigation with Interaction-free Learning. CVPR 2022 (Oral).☆117Dec 29, 2022Updated 3 years ago
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 7 months ago
- Edit and Generate Anything in 3D world!☆13Apr 15, 2023Updated 3 years ago
- Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)☆108Jan 23, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Unified Framework for Video-Language Understanding☆62Jun 17, 2023Updated 2 years ago
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"☆11Aug 10, 2023Updated 2 years ago
- This is a repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"☆13Aug 22, 2025Updated 9 months ago
- Span-based Localizing Network for Natural Language Video Localization (ACL 2020)☆113Oct 15, 2021Updated 4 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆29Apr 16, 2024Updated 2 years ago
- ☆138May 30, 2024Updated last year
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago