Human-centric environment representations from egocentric video
☆14 · Feb 5, 2026 · Updated 2 months ago
Alternatives and similar repositories for ego-env
Users that are interested in ego-env are comparing it to the libraries listed below.
- Unofficial PyTorch implementation of MapNet: An Allocentric Spatial Memory for Mapping Environments ☆12 · Jun 4, 2020 · Updated 5 years ago
- ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model ☆16 · Apr 7, 2026 · Updated last week
- Data release for Step Differences in Instructional Video (CVPR24) ☆14 · Jun 19, 2024 · Updated last year
- This is the official code release for "SceNeRFlow: Time-Consistent Reconstruction of General Dynamic Scenes" (3DV 2024), a NeRF-based met… ☆20 · Jan 15, 2025 · Updated last year
- Environment Predictive Coding for Visual Navigation. ICLR 2022. ☆15 · Dec 10, 2022 · Updated 3 years ago
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model. ☆22 · Jan 22, 2024 · Updated 2 years ago
- ☆44 · Jan 13, 2026 · Updated 3 months ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024 ☆43 · Dec 7, 2024 · Updated last year
- For Ego4D VQ3D Task ☆22 · Jan 9, 2024 · Updated 2 years ago
- Pytorch implementation for Egoinstructor at CVPR 2024 ☆28 · Dec 1, 2024 · Updated last year
- [ICCV 2025] Object-centric Video Question Answering with Visual Grounding and Referring ☆25 · Aug 8, 2025 · Updated 8 months ago
- [CVPR 2026] UFVideo: Towards Unified Fine-Grained Video Cooperative Understanding with Large Language Models ☆37 · Feb 21, 2026 · Updated last month
- Code for LifelongMemory: Leveraging LLMs for Answering Queries in Long-form Egocentric Videos ☆28 · Oct 27, 2025 · Updated 5 months ago
- X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024 ☆11 · Nov 7, 2024 · Updated last year
- ☆10 · Apr 22, 2021 · Updated 4 years ago
- Official implementation of `Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning`, CVPR 2025 ☆13 · Aug 1, 2025 · Updated 8 months ago
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos' ☆13 · Jun 16, 2024 · Updated last year
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses … ☆25 · Jan 9, 2024 · Updated 2 years ago
- ☆13 · Nov 28, 2021 · Updated 4 years ago
- Efficient Feature Extraction for High-resolution Video Frame Interpolation (BMVC 2022) ☆13 · Aug 24, 2023 · Updated 2 years ago
- [CVPR 2025] Official Pytorch implementation of "Learning with Noisy Triplet Correspondence for Composed Image Retrieval". ☆24 · Jun 9, 2025 · Updated 10 months ago
- Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models" ☆19 · Feb 1, 2026 · Updated 2 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms ☆18 · Oct 8, 2023 · Updated 2 years ago
- [CVPR 2023] Code for the paper "Masked Images Are Counterfactual Samples for Robust Fine-tuning" ☆14 · Mar 24, 2023 · Updated 3 years ago
- ☆66 · Feb 1, 2026 · Updated 2 months ago
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation". ☆13 · Feb 24, 2025 · Updated last year
- Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation. CVPR 2022 ☆36 · Oct 27, 2022 · Updated 3 years ago
- COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark ☆15 · Aug 22, 2024 · Updated last year
- HD-EPIC Python script to download the entire datasets or parts of it ☆19 · Oct 7, 2025 · Updated 6 months ago
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch. ☆10 · Oct 27, 2023 · Updated 2 years ago
- Composed Video Retrieval ☆62 · May 2, 2024 · Updated last year
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix… ☆12 · Jun 23, 2018 · Updated 7 years ago
- [CVPR'25] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering ☆47 · Jun 19, 2025 · Updated 9 months ago
- This repo contains the code for the recipe of the winning entry to the Ego4d VQ2D challenge at CVPR 2022. ☆41 · Mar 7, 2023 · Updated 3 years ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23' ☆14 · Nov 4, 2023 · Updated 2 years ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023] ☆108 · Jul 2, 2024 · Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me… ☆13 · May 25, 2023 · Updated 2 years ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024) ☆46 · Apr 9, 2025 · Updated last year