google-research-datasets/egotempo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/google-research-datasets/egotempo)

google-research-datasets / egotempo

☆26

Alternatives and similar repositories for egotempo

Users that are interested in egotempo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TdP-2025 / TdP-2025
View on GitHub
☆12Jul 22, 2025Updated last year
mlvlab / ST-VLM
View on GitHub
☆13Mar 28, 2025Updated last year
lbaermann / qaego4d
View on GitHub
Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"
☆31Aug 28, 2023Updated 2 years ago
arubique / OCCAM
View on GitHub
This is an implementation of the paper "Are We Done with Object-Centric Learning?"
☆14Jun 21, 2026Updated last month
jylei16 / Imagine-e
View on GitHub
☆14Jan 22, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sapeirone / EgoPack
View on GitHub
Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…
☆24Jun 13, 2024Updated 2 years ago
danaesavi / ImageChain
View on GitHub
This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…
☆15Jun 4, 2025Updated last year
filipgdorm / eco-llm
View on GitHub
☆14Mar 20, 2026Updated 4 months ago
xuboshen / EgoNCEpp
View on GitHub
[ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?
☆14Apr 11, 2025Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
houzhijian / GroundNLQ
View on GitHub
The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023
☆18Jan 23, 2024Updated 2 years ago
chu0802 / SnD
View on GitHub
This is an official implementation of our work, Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on V…
☆17Sep 24, 2025Updated 10 months ago
prs-eth / FILM-Ensemble
View on GitHub
[NeurIPS 2022] FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation
☆28Dec 19, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
pro-assist / ProAssist
View on GitHub
☆20Jul 21, 2025Updated last year
Chiaraplizz / ARGO1M-What-can-a-cook
View on GitHub
☆11Jul 14, 2023Updated 3 years ago
gabrielegoletto / AMEGO
View on GitHub
Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024
☆45Dec 7, 2024Updated last year
fudan-zvg / DDMP
View on GitHub
[CVPR 2021] Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection
☆26Jul 13, 2022Updated 4 years ago
lxa9867 / QSD
View on GitHub
[CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"
☆12Feb 27, 2024Updated 2 years ago
BatsResearch / ex2
View on GitHub
If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions
☆17Apr 4, 2024Updated 2 years ago
WPR001 / Ego-ST
View on GitHub
☆16Sep 25, 2025Updated 10 months ago
prs-eth / SnowDepthEstimation
View on GitHub
☆10Jan 11, 2024Updated 2 years ago
Chiaraplizz / OSNOM
View on GitHub
Official repository from the paper "Spatial Cognition from Egocentric Video: Out of Sight, Not Out of Mind"
☆17Mar 18, 2025Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
mulin-xml / FudanConnection
View on GitHub
复旦大学有线网连接工具
☆12Oct 14, 2020Updated 5 years ago
gianluigilopardo / smace
View on GitHub
Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022
☆15Apr 17, 2023Updated 3 years ago
amazon-science / few-shot-object-detection-benchmark
View on GitHub
☆13Jan 20, 2023Updated 3 years ago
zihuixue / MKE
View on GitHub
[ICCV 2021] Multimodal Knowledge Expansion
☆10Aug 28, 2021Updated 4 years ago
LAION-AI / scaling-laws-for-comparison
View on GitHub
☆22May 12, 2026Updated 2 months ago
paolotron / D3G
View on GitHub
Visual Relationship Reasoning for Grasp Planning
☆19May 22, 2025Updated last year
open-retina / open-retina
View on GitHub
Collaborative retina modelling across datasets and species.
☆20Updated this week
zhyever / DepthFormer
View on GitHub
Offical Implement of DepthFormer
☆12Oct 25, 2022Updated 3 years ago
RAIVNLab / VideoNet
View on GitHub
CVPR '26 Highlight
☆24May 6, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
shengliu66 / LC
View on GitHub
Official Implementation of Avoiding spurious correlations via logit correction
☆17May 6, 2023Updated 3 years ago
Annusha / refam
View on GitHub
Official implementaiton of RefAM: Attention Magnets for Zero-Shot Referral Segmentaiton
☆16Feb 6, 2026Updated 5 months ago
mshukor / eP-ALM
View on GitHub
[ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.
☆27Oct 27, 2023Updated 2 years ago
princeton-pli / VLM_S2H
View on GitHub
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
☆19Jun 3, 2025Updated last year
franciszchen / SCA-Net
View on GitHub
☆10Oct 7, 2023Updated 2 years ago
EPFL-VILAB / fm-vision-evals
View on GitHub
How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks, ICLR 2026
☆72Mar 6, 2026Updated 4 months ago
JianxGao / C2F-Seg
View on GitHub
☆14Dec 21, 2023Updated 2 years ago