google-research-datasets / egotempo
☆26Apr 26, 2025Updated 9 months ago
Alternatives and similar repositories for egotempo
Users who are interested in egotempo are comparing it to the repositories listed below.
- ☆13May 12, 2025Updated 9 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 8 months ago
- ☆13Jan 22, 2025Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 3 months ago
- Collaborative retina modelling across datasets and species.☆16Updated this week
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 5 months ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆29Aug 28, 2023Updated 2 years ago
- ☆13Jul 22, 2025Updated 6 months ago
- ☆16Sep 25, 2025Updated 4 months ago
- LLM system to deliver personalized podcasts around bookmarked papers on Zotero.☆15Jan 5, 2026Updated last month
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Deep Learning on Object-centric 3D Neural Fields (TPAMI)☆16Aug 10, 2024Updated last year
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆19Feb 14, 2025Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- Evaluate Multimodal LLMs as Embodied Agents☆57Feb 14, 2025Updated last year
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated last year
- [CVPR 2021] Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection☆27Jul 13, 2022Updated 3 years ago
- ☆27Jul 6, 2024Updated last year
- ☆32Dec 23, 2025Updated last month
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆13Jun 28, 2025Updated 7 months ago
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- [ICCV23] Official implementation of eP-ALM: Efficient Perceptual Augmentation of Language Models.☆27Oct 27, 2023Updated 2 years ago
- ☆82May 6, 2025Updated 9 months ago
- FlowR: Flowing from Sparse to Dense 3D Reconstructions (ICCV'25 Highlight)☆83Sep 20, 2025Updated 4 months ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 10 months ago
- Linux distribution for space-grade robotics on the BeagleV-Fire RISC-V platform + FPGA support☆21Dec 24, 2025Updated last month
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆52Mar 12, 2025Updated 11 months ago
- Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"☆36Jan 27, 2026Updated 2 weeks ago
- Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis (BMVC2023)☆12Jun 9, 2023Updated 2 years ago
- Disable YubiKey output on MacOS without a modifier key pressed☆10Aug 10, 2022Updated 3 years ago
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- A Google Chrome Extension that replaces the official New Tab page with a beautiful to-do list.☆12Mar 7, 2018Updated 7 years ago
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)☆39Mar 30, 2021Updated 4 years ago
- ECG analysis to classify anterior myocardial infarction cases.☆10May 17, 2017Updated 8 years ago
- Official repository of DriveWorld-VLA☆25Feb 1, 2026Updated 2 weeks ago
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year