☆26Apr 26, 2025Updated 10 months ago
Alternatives and similar repositories for egotempo
Users who are interested in egotempo are comparing it to the repositories listed below
- ☆13Jan 22, 2025Updated last year
- Collaborative retina modelling across datasets and species.☆18Updated this week
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆16Jun 3, 2025Updated 9 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 5 months ago
- ☆13May 12, 2025Updated 9 months ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 9 months ago
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆29Aug 28, 2023Updated 2 years ago
- ☆16Sep 25, 2025Updated 5 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [TPAMI 2024] Deep Learning on Object-centric 3D Neural Fields☆16Aug 10, 2024Updated last year
- If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions☆17Apr 4, 2024Updated last year
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- Code for "CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally"☆20Feb 27, 2026Updated last week
- [NeurIPS 2022] FiLM-Ensemble: Probabilistic Deep Learning via Feature-wise Linear Modulation☆26Dec 19, 2022Updated 3 years ago
- Evaluate Multimodal LLMs as Embodied Agents☆57Feb 14, 2025Updated last year
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆24Jun 13, 2024Updated last year
- [CVPR 2021] Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection☆27Jul 13, 2022Updated 3 years ago
- ☆27Mar 21, 2024Updated last year
- ☆27Jul 6, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆33Dec 23, 2025Updated 2 months ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆32Jul 16, 2025Updated 7 months ago
- ☆83May 6, 2025Updated 10 months ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 2 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- Linux distribution for space-grade robotics on the BeagleV-Fire RISC-V platform + FPGA support☆21Dec 24, 2025Updated 2 months ago
- ☆43Jul 9, 2025Updated 7 months ago
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.☆37Apr 17, 2023Updated 2 years ago
- Official Implementation of DiffCLIP: Differential Attention Meets CLIP☆53Mar 12, 2025Updated 11 months ago
- Code for our paper "Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers"☆36Jan 27, 2026Updated last month
- A Google Chrome Extension that replaces the official New Tab page with a beautiful to-do list.☆12Mar 7, 2018Updated 8 years ago
- Official implementation of "Attention-aware semantic communications for collaborative inference" (IEEE IoTJ 2024)☆13Jan 22, 2026Updated last month
- Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024☆43Dec 7, 2024Updated last year
- Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis (BMVC2023)☆12Jun 9, 2023Updated 2 years ago
- ☆10Nov 17, 2022Updated 3 years ago
- Disable YubiKey output on MacOS without a modifier key pressed☆10Aug 10, 2022Updated 3 years ago
- ECG analysis to classify anterior myocardial infarction cases.☆10May 17, 2017Updated 8 years ago
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)☆39Mar 30, 2021Updated 4 years ago
- ☆87Mar 4, 2024Updated 2 years ago