The official code and data for paper "VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI"
☆16Mar 25, 2025Updated 11 months ago
Alternatives and similar repositories for VidEgoThink
Users that are interested in VidEgoThink are comparing it to the libraries listed below
Sorting:
- Advanced Embodied Intelligence Brain Model☆33Nov 5, 2025Updated 3 months ago
- ☆13May 13, 2025Updated 9 months ago
- FGLA: Fast Generation-Based Gradient Leakage Attacks against Highly Compressed Gradients☆14Dec 20, 2022Updated 3 years ago
- ☆18Aug 7, 2025Updated 6 months ago
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks☆12Sep 1, 2023Updated 2 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 5 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆18Oct 19, 2025Updated 4 months ago
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆12Jan 22, 2025Updated last year
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- ☆13Jul 22, 2022Updated 3 years ago
- ☆22Jan 12, 2026Updated last month
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
- An undergraduate thesis project.☆11Jul 13, 2024Updated last year
- ☆15May 13, 2024Updated last year
- ☆11Aug 29, 2022Updated 3 years ago
- [EMNLP'22] Weakly-Supervised Temporal Article Grounding☆14Nov 25, 2023Updated 2 years ago
- Source code for our paper: "LoGU: Long-form Generation with Uncertainty Expressions".☆16May 27, 2025Updated 9 months ago
- ☆16Sep 25, 2025Updated 5 months ago
- matlab work☆13Jun 10, 2011Updated 14 years ago
- Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.☆11Jul 28, 2022Updated 3 years ago
- ☆13Nov 11, 2022Updated 3 years ago
- ☆13Jan 14, 2022Updated 4 years ago
- 此代码用于RoboMaster AI Challenge 2020的平面仿真☆10May 10, 2020Updated 5 years ago
- ☆19Apr 5, 2024Updated last year
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆14Jun 7, 2025Updated 8 months ago
- ☆12Mar 12, 2023Updated 2 years ago
- This is the official implementation of the Concept Discovery Models paper.☆15Aug 27, 2023Updated 2 years ago
- ☆13Feb 26, 2025Updated last year
- Code Release for `Learning Answer Embeddings for Visual Question Answering`. (CVPR 2018)☆13Apr 6, 2019Updated 6 years ago
- Costa Rican license plate dataset generator☆13Oct 14, 2019Updated 6 years ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆82Nov 27, 2025Updated 3 months ago
- 检查:1)基金各个阶段的超额收益。2)基金经理是否变更☆14Jul 30, 2024Updated last year
- LINe: Out-of-Distribution Detection by Leveraging Important Neurons (CVPR 2023)☆13Jun 13, 2023Updated 2 years ago
- CVPR2025☆21Aug 16, 2025Updated 6 months ago
- IRIT experiments on the STAC corpus☆16Mar 19, 2018Updated 7 years ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated 2 months ago
- ☆18Jun 20, 2025Updated 8 months ago
- ☆17Jan 26, 2024Updated 2 years ago
- Official codebase for "TAU-106K: A New Dataset for Comprehensive Understanding of Traffic Accident"☆19Apr 19, 2025Updated 10 months ago