A Holistic Embodied Cognition Benchmark
☆18Apr 3, 2025Updated 10 months ago
Alternatives and similar repositories for ECBench
Users that are interested in ECBench are comparing it to the libraries listed below
Sorting:
- ☆45Jun 10, 2025Updated 8 months ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆32May 27, 2025Updated 9 months ago
- RGB-D fusion for two-hand reconstruction☆29Aug 6, 2024Updated last year
- A task sequencer framework for achieving a GPT-to-action system in robotics.☆17Mar 6, 2025Updated 11 months ago
- ☆37Nov 8, 2024Updated last year
- Egocentric Video Understanding Dataset (EVUD)☆33Jul 4, 2024Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆115Jul 9, 2025Updated 7 months ago
- ☆18Jan 2, 2026Updated last month
- 简繁汉字并非一一对应,在转换中往往要根据上下文判断具体如何转换,本js文件在网络上流传的简繁转换程序基础上加入许多判断条件让简繁转换更为完善。☆10Nov 13, 2015Updated 10 years ago
- 一个支持跨模态大语言模型的webui. A chatbot webui that supports various multi-modal large language models☆11May 8, 2023Updated 2 years ago
- [ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds☆96Jul 4, 2024Updated last year
- ☆43Oct 7, 2024Updated last year
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆52Jul 11, 2025Updated 7 months ago
- [CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".☆54May 25, 2025Updated 9 months ago
- ☆10Oct 17, 2022Updated 3 years ago
- 基于Action抓取必应每日超清壁纸展示&保存到分支☆11Updated this week
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- API Utility for TOR(The Onion ROUTER) such as requesting a new IP, or generating API password. Uses Network API for control☆12Feb 27, 2025Updated last year
- ☆10Oct 20, 2022Updated 3 years ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 3 months ago
- personal blog☆14Feb 23, 2024Updated 2 years ago
- [TMLR 2025] Reading List of Memory Augmented Multimodal Research, including multimodal context modeling, memory in vision and robotics, a…☆57Jan 17, 2026Updated last month
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆54Mar 9, 2025Updated 11 months ago
- ☆51Feb 5, 2025Updated last year
- ☆11Apr 2, 2024Updated last year
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- ☆12Jan 10, 2025Updated last year
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- Interpreting Chest X-rays Like a Radiologist: A Benchmark with Clinical Reasoning, release the dataset and the model weight☆13May 26, 2025Updated 9 months ago
- Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Finding]"☆15Aug 27, 2025Updated 6 months ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- Domain Adaptation and Adapters☆16Feb 28, 2023Updated 3 years ago
- [ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".☆13Feb 24, 2025Updated last year
- Accepted at IJCAI-2022☆11Sep 3, 2022Updated 3 years ago
- Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)☆54Apr 15, 2024Updated last year
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion☆56Jul 1, 2025Updated 8 months ago
- Sim-to-Real Domain Adaptation for Lane Detection and Classification in Autonomous Driving☆16Dec 2, 2024Updated last year
- code for promptCSE, emnlp 2022☆11Apr 10, 2023Updated 2 years ago
- AMR-parser. Code for EMNLP2019 paper "Core Semantic First: A Top-down Approach for AMR Parsing."☆11Feb 23, 2020Updated 6 years ago