A Holistic Embodied Cognition Benchmark
☆19Apr 3, 2025Updated last year
Alternatives and similar repositories for ECBench
Users that are interested in ECBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Oct 28, 2025Updated 8 months ago
- [CVPR 2025] Beacon3D: Object-centric Evaluation for 3D Grounding-QA☆28Nov 25, 2025Updated 7 months ago
- ☆26Jun 5, 2025Updated last year
- Neural network methods for multimodal map reconstruction and their usage for robot navigation and control☆16Jun 11, 2024Updated 2 years ago
- Dual Adaptive Thinking (DAT) for object navigation☆14Sep 10, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning☆43Mar 2, 2026Updated 3 months ago
- [AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos☆38May 27, 2025Updated last year
- Unbiased Directed Object Attention Graph for Object Navigation☆15Nov 28, 2022Updated 3 years ago
- The code for PixelRefer & VideoRefer☆351Nov 16, 2025Updated 7 months ago
- ☆12Dec 6, 2024Updated last year
- Code for A Dual Semantic-Aware Recurrent Global-Adaptive Network For Vision-and-Language Navigation☆17Apr 25, 2024Updated 2 years ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆13Jun 7, 2025Updated last year
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆54Jul 11, 2025Updated 11 months ago
- [ECCV 2024 (Oral)] Towards Scene Graph Anticipation☆19Jun 17, 2026Updated 2 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- auto star for repo lists☆10Aug 26, 2023Updated 2 years ago
- [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆196Mar 17, 2025Updated last year
- A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.☆20Feb 15, 2024Updated 2 years ago
- 一个支持跨模态大语言模型的webui. A chatbot webui that supports various multi-modal large language models☆11May 8, 2023Updated 3 years ago
- ☆41Nov 8, 2024Updated last year
- Egocentric Video Understanding Dataset (EVUD)☆34Jul 4, 2024Updated last year
- The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025☆280May 26, 2025Updated last year
- ☆28Aug 19, 2025Updated 10 months ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆117Jul 9, 2025Updated 11 months ago
- ☆44Oct 7, 2024Updated last year
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆23May 8, 2026Updated last month
- ☆37Mar 22, 2024Updated 2 years ago
- Official PyTorch code of GroundVQA (CVPR'24)☆63Sep 13, 2024Updated last year
- [ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds☆97Jul 4, 2024Updated last year
- Recent Advances on MLLM's Reasoning Ability☆26Apr 11, 2025Updated last year
- The code for "Label-efficient Segmentation via Affinity Propagation". [NeurIPS2023]☆67Mar 4, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Accepted at IJCAI-2022☆11Sep 3, 2022Updated 3 years ago
- This is the offical repository of LLAVIDAL☆25Oct 4, 2025Updated 8 months ago
- [Main Conference @ EACL'26] [Workshop @ NeurIPS'24] 🎞️ LVNet.☆44Feb 10, 2026Updated 4 months ago
- A paper list of panoptic segmentation using deep learning☆12Sep 5, 2021Updated 4 years ago
- Code for OctoNav-Bench and OctoNav-R1☆73Apr 29, 2026Updated 2 months ago
- [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding☆129Dec 10, 2024Updated last year
- This is the official repo of MLLM-CL.☆66May 16, 2026Updated last month