AI45Lab / Awesome-Trustworthy-Embodied-AILinks
☆64Updated this week
Alternatives and similar repositories for Awesome-Trustworthy-Embodied-AI
Users that are interested in Awesome-Trustworthy-Embodied-AI are comparing it to the libraries listed below
Sorting:
- A paper list for spatial reasoning☆142Updated 3 months ago
- ViewSpatial-Bench:Evaluating Multi-perspective Spatial Localization in Vision-Language Models☆60Updated 4 months ago
- MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning☆132Updated last year
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆189Updated 5 months ago
- ☆88Updated 2 months ago
- Official implementation of ECCV 2024 paper: Take A Step Back: Rethinking the Two Stages in Visual Reasoning☆15Updated 4 months ago
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆80Updated 2 months ago
- A curated list of awesome papers on dataset reduction, including dataset distillation (dataset condensation) and dataset pruning (coreset…☆58Updated 8 months ago
- ☆89Updated 2 months ago
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆18Updated last year
- A tiny paper rating web☆39Updated 6 months ago
- A python script for downloading huggingface datasets and models.☆20Updated 5 months ago
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆53Updated last month
- Provide .bst files for NeurIPS latex template☆48Updated 5 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆86Updated last year
- [ICML2025] The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆124Updated 11 months ago
- An example reproduction checklist for AAAI-26 submissions.☆106Updated 2 months ago
- A collection of vision foundation models unifying understanding and generation.☆55Updated 9 months ago
- Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.☆34Updated 3 months ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆264Updated 9 months ago
- ☆141Updated 7 months ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Updated 8 months ago
- Responsible Robotic Manipulation☆12Updated last month
- [NeurIPS 2025] MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆103Updated 2 weeks ago
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆54Updated last year
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆141Updated 4 months ago
- [CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key☆81Updated last week
- [ICLR'25] Reconstructive Visual Instruction Tuning☆119Updated 5 months ago
- [NeurIPS 2025] OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding☆59Updated last week
- Accepted by CVPR 2024☆38Updated last year