☆46Feb 18, 2026Updated 2 weeks ago
Alternatives and similar repositories for Aurora-perception
Users that are interested in Aurora-perception are comparing it to the libraries listed below
Sorting:
- Spatial Aptitude Training for Multimodal Langauge Models☆24Feb 8, 2026Updated last month
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆22Nov 18, 2025Updated 3 months ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆14Jun 7, 2025Updated 9 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated last month
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"☆19Mar 10, 2025Updated 11 months ago
- ☆37Jan 23, 2026Updated last month
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆58Jun 16, 2025Updated 8 months ago
- ☆32Feb 7, 2026Updated last month
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆38Jan 12, 2026Updated last month
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆35Feb 13, 2025Updated last year
- [ICLR'25] Reconstructive Visual Instruction Tuning☆135Apr 9, 2025Updated 11 months ago
- [ECCV 2024] Language-Driven 6-DoF Grasp Detection Using Negative Prompt Guidance☆41Sep 7, 2024Updated last year
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]☆45Jul 22, 2025Updated 7 months ago
- 🌴[CVPR 2024] OakInk2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion☆92Aug 11, 2025Updated 6 months ago
- PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg…☆40Mar 2, 2026Updated last week
- [ECCV 2024] HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning☆39Feb 12, 2025Updated last year
- [CVPR 2025] PICO: Reconstructing 3D People In Contact with Objects☆64Sep 23, 2025Updated 5 months ago
- ☆69Nov 5, 2025Updated 4 months ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- Code for the Behavior Retrieval Paper☆36Jul 24, 2023Updated 2 years ago
- ☆49Nov 28, 2024Updated last year
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification☆10May 30, 2024Updated last year
- ViViDex implementation under the SAPIEN simulator, ICRA 2025☆17Apr 9, 2025Updated 11 months ago
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Oct 24, 2021Updated 4 years ago
- This is a project on visual spatial reasoning tasks-SIBench☆25Jan 12, 2026Updated last month
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 7 months ago
- ☆20Sep 5, 2025Updated 6 months ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- main augmentation script for real world robot dataset.☆39May 18, 2023Updated 2 years ago
- A Python package that provides evaluation and visualization tools for the HO-Cap dataset☆48Mar 22, 2025Updated 11 months ago
- Code for "BECoTTA: Input-dependent Online Blending of Experts for Continual Test-time Adaptation [ICML2024]".☆46Jun 16, 2024Updated last year
- ☆19Jan 26, 2025Updated last year
- ☆10Nov 23, 2023Updated 2 years ago
- ☆16Sep 1, 2025Updated 6 months ago
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 8 months ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆16Jan 12, 2026Updated last month
- ☆13Jul 22, 2022Updated 3 years ago