mahtabbigverdi / Aurora-perceptionView external linksLinks
☆46Sep 8, 2025Updated 5 months ago
Alternatives and similar repositories for Aurora-perception
Users that are interested in Aurora-perception are comparing it to the libraries listed below
Sorting:
- Spatial Aptitude Training for Multimodal Langauge Models☆24Feb 8, 2026Updated last week
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆21Nov 18, 2025Updated 2 months ago
- ☆16Sep 25, 2025Updated 4 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 3 weeks ago
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆18Oct 24, 2024Updated last year
- code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"☆19Mar 10, 2025Updated 11 months ago
- [CVPR 2025] Program synthesis for 3D spatial reasoning☆56Jun 16, 2025Updated 8 months ago
- ☆37Jan 23, 2026Updated 3 weeks ago
- ☆31Feb 7, 2026Updated last week
- STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?☆36Jan 12, 2026Updated last month
- [ICLR'25] Reconstructive Visual Instruction Tuning☆135Apr 9, 2025Updated 10 months ago
- PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Larg…☆39Dec 5, 2025Updated 2 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding [ICML 2025]]☆45Jul 22, 2025Updated 6 months ago
- [ECCV 2024] HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning☆40Feb 12, 2025Updated last year
- [CVPR 2025] PICO: Reconstructing 3D People In Contact with Objects☆60Sep 23, 2025Updated 4 months ago
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- Official Reimplementation of Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips (DiffHOI, ICCV23) https://judyye.g…☆37Sep 12, 2023Updated 2 years ago
- ☆49Nov 28, 2024Updated last year
- Code for the Behavior Retrieval Paper☆36Jul 24, 2023Updated 2 years ago
- Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)☆37Jan 1, 2024Updated 2 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆48Sep 13, 2024Updated last year
- Normalization Matters in Weakly Supervised Object Localization (ICCV 2021)☆11Oct 24, 2021Updated 4 years ago
- ☆20Sep 5, 2025Updated 5 months ago
- main augmentation script for real world robot dataset.☆39May 18, 2023Updated 2 years ago
- A Python package that provides evaluation and visualization tools for the HO-Cap dataset☆47Mar 22, 2025Updated 10 months ago
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology☆73Jan 26, 2026Updated 3 weeks ago
- Official repository for Fourier model that can generate periodic signals☆10Mar 10, 2022Updated 3 years ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆23Sep 21, 2025Updated 4 months ago
- ☆10May 12, 2023Updated 2 years ago
- ☆15Sep 11, 2025Updated 5 months ago
- Computation of binomial confidence intervals that achieve exact coverage.☆14Apr 23, 2025Updated 9 months ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 8 months ago
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆12Oct 13, 2023Updated 2 years ago
- ☆10Nov 15, 2022Updated 3 years ago
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 8 months ago
- ☆13Jul 22, 2022Updated 3 years ago
- Minimal codes for "Task-Oriented Dexterous Hand Pose Synthesis Using Differentiable Grasp Wrench Boundary Estimator [IROS 2024]"☆15Feb 12, 2025Updated last year
- Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'☆10Mar 11, 2024Updated last year