ByZ0e / AI2Thor_keyboard_player
AI2-THOR Data Collection Tool Based On Keyboard Interaction
☆46Updated 6 months ago
Alternatives and similar repositories for AI2Thor_keyboard_player:
Users that are interested in AI2Thor_keyboard_player are comparing it to the libraries listed below
- FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 25.☆21Updated last month
- ☆73Updated 2 months ago
- Official implementation of "Generating images with 3D annotations using diffusion models".☆46Updated 4 months ago
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆35Updated 8 months ago
- ☆29Updated last year
- MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.☆34Updated last year
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆28Updated this week
- ☆20Updated 3 months ago
- TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding [ACM MM'21]☆23Updated 2 years ago
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 5 months ago
- An open-source library with a powerful Contrastive Language-and-Motion (CLaM) pre-training evaluator☆88Updated 5 months ago
- Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream [CVPR2024]☆64Updated 10 months ago
- ☆15Updated 3 months ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆87Updated last year
- Weakly supverised individual counting☆28Updated 5 months ago
- NWPU足基 ATOM_LINKER 唐天扬负责 硬件组☆39Updated 3 years ago
- Domain Prompt Learning with Quaternion Networks (CVPR2024 Highlight)☆75Updated 3 weeks ago
- [ECCV'24] ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation☆37Updated last month
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆36Updated 7 months ago
- ☆9Updated 3 weeks ago
- ☆36Updated 11 months ago
- ☆59Updated 8 months ago
- A PyTorch implementation for Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks☆35Updated 4 years ago
- ☆40Updated 3 months ago
- ☆90Updated 8 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆101Updated 9 months ago
- A collection of URDF model used in Pybullet☆35Updated 3 months ago
- When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning☆11Updated 6 months ago
- Official Implementation Code of Our Paper "UniVST: A Unified Framework for Training-free Localized Video Style Transfer"☆9Updated 2 weeks ago