yifeisu / FELALinks
FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 2025.
☆34Updated 9 months ago
Alternatives and similar repositories for FELA
Users that are interested in FELA are comparing it to the libraries listed below
Sorting:
- AI2-THOR Data Collection Tool Based On Keyboard Interaction☆53Updated last year
- A collection of URDF model used in Pybullet☆35Updated 11 months ago
- The official repository of C-CoTTA: Controllable Continual Test-Time Adaptation☆10Updated last year
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆120Updated last year
- ☆27Updated last month
- ☆79Updated 10 months ago
- [ICRA 2025]AVD2: Accident Video Diffusion for Accident Video Description☆85Updated 4 months ago
- ☆89Updated last year
- RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations☆38Updated 10 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆119Updated last year
- [ECCV'24] ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation☆43Updated 7 months ago
- ☆14Updated last year
- Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream [CVPR2024]☆65Updated last year
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆34Updated last year
- [NeurIPS' 24] Official implementation of the paper "Cloud Object Detector Adaptation by Integrating Different Source Knowledge"☆37Updated 6 months ago
- The official project website of "Augmentation-free Dense Contrastive Distillation for Efficient Semantic Segmentation" (Af-DCD for short,…☆17Updated last year
- Weakly supverised individual counting☆31Updated last year
- The official generation code and toolkits of VDW dataset (ICCV 2023)☆35Updated last year
- The implementation of PLU☆14Updated last year
- Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation☆110Updated last month
- A Strong Tracking Framework for 3D SOT on LiDAR Point Clouds☆77Updated 3 months ago
- A Unified Baseline Tracker for Multimodal Single and Multiple Object Tracking☆49Updated 11 months ago
- Official implementation of "Generating images with 3D annotations using diffusion models".☆47Updated last year
- When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning☆12Updated last year
- ☆26Updated last year
- ☆33Updated 7 months ago
- ☆11Updated last year
- ✨✨latest advancements in VLA models(VIsion Language Action)☆85Updated 5 months ago
- TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding [ACM MM'21]☆21Updated 3 years ago
- [CVPR 2024] Interactive continual learning: Fast and slow thinking☆102Updated last year