yifeisu / FELALinks
FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 25.
☆33Updated 6 months ago
Alternatives and similar repositories for FELA
Users that are interested in FELA are comparing it to the libraries listed below
Sorting:
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆115Updated 9 months ago
- AI2-THOR Data Collection Tool Based On Keyboard Interaction☆51Updated last year
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆116Updated last year
- [ECCV'24] ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation☆39Updated 4 months ago
- [ICRA 2025]AVD2: Accident Video Diffusion for Accident Video Description☆82Updated last month
- Weakly supverised individual counting☆29Updated 10 months ago
- ☆21Updated 8 months ago
- The official repository of C-CoTTA: Controllable Continual Test-Time Adaptation☆9Updated last year
- ☆80Updated 7 months ago
- Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation☆63Updated last month
- The official generation code and toolkits of VDW dataset (ICCV 2023)☆35Updated 11 months ago
- ☆89Updated last year
- Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream [CVPR2024]☆66Updated last year
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆90Updated 2 years ago
- [IJCV 2024] RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations☆36Updated 8 months ago
- ☆54Updated last month
- ☆61Updated 2 years ago
- ☆24Updated last year
- ☆13Updated last year
- ☆11Updated last year
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 5 months ago
- A collection of URDF model used in Pybullet☆36Updated 8 months ago
- A Strong Tracking Framework for 3D SOT on LiDAR Point Clouds☆74Updated last month
- MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.☆34Updated last year
- [NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild☆43Updated 5 months ago
- A Unified Baseline Tracker for Multimodal Single and Multiple Object Tracking☆47Updated 8 months ago
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆22Updated 3 years ago
- [Neurips 2023] dynpoint: dynamic neural point for view synthesis☆52Updated last year
- [CVPR2025] STCOcc: Sparse Spatial-Temporal Cascade Renovation for 3D Occupancy and Scene Flow Prediction☆55Updated last month
- A comprehensive collection of resources focused on addressing and understanding hallucination phenomena in MLLMs.☆34Updated last year