yifeisu / FELA
FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 25.
☆28Updated 4 months ago
Alternatives and similar repositories for FELA:
Users that are interested in FELA are comparing it to the libraries listed below
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆103Updated 8 months ago
- AI2-THOR Data Collection Tool Based On Keyboard Interaction☆49Updated 10 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆112Updated last year
- The official repository of C-CoTTA: Controllable Continual Test-Time Adaptation☆10Updated 10 months ago
- [ECCV'24] ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation☆39Updated 2 months ago
- ☆90Updated last year
- [ICRA 2025]AVD2: Accident Video Diffusion for Accident Video Description☆79Updated 2 months ago
- Weakly supverised individual counting☆28Updated 9 months ago
- [IJCV 2024] RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations☆36Updated 6 months ago
- The official generation code and toolkits of VDW dataset (ICCV 2023)☆35Updated 10 months ago
- A collection of URDF model used in Pybullet☆36Updated 6 months ago
- MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.☆34Updated last year
- ☆80Updated 6 months ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆89Updated 2 years ago
- ☆10Updated last year
- ☆88Updated last month
- The implementation of PLU☆14Updated 8 months ago
- ☆13Updated last year
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆110Updated 7 months ago
- Language-to-4D Modeling Towards 6-DoF Tracking and Shape Reconstruction in 3D Point Cloud Stream [CVPR2024]☆66Updated last year
- A Strong Tracking Framework for 3D SOT on LiDAR Point Clouds☆66Updated 2 weeks ago
- TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding [ACM MM'21]☆23Updated 3 years ago
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 3 months ago
- ☆23Updated last year
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated 11 months ago
- ☆20Updated 6 months ago
- Camouflaged Object Detection☆15Updated 3 weeks ago
- The code and data samples of the paper "A Multi-Scale Matching Method for High Quality UAV-Based Geo- Localization"☆22Updated 2 years ago
- The official repository for AAAI 2025 paper: Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Res…☆26Updated 2 weeks ago
- [TPAMI] Locating and Counting Heads in Crowds With a Depth Prior☆22Updated 2 years ago