yifeisu / FELA
FELA: Learning Fine-Grained Alignment for Aerial Vision-Dialog Navigation, AAAI 25.
☆23Updated 3 months ago
Alternatives and similar repositories for FELA:
Users who are interested in FELA are comparing it to the repositories listed below.
- ☆19Updated 5 months ago
- The official repository for AAAI 2025 paper: Surgical Workflow Recognition and Blocking Effectiveness Detection in Laparoscopic Liver Res…☆25Updated 2 weeks ago
- MAPLE: Masked Pseudo-Labeling autoEncoder for Semi-supervised Point Cloud Action Recognition.☆33Updated last year
- Qt-based implementation of the vehicle panoramic stitching system.☆15Updated 10 months ago
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆111Updated 5 months ago
- ☆11Updated 6 months ago
- Camouflaged Object Detection☆10Updated 2 weeks ago
- The implementation of PLU☆14Updated 7 months ago
- AI2-THOR Data Collection Tool Based On Keyboard Interaction☆49Updated 9 months ago
- ☆29Updated 2 years ago
- Weakly supervised individual counting☆28Updated 7 months ago
- A C++ version of the simplified message queue component is implemented based on RabbitMQ. In order to learn RabbitMQ, this project encomp…☆55Updated 7 months ago
- [TMLR 2022] DHA: End-to-End Joint Optimization of Data Augmentation Policy, Hyper-parameter and Architecture☆31Updated 5 months ago
- ROSE: Robust Cross Supervision with Neighborhood Mining for Source-free Graph Domain Adaptation☆19Updated 5 months ago
- [IROS 2024] SCANet: Correcting LEGO Assembly Errors with Self-Correct Assembly Network (FINALIST BEST APPLICATION PAPER)☆25Updated 5 months ago
- ☆17Updated 5 months ago
- High-Speed Spiking Recognition (HSSR)☆9Updated 11 months ago
- An official Project related to Paper "Perceiving Ambiguity and Semantics without Recognition: An Efficient and Effective Ambiguous Scene …☆21Updated last year
- ☆13Updated 10 months ago
- ☆7Updated 4 months ago
- ☆32Updated 2 years ago
- TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding [ACM MM'21]☆23Updated 2 years ago
- This project implements a pipeline communication system framework based on dual thread pools from scratch, which is essentially communica…☆19Updated 11 months ago
- A PyTorch implementation for Temporal Textual Localization in Video via Adversarial Bi-Directional Interaction Networks☆39Updated 4 years ago
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated 9 months ago
- [AAAI 2025] Code for paper: Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆31Updated 2 months ago
- The official repository of C-CoTTA: Controllable Continual Test-Time Adaptation☆10Updated 9 months ago
- ☆10Updated last year
- ☆27Updated 6 months ago
- ☆40Updated 6 months ago