[ACM MM 25] Official repo of "UEMM-Air: Enable UAVs to Undertake More Multi-modal Tasks"
☆33Aug 20, 2025Updated 7 months ago
Alternatives and similar repositories for UEMM-Air
Users that are interested in UEMM-Air are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [中国图象图形学报&ChinaMM2025] 非空间配准多模态目标检测决策融合策略☆39Jul 16, 2025Updated 8 months ago
- Official Repo of "Code2MCP: Transforming Code Repositories into MCP Services", Scaling Environments for Agents Workshop @ NeurIPS 2025☆115Nov 4, 2025Updated 4 months ago
- [ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…☆26Jul 15, 2025Updated 8 months ago
- Repository containing the code and tools required to build & smulate on the Syndrone dataset.☆42Oct 15, 2025Updated 5 months ago
- [TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.☆52Jun 10, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"☆233Oct 19, 2025Updated 5 months ago
- ☆12Oct 8, 2024Updated last year
- ☆18Jan 5, 2026Updated 2 months ago
- Cross-Modality Attentive Feature Fusion for Object Detection in Multispectral Remote Sensing Imagery☆16Oct 7, 2022Updated 3 years ago
- [ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"☆218Jan 4, 2026Updated 2 months ago
- [CVPR2026 🌟] The first attempt to Marine Open Vocabulary Instance Segmentation☆45Mar 17, 2026Updated last week
- [CVPR 2026] ZoomEarth: Active Perception for Ultra-High-Resolution Geospatial Vision-Language Tasks☆32Mar 10, 2026Updated 2 weeks ago
- Paper List on Earth Observation in the Foundation Model Era☆30Mar 15, 2026Updated 2 weeks ago
- Testbed for multimodal retrieval augmented generation techniques with FiftyOne, LlamaIndex, and Milvus☆21Aug 9, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICCV 2025] UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoing and Understanding.☆71Feb 28, 2026Updated last month
- Code of the paper "Unseen from Seen: Rewriting Observation-Instruction Using Foundation Models for Augmenting Vision-Language Navigation"…☆17Nov 11, 2025Updated 4 months ago
- [ICLR'26] OF-Diff: Object Fidelity Diffusion for Remote Sensing Image Generation☆30Feb 6, 2026Updated last month
- Annotated dataset of quadrotor Eagle for object detection of UAVs☆15Apr 4, 2022Updated 3 years ago
- [CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation☆25Jan 21, 2025Updated last year
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- Modified version of QPBO algorithm by Vladimir Kolmogorov for very large graphs.☆11Dec 14, 2018Updated 7 years ago
- [NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts☆19Oct 7, 2024Updated last year
- EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing☆23May 29, 2025Updated 10 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A benchmark of UAV-ROD dataset.☆54Jun 1, 2021Updated 4 years ago
- This is the official code of “Enhancing Nighttime UAV Tracking with Light Distribution Suppression”.☆21Dec 2, 2024Updated last year
- Graph Regularized Flow Attention Network for Video Animal Counting from Drones☆25Apr 28, 2024Updated last year
- A simple, elegant web tool that allows you to create custom RSS feeds for arXiv search queries. Stay up-to-date with the latest research …☆35Mar 21, 2026Updated last week
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆24Oct 24, 2025Updated 5 months ago
- Official implementation of Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts(IJCAI 2024)☆15Oct 16, 2024Updated last year
- ☆11Dec 8, 2021Updated 4 years ago
- Ma thesis @usyd☆10May 14, 2021Updated 4 years ago
- [SENSORS 2025] PicoSAM2 and PicoSAM3 are segmentation models running in-sensor on the Sony IMX500☆33Mar 13, 2026Updated 2 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- RAVEN: Resilient Aerial Navigation via Open-Set Semantic Memory and Behavior Adaptation☆32Updated this week
- FIWARE 401: IDM - Managing Users and Organizations☆10Jan 27, 2026Updated 2 months ago
- Calculating Disparity Maps using openCVs implemented algorithms.☆11May 5, 2017Updated 8 years ago
- FIWARE 104: Registering Context Providers☆11Jan 27, 2026Updated 2 months ago
- Integrate a DASH7 gateway with the ThingsBoard platform☆10Oct 10, 2018Updated 7 years ago
- ☆13Feb 23, 2023Updated 3 years ago
- This is the offical repository for "DetFusion: A Detection-driven Infrared and Visible Image Fusion Network" (ACM MM 2022).☆82Dec 10, 2022Updated 3 years ago