1e12Leon/UEMM-Air

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/1e12Leon/UEMM-Air)

1e12Leon / UEMM-Air

[ACM MM 25] Official repo of "UEMM-Air: Enable UAVs to Undertake More Multi-modal Tasks"

☆37

Alternatives and similar repositories for UEMM-Air

Users that are interested in UEMM-Air are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

1e12Leon / AirNavigation
View on GitHub
[AAAI2026 demo] Official repo of “AirNavigation: Let UAV Navigation Tells Its Own Story”
☆22Nov 1, 2025Updated 8 months ago
yijunshens / StateFactory
View on GitHub
Official implementation of "Reward Prediction with Factorized World States"
☆20Mar 11, 2026Updated 4 months ago
1e12Leon / RemoteSAM
View on GitHub
[ACM MM 25] Official repo of "RemoteSAM: Towards Segment Anything for Earth Observation"
☆243Jan 4, 2026Updated 6 months ago
earth-insights / Advanced-Earth-Observation
View on GitHub
Paper List on Earth Observation in the Foundation Model Era
☆31Jun 15, 2026Updated last month
like413 / OPT-RSVG
View on GitHub
[TGRS 2024] Language-Guided Progressive Attention for Visual Grounding in Remote Sensing Images.
☆56Jun 10, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
mangoggul / YOLO-MultiModal
View on GitHub
☆13Oct 8, 2024Updated last year
ZhanYang-nwpu / Awesome-Multimodal-Large-Language-Models-for-UAV-Vision-Language-Perception
View on GitHub
UAV-MLLMs
☆29Apr 7, 2026Updated 3 months ago
DEFENSE-SEU / RobustFlow
View on GitHub
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
☆238Oct 19, 2025Updated 9 months ago
EmbodiedCity / UrbanVideo-Bench.code
View on GitHub
[ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban…
☆31Jul 15, 2025Updated last year
irfan112 / yowov3-multistreaming-inferencing
View on GitHub
A real-time inferencing of multistreaming YOWOv3(Spatio Temporal Action Detection task) using (UCF101-24) dataset. The repo is extension …
☆26May 15, 2026Updated 2 months ago
bia006 / DARTS
View on GitHub
☆13Jul 19, 2023Updated 3 years ago
Bili-Sakura / awesome-remote-sensing-visual-generative-models
View on GitHub
A curated list of awesome remote sensing visual generative models, papers, datasets, and resources. This repository focuses exclusively o…
☆20Jul 8, 2026Updated last week
XiangTodayEatsWhat / EagleVision
View on GitHub
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing
☆26May 29, 2025Updated last year
zhanghengdev / CFR
View on GitHub
[ICIP 2020]"Multispectral Fusion for Object Detection with Cyclic Fuse-and-Refine Blocks"
☆14Oct 6, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Sautenich / Awesome-Aerial-Vision-Language-Navigation
View on GitHub
The new spin-off of Visual Language Navigation.
☆61Jun 25, 2026Updated 3 weeks ago
WWLoveTransfer / SLSA-DA
View on GitHub
Sparsely-Labeled Source Assisted Domain Adaptation
☆12May 2, 2020Updated 6 years ago
supersupercong / PromptRestorer
View on GitHub
[NeurIPS23] PromptRestorer: A Prompting Image Restoration Method with Degradation Perception
☆16Aug 4, 2024Updated last year
VisionXLab / avi-math
View on GitHub
[ISPRS'25] Multimodal Mathematical Reasoning Embedded in Aerial Vehicle Imagery: Benchmarking, Analysis, and Exploration
☆18Jan 4, 2026Updated 6 months ago
yaoyueduzhen / UJDA
View on GitHub
Unsupervised domain adaptation with unified joint distribution alignment
☆18Jul 1, 2025Updated last year
MiliLab / Text-Before-Vision
View on GitHub
[ICML 2026] Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
☆16Mar 13, 2026Updated 4 months ago
yaoyueduzhen / HomOTL-ODDM
View on GitHub
Homogeneous Online Transfer Learning with Online Distribution Discrepancy Minimization
☆14Feb 11, 2020Updated 6 years ago
zzj-dyj / CLF-Net
View on GitHub
☆21Sep 9, 2022Updated 3 years ago
larics / UAV-Eagle
View on GitHub
Annotated dataset of quadrotor Eagle for object detection of UAVs
☆15Apr 4, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
pamaforce / TJU_vfmc_ticket
View on GitHub
基于抓包 API 接口实现的天津大学双校区七日内羽毛球、乒乓球、篮球等场馆预约流程程序化过程，仅供学习交流使用。
☆15Jul 2, 2024Updated 2 years ago
nopride03 / AirNav
View on GitHub
☆29Jun 10, 2026Updated last month
SDret / Pedestrian-Attribute-Recognition-as-Label-balanced-Multi-label-Learning
View on GitHub
Official pytorch implementation of the ICML2024 main conference paper: Pedestrian Attribute Recognition as Label-balanced Multi-label Lea…
☆13Jul 22, 2024Updated last year
xyh2016 / MTLF
View on GitHub
A Unified Framework for Metric Transfer Learning
☆17Oct 28, 2017Updated 8 years ago
DataXujing / YOLOv12-TensorRT
View on GitHub
YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现
☆14Mar 5, 2025Updated last year
fengkaibit / UAV-ROD
View on GitHub
A benchmark of UAV-ROD dataset.
☆54Jun 1, 2021Updated 5 years ago
jessemorris / multi_robot_perception
View on GitHub
Ma thesis @usyd
☆10May 14, 2021Updated 5 years ago
Event-AHU / Research-Pathways-for-Newcomers
View on GitHub
新生入学必读材料
☆15Jun 2, 2026Updated last month
NeoGeographyToolkit / MultipleViewPipeline
View on GitHub
MVP offers a method for extracting 3D that utilizes all views of the terrain, not just pair-wise combinations.
☆15Jun 13, 2013Updated 13 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kaist-ami / SoundBrush
View on GitHub
☆14Dec 8, 2025Updated 7 months ago
Yxxxb / LAVT-RS
View on GitHub
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆26Jan 21, 2025Updated last year
amber0309 / Multidomain-Discriminant-Analysis
View on GitHub
Code for UAI 2019 paper "Domain Generalization via Multidomain Discriminant Analysis"
☆14Aug 28, 2019Updated 6 years ago
icey-zhang / E2E-MFD-HOD
View on GitHub
E2E-MFD-HOD
☆16Dec 23, 2024Updated last year
zc2023 / TokenHPE
View on GitHub
(CVPR 2023) TokenHPE: Learning Orientation Tokens for Efficient Head Pose Estimation via Transformers
☆14Oct 29, 2023Updated 2 years ago
zhangyikaii / Model-Spider
View on GitHub
The code repository for "Model Spider: Learning to Rank Pre-Trained Models Efficiently"
☆21Apr 12, 2024Updated 2 years ago
ZYangChen / DC-SatMVS
View on GitHub
[IEEE JSTARS] The official implementation of "Surface Depth Estimation from Multi-view Stereo Satellite Images with Distribution Contrast…
☆11May 16, 2025Updated last year