MultimodalGeo/GeoText-1652

An official repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching

☆118

Alternatives and similar repositories for GeoText-1652

Users that are interested in GeoText-1652 are comparing it to the libraries listed below.

Oli-iver / Depth-BID
Official repo for ECCV 2024: Depth-Aware Blind Image Decomposition for Real-World Weather Recovery
☆13 · Mar 7, 2024 · Updated 2 years ago
Ruiyang-061X / LiSe
[ECCV'24] Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.
☆40 · Sep 3, 2024 · Updated last year
Shuyu-XJTU / SVTA
The official repo of "Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark"
☆21 · Jun 5, 2025 · Updated last year
Zeus1037 / SEED
SEED Dataset
☆29 · Jun 3, 2025 · Updated last year
chen742 / DCF
This is the official implementation of "Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation" (Accepted at AC…
☆13 · Aug 24, 2024 · Updated last year
yejy53 / CVG-Text
[ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.
☆72 · Dec 9, 2025 · Updated 7 months ago
Reza-Zhu / ACMMM23-Solution-MBEG
Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. Reza Zhu's Solution: MBEG
☆11 · May 17, 2024 · Updated 2 years ago
Ruiyang-061X / Awesome-MLLM-Uncertainty
✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).
☆59 · Apr 2, 2025 · Updated last year
HaoDot / Video2BEV-Open
Official Repo for ICCV25-Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization
☆44 · Jul 6, 2026 · Updated 2 weeks ago
C-water / SDPL_release
SDPL: Shifting-Dense Partition Learning for UAV-view Geo-localization
☆25 · Aug 17, 2025 · Updated 11 months ago
Reza-Zhu / SUES-200-Benchmark
SUES-200: A Multi-height Multi-scene Cross-view Image Benchmark Across Drone and Satellite
☆91 · Nov 6, 2024 · Updated last year
layumi / University1652-Baseline
ACM Multimedia2020 University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization annotates 1652 buildings in 72 …
☆670 · Jul 10, 2026 · Updated last week
layumi / UAVM2023
ACM MM Workshop on UAVs in Multimedia: Capturing the World from a New Perspective (UAVM 2023)
☆13 · Jul 4, 2026 · Updated 2 weeks ago
lingyuliu / Every-Painting-Awakened
🎨Official Repo for Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation
☆57 · Apr 10, 2025 · Updated last year
x-d-wang / Soft-Person-Reidentification-Network-Pruning-via-Blockwise-Adjacent-Filter-Decaying
☆16 · Jan 13, 2024 · Updated 2 years ago
Skyy93 / Sample4Geo
☆159 · Aug 8, 2025 · Updated 11 months ago
mode-str / crossview
This repository contains the dataset link and the code for our paper MCCG: A ConvNeXt-based Multiple-Classifier Method for Cross-view Geo…
☆32 · Jun 6, 2026 · Updated last month
Yux1angJi / GTA-UAV
[AAAI 2025 Oral🚁] Game4Loc: A UAV Geo-Localization Benchmark from Game Data
☆168 · Oct 20, 2025 · Updated 9 months ago
Texaser / MTN
Progressive Text-to-3D Generation for Automatic 3D Prototyping (ACM TOMM)
☆54 · Mar 14, 2026 · Updated 4 months ago
Ruiyang-061X / SketchThinker-R1
[ICLR'26] SketchThinker-R1: Towards Efficient Sketch-Style Reasoning in Large Multimodal Models
☆17 · Mar 26, 2026 · Updated 3 months ago
Ruiyang-061X / UA3D
[ICCV'25] "Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection".
☆26 · Jan 12, 2026 · Updated 6 months ago
yejy53 / EP-BEV
[ECCV 2024] The official implementation of the paper "Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network".
☆101 · Jul 8, 2025 · Updated last year
VisionXLab / DVGBench
[ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models
☆30 · Mar 24, 2026 · Updated 3 months ago
wtyhub / MuseNet
[Pattern Recognition'24] Pytorch implementation of Multiple-environment Self-adaptive Network for Aerial-view Geo-localization https://a…
☆47 · Jul 6, 2026 · Updated 2 weeks ago
JT-Sun / UAVReason
🚁 Can Vision-Language Models Think from the Sky? UAVReason for Aerial Reasoning and Generation
☆22 · Jul 11, 2026 · Updated last week
SummerpanKing / DAC
[TCSVT'24] Enhancing Cross-View Geo-Localization with Domain Alignment and Scene Consistency
☆35 · May 7, 2025 · Updated last year
cjl-2000 / ComplexUAV
☆26 · Sep 26, 2024 · Updated last year
Mabel0403 / CAMP
[🎉IEEE TGRS'24] The official code for paper "CAMP: A Cross-View Geo-Localization Method using Contrastive Attributes Mining and Position…
☆35 · Jul 11, 2025 · Updated last year
ZelongZeng / PLCD
[TMM 2022] The official code of IEEE Transactions on Multimedia paper "Geo-localization via ground-to-satellite cross-view image retrieval…
☆16 · Jun 24, 2024 · Updated 2 years ago
Collebt / EM-CVGL
Code of Learning Cross-view Visual Geo-localization without Ground Truth
☆11 · Feb 17, 2025 · Updated last year
Yux1angJi / MMGeo
[ICCV 2025] MMGeo: Multimodal Compositional Geo-Localization for UAVs
☆20 · Oct 20, 2025 · Updated 9 months ago
wangkunyu241 / SkyFind
This is the open-sourced link of the TPAMI 2026 paper "SkyFind: A Large-Scale Benchmark Unveiling Referring Expression Comprehension for …
☆31 · May 27, 2026 · Updated last month
yifeisu / TG-GAT
Target-Grounded Graph-Aware Transformer for Aerial Vision-and-Dialog Navigation, AVDN Challenge, ICCV CLVL 2023.
☆21 · Jan 2, 2024 · Updated 2 years ago
yeyimilk / LLMGeo
LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild
☆16 · Oct 31, 2024 · Updated last year
IntelliSensing / UAV-VisLoc
UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization
☆277 · May 22, 2024 · Updated 2 years ago
MitsuiChen14 / DGTRS
☆32 · Jun 10, 2026 · Updated last month
lingyuliu / DQ_Transformer
Official Repo for Look, Compare and Draw: Differential Query Transformer for Automatic Oil Painting
☆17 · Mar 31, 2026 · Updated 3 months ago
rstanjieyi / GeoAI-in-NeurIPS-2024
A collection of papers related to Geo-spatial Information Science in NeurIPS 2024.
☆56 · Jan 5, 2025 · Updated last year
xuyingxiao / NBR-Net
NBR-Net: A Non-rigid Bi-directional Registration Network for Multi-temporal Remote Sensing Images
☆24 · Aug 17, 2022 · Updated 3 years ago