DOI: 10.3390/electronics15122720 ISSN: 2079-9292

KoSim-GL: A Large-Scale Simulation-Based Dataset for UAV Cross-View Geo-Localization in Korean Urban Environments

Heejin Ahn, Changhwan Lee, Sangwook Lee, HyeonJoong Wi, Insung Jang, Dong-Geol Choi

We propose KoSim-GL, a large-scale vision-based geo-localization dataset for drone positioning in GPS-denied environments. Geo-localization estimates a drone’s location by matching drone-view imagery against a geo-referenced satellite image database, offering a reliable alternative to GPS under conditions such as signal jamming, spoofing, or degradation in dense urban canyons. Although this task is challenging due to the domain gap between drone-view and satellite-view imagery, existing benchmarks are built predominantly around urban environments in the United States and China, leaving South Korea largely unrepresented, despite its distinctive landscape in which mountainous terrain coexists with dense high-rise districts and low-rise residential neighborhoods. To address this gap, we introduce KoSim-GL, constructed from drone-view images captured via an AirSim- and ROS-based flight simulator and satellite images collected through the Google Maps Tile API, covering the urban area of Daejeon, South Korea. Its key feature is a multi-view configuration that simultaneously captures five views, one nadir and four oblique, at each flight position across altitudes from 100 m to 600 m, enabling robust localization even in feature-sparse environments where nadir-only matching is prone to fail. In total, KoSim-GL comprises 2,450,315 drone images and 1704 satellite images. We further provide systematic comparisons against five existing benchmarks and baseline evaluations of ten representative geo-localization models under single- and multi-view settings. Experimental results show that the multi-view configuration substantially improves localization performance; for example, FSRA improves Recall@1 from 44.08% (single-view) to 65.37% (multi-view), a gain of 21.29 percentage points. The dataset is publicly available.

More from our Archive