Benchmarking Landsat-8 Collection 2 Level-2 Land Surface Temperature Accuracy Using SURFRAD Stations: Effects of Seasonality and Atmospheric Water Vapor
Almustafa AbdElkader Ayek, Mohannad Ali Loho, Nasser Ibrahem, Afnan Abdullah Alturki, Youssef M. Youssef, Mayada Abdelkader AbdelazizLand Surface Temperature (LST) is essential for climate monitoring, drought assessment, and urban heat analysis. Despite its importance, the Landsat-8 Collection 2 Level-2 (C2L2) LST product has not been rigorously validated using ground measurements—a critical gap this study addresses. We present the first comprehensive accuracy assessment using 382 coincident satellite–ground observations collected from seven Surface Radiation Budget Network (SURFRAD) stations distributed across diverse climatic regions of the United States during the period 2023–2025. The validation results indicate strong overall agreement between satellite-derived and ground-measured temperatures, yielding an RMSE of 4.20 °C, a coefficient of determination (R2) of 0.91, and a Pearson correlation coefficient (r) of 0.98. These statistics demonstrate the high reliability of the C2L2 LST product across a wide range of environmental conditions. Nevertheless, a systematic warm bias of 1.75 °C was observed, indicating a tendency toward temperature overestimation. Model performance exhibited pronounced seasonal variability. The highest accuracy was achieved during winter conditions (RMSE = 2.17 °C; r = 0.99), whereas performance declined considerably during summer months (RMSE = 5.84 °C; r = 0.91). Analysis of atmospheric water vapor content revealed significant associations with retrieval errors at high-elevation and arid locations, particularly at FPK (r = 0.78) and DRA (r = 0.75), based on 106 matched observations. These relationships provide important insight into the atmospheric factors contributing to seasonal variations in retrieval accuracy. Temperature-dependent analyses further demonstrated that retrieval uncertainty increases with surface temperature. Performance progressively deteriorated from cooler to warmer thermal regimes, with RMSE values increasing from approximately 2.05 °C for temperatures below 20 °C to 5.71 °C for temperatures exceeding 40 °C. Spatial evaluation also revealed substantial differences among stations. Relatively homogeneous, low-elevation sites exhibited superior performance (GWN: RMSE = 2.60 °C; SXF: RMSE = 2.55 °C), whereas stations located in mountainous or topographically complex environments showed reduced accuracy (TBL: RMSE = 5.14 °C; FPK: RMSE = 5.62 °C). These outcomes emphasize the influence of terrain complexity and atmospheric heterogeneity on LST retrieval performance. Overall, this study establishes the first comprehensive benchmark for evaluating the reliability of Landsat-8 C2L2 LST products. The results provide valuable guidance for their application in climate research, precision agriculture, hydrological modeling, and environmental monitoring. Furthermore, the findings identify specific environmental conditions requiring enhanced validation efforts and suggest opportunities for future algorithm refinement through improved atmospheric correction procedures and more accurate surface emissivity characterization.