A Machine-Learning-Based Study on All-Day Cloud Classification Using Himawari-8 Infrared Data
Yashuai Fu, Xiaofei Mi, Zhihua Han, Wenhao Zhang, Qiyue Liu, Xingfa Gu, Tao Yu- General Earth and Planetary Sciences
Clouds are diverse and complex, making accurate cloud type identification vital in improving the accuracy of weather forecasting and the effectiveness of climate monitoring. However, current cloud classification research has largely focused on daytime data. The lack of visible light data at night presents challenges in characterizing nocturnal cloud attributes, leading to difficulties in achieving continuous all-day cloud classification results. This study proposed an all-day infrared cloud classification model (AInfraredCCM) based on XGBoost. Initially, the latitude/longitude, 10 infrared channels, and 5 brightness temperature differences of the Himawari-8 satellite were selected as input features. Then, 1,314,275 samples were collected from the Himawari-8 full-disk data and cloud classification was conducted using the CPR/CALIOP merged cloud type product as training data. The key cloud types included cirrus, deep convective, altostratus, altocumulus, nimbostratus, stratocumulus, stratus, and cumulus. The cloud classification model achieved an overall accuracy of 86.22%, along with precision, recall, and F1-score values of 0.88, 0.84, and 0.86, respectively. The practicality of this model was validated across all-day temporal, daytime/nighttime, and seasonal scenarios. The results showed that the AInfraredCCM consistently performed well across various time periods and seasons, confirming its temporal applicability. In conclusion, this study presents an all-day cloud classification approach to obtain comprehensive cloud information for continuous weather monitoring, ultimately enhancing weather prediction accuracy and climate monitoring.