DOI: 10.3390/ijms27135653 ISSN: 1422-0067

An Energy Model Based on Molecular Structure for Predicting Histone Modification Levels at lncRNA Promoter Regions in HepG2 Cells

Menglan Li, Yingli Chen, Qianzhong Li, Pengyu Du, Dimeng Zhang, Yuanyuan Zhao

In hepatocellular carcinoma (HepG2), aberrant histone modifications are closely linked to long non-coding RNA (lncRNA) expression. However, existing computational models lack physical interpretability at specific promoter coordinates. To address this, we developed a position-specific statistical scoring model based on adjacent and next-adjacent nucleotide frequencies. We trained two independent, position-specific matrices representing increased and decreased modification states across 600 bp promoter windows centered on the true signal summits. Finally, ten-fold cross-validation revealed that significant energy differences between sequences with increased and decreased histone signals enable excellent classification performance. These results indicted a strong correlation between the total energy of local DNA structures and histone modification signal.

More from our Archive