Clover: An unbiased method for prioritizing differentially expressed genes using a data‐driven approach
Gina Miku Oba, Ryuichiro Nakato- Cell Biology
- Genetics
Abstract
Identifying key genes from a list of differentially expressed genes (DEGs) is a critical step in transcriptome analysis. However, current methods, including Gene Ontology analysis and manual annotation, essentially rely on existing knowledge, which is highly biased depending on the extent of the literature. As a result, understudied genes, some of which may be associated with important molecular mechanisms, are often ignored or remain obscure. To address this problem, we propose Clover, a data‐driven scoring method to specifically highlight understudied genes. Clover aims to prioritize genes associated with important molecular mechanisms by integrating three metrics: the likelihood of appearing in the DEG list, tissue specificity, and number of publications. We applied Clover to Alzheimer's disease data and confirmed that it successfully detected known associated genes. Moreover, Clover effectively prioritized understudied but potentially druggable genes. Overall, our method offers a novel approach to gene characterization and has the potential to expand our understanding of gene functions. Clover is an open‐source software written in Python3 and available on GitHub at