EasyCen: A Lightweight Framework for Centromere Localisation and Repeat‐Organisation Profiling in Telomere‐to‐Telomere Genomes
Yunyun Lv, Yanping Li, Jia Li, Xidong Mu

ABSTRACT
Accurate identification of centromeres in telomere‐to‐telomere (T2T) genomes remains difficult because centromeric repeats evolve rapidly and lack conserved sequence features. We present EasyCen, a lightweight sequence‐based framework for centromere identification and repeat‐architecture profiling across diverse eukaryotes. Rather than relying on repeat annotation or homology, EasyCen recognises centromeres from recurrent positional features of repetitive DNA. In addition to centromere localisation, EasyCen incorporates a repeat‐pair profiling module for exploratory characterisation of internal repeat organisation. Benchmarking on