Co-Option and Conflict: The Deep Evolutionary History of ZP-Domain Proteins from ECMs to Species Barriers
Natalia Bezborodkina, Daniil Smutin, Leonid AdoninThe Zona Pellucida (ZP) and its structural analogs are evolutionarily ancient extracellular matrix components. These are essential for oocyte protection, species-specific gamete recognition, and prevention of polyspermy across Metazoa. Defined by the conserved ZP-domain—comprising ZP-N and ZP-C subdomains—these glycoproteins self-assemble into fibrillar matrices through tightly regulated polymerization. Mechanisms of the regulated polymerization involve furin cleavage, disulfide bonding, and hydrophobic interactions. Once considered a vertebrate innovation, the canonical ZP-domain— defined by its bipartite ZP-N/ZP-C architecture, eight conserved cysteine residues, and capacity for matrix polymerization—is now recognized as an ancient metazoan extracellular module, with homologs identified in basal lineages including Porifera, Cnidaria, and Placozoa. While ZP-like sequences have been reported in choanoflagellates such as Salpingoeca rosetta, these lack the complete canonical features and are considered distant structural relatives rather than true ZP-modules. There they function in cell adhesion and tissue integrity, suggesting an origin predating the evolution of specialized reproductive coats. Previous phylogenetic analyses across 97 metazoan species have revealed that vertebrate ZP genes arose from ancestral duplications of the canonical ZP-module. Accordingly, they give rise to eight subfamilies (ZP1–ZP4, ZPD, ZPAX, ZPX, ZPY), with lineage-specific expansions, losses, and pseudogenization reflecting adaptations to diverse reproductive strategies. Positive selection in sperm-binding regions of ZP2 and ZP3 drives a rapid adaptive evolution. It underscores coevolutionary arms races with sperm ligands, contributing to reproductive isolation and speciation. In invertebrates such as abalone and insects, ZP-domain proteins mediate analogous functions through lineage-specific elaborations, including tandem repeats and domain shuffling. Post-translational modifications, particularly glycosylation, fine-tune sperm receptor specificity and matrix stability. The functional transition from a general protective barrier in early metazoans to a sophisticated gamete recognition interface in vertebrates exemplifies modular evolution. This synthesis highlights the domain-level deep homology of ZP-domain proteins as a foundational element of metazoan extracellular matrices, repurposed through gene duplication, neofunctionalization, and selection to meet the demands of evolving reproductive modes. These insights bridge evolutionary biology, reproductive medicine, and developmental genetics. However, major gaps remain, including unresolved orthology between vertebrate and invertebrate ZP genes, the relative contribution of glycans versus protein backbone in sperm recognition, and the lack of functional evidence for canonical ZP-domain proteins in insects. Future studies integrating glycoproteomics, single-cell transcriptomics, and CRISPR-based models are needed to resolve these questions.