DOI: 10.3390/data11060151 ISSN: 2306-5729

A Reproducible Synthetic Socio-Digital Network Dataset for Analyzing Digital Gaps in Community-Based Tourism Communities in Rural Ecuador

Dolores Mieles-Cevallos, Lourdes Suntagsi-Tuasa, Jael Zambrano-Mieles, Velasco Zambrano-Burgos, Miguel Vera, Nicolás Márquez, Cristian Vidal-Silva

Digital transformation has become an essential component of sustainable rural development, yet substantial inequalities persist in how communities access, adopt, and benefit from digital technologies. Understanding these disparities requires not only information about technological resources but also knowledge of the relational structures through which information, support, and opportunities circulate. This article presents a reproducible synthetic socio-digital network dataset designed to support the analysis of digital gaps in community-based tourism (CBT) environments. Rather than containing original respondent-level observations, the repository was computationally reconstructed from aggregate statistics derived from field studies conducted in three rural communities in the province of Guayas, Ecuador: Bucay (5 de Septiembre), Manglares Churute, and Ruta de los Chirijos. All node-level records, survey variables, and support relationships included in the repository were synthetically generated to preserve aggregate community characteristics while protecting participant confidentiality and preventing individual re-identification. The repository contains synthetic actor metadata, reconstructed socio-digital variables, directed support networks, graph representations in interoperable formats, and precomputed Social Network Analysis (SNA) indicators. The dataset includes 90 synthetic actors, more than one thousand generated support interactions distributed across multiple socio-digital dimensions, machine-readable metadata, and reusable scripts for preprocessing, validation, graph construction, and metric computation. The represented dimensions include financial assistance, training support, information exchange, technological support, social media promotion, institutional collaboration, trust, and emotional closeness. To facilitate reuse, all resources are distributed in standardized formats compatible with NetworkX, Gephi, Neo4j, and graph-learning frameworks. The repository follows FAIR principles and includes documentation intended to support transparency, reproducibility, and methodological benchmarking. Potential applications include social network analysis, graph mining, graph neural networks, digital inequality research, computational social science, community resilience studies, and educational activities. By providing an openly documented synthetic dataset and reproducible computational workflow, the repository contributes to the study of socio-digital systems, privacy-preserving data sharing, and community-level digital transformation processes.

More from our Archive