Artificial Intelligence, Deep Learning, and Computer Vision in Hysteroscopy: A Systematic Review
Rafał Watrowski, Attilio Di Spiezio Sardo, Peter Török, Andrea Rosati, Stoyan Kostov, Ibrahim Alkatout, Salvatore Giovanni VitaleBackground/Objectives: Hysteroscopy is the gold standard for visualization and treatment of intrauterine pathology. Because hysteroscopic interpretation remains operator-dependent, artificial intelligence (AI) has been evaluated as a tool to improve consistency, lesion recognition, and decision support. We aimed to systematically review AI, machine learning (ML), deep learning (DL), or computer-aided diagnosis (CAD) applications in hysteroscopy. Methods: A systematic search of PubMed/MEDLINE and EBSCOhost was performed from database inception to 8 March 2026, supplemented by targeted searches. Risk of bias was assessed using QUADAS-2 (diagnostic), PROBAST (prognostic), RoB2, and structured technical quality domains. Results: Nineteen primary studies were included, covering five areas: diagnostic classification and object detection (n = 8), real-time lesion detection and localization (n = 4), segmentation and visual-field support (n = 3), operative guidance (n = 1), and prognostic or decision-support applications (n = 3). Performance was highest in narrowly defined binary tasks and in large multicenter systems (e.g., ECCADx: AUC 0.979 internal, 0.975 external) and in prognostic fertility-prediction models after hysteroscopic adhesiolysis (AUC up to 0.992). Broader multiclass classification of heterogeneous lesions showed uneven and lower performance. Most studies were single-center, retrospective, and lacked external validation. Only one randomized study linked AI support to measurable procedural outcomes. Conclusions: The available studies indicate good technical performance in selected hysteroscopic tasks, particularly binary classification, focal lesion detection, and postoperative fertility stratification. Current evidence, however, remains limited by retrospective design, operator-dependent image acquisition, inconsistent validation, and scarce outcome-based clinical testing. In the short term, the most likely role of these systems is to support image interpretation, improve visual quality control, highlight suspicious lesions, and integrate hysteroscopic findings with complementary clinical data.