DOI: 10.1111/jebm.70137 ISSN: 1756-5383

Guidance for Grading the Evidence in Quantitative Umbrella Reviews

Hanan Khalil, Ritin Fernandez, Patraporn Bhatarasakoon, Kim Sears, Dawid Piper, Jennifer Stone, Cindy Stern, Kate Kynoch

ABSTRACT

Background

Umbrella reviews (URs) synthesize evidence across multiple systematic reviews and meta‐analyses to inform decision‐making. However, evaluating and integrating evidence certainty across overlapping and sometimes conflicting meta‐analyses remains a major methodological challenge of URs, limiting the reliability of conclusions.

Objective

To compare evidence evaluation frameworks for URs, assess their suitability for different study types, and provide guidance for managing overlapping evidence, conflicting certainty ratings, and forming overall conclusions.

Methods

We mapped published UR methods to identify current practices and gaps. Frameworks were compared across three dimensions: applicability to study types (interventional, causal observational, and descriptive observational), scalability and reproducibility, and capacity to handle methodological challenges specific to URs. Recommendations were developed through expert consensus and validated via case studies across diverse domains.

Results

Grading of Recommendations Assessment, Development and Evaluation (GRADE) is widely accepted for interventional evidence but shows limited applicability for observational URs due to subjective judgments and reproducibility issues. Credibility frameworks, based on predefined statistical thresholds, offer greater scalability and reproducibility but require adaptation for different study types. Key challenges include: (1) managing overlapping primary studies, (2) resolving conflicting certainty ratings between high‐quality reviews, and (3) ensuring transparent evidence integration.

Conclusions

Framework selection should be tailored to study type and context, with credibility frameworks better suited for observational evidence and GRADE optimal for interventional studies. Systematic approaches to overlapping evidence and conflicting ratings are essential for UR validity. We provide practical recommendations for framework selection, strategies for common challenges, and enhanced reporting standards to improve transparency and reproducibility.

More from our Archive