DOI: 10.1002/jad.12280 ISSN: 0140-1971

Comparing the reliability and validity of youth‐reported checklists and standardized interviews for categorical measurement of emotional and behavioral problems

Laura Duncan, Li Wang, Michael H. Boyle
  • Psychiatry and Mental health
  • Developmental and Educational Psychology
  • Social Psychology
  • Pediatrics, Perinatology and Child Health

Abstract

Introduction

Self‐completed checklists measuring youth mental health problems produce dimensional scale scores and can be converted to categorical classifications representing the presence/absence of psychopathology. We test whether categorical classifications from scale scores are equivalent psychometrically to categorical classifications of the same problems obtained by lay‐administered standardized structured diagnostic interviews.

Methods

The sample of n = 325 youth aged 12–18 (44% male) and their parent/caregivers come from combined test–retest reliability studies conducted in Ontario, Canada, from 2011 to 2015. Ontario Child Health Study Emotional Behavioural Scales‐Brief Version (OCHS‐EBS‐B) scores converted to categorical classifications of emotional and behavioral problems were compared with interview classifications. We test hypotheses of statistical equivalence and inferiority, using a confidence interval approach to detect if differences lie within the smallest effect size of interest of ±0.18. We compare categorical classifications on: (1) test–retest reliability (ҡ), (2) content validity (between‐instrument agreement), and (3) construct validity (strength of association with three mental health‐related constructs).

Results

Average test–retest reliabilities were 0.695 (checklists) and 0.670 (interviews). The reliability of checklist emotional problem classifications was not inferior to interview classifications and the difference in reliability between instruments for behavioral problems was small (−0.036). Average between‐instrument agreement was ҡ = 0.586 (observed) and ҡ = 0.841 (corrected for attenuation due to measurement error) indicating high content overlap. Statistical equivalence criteria were met in 5 of 6 construct validity comparisons.

Conclusions

Categorical classifications of emotional and behavioral problems from youth‐reported checklists are, on balance, equivalent to interview classifications. Checklists represent a simple, brief, inexpensive alternative to interviews.

More from our Archive