Assessment of artificial intelligence applications in responding to dental trauma

doi:10.1111/edt.12965

DOI: 10.1111/edt.12965 ISSN: 1600-4469

Assessment of artificial intelligence applications in responding to dental trauma

Idil Ozden, Merve Gokyar, Mustafa Enes Ozden, Hesna Sazak Ovecoglu

Show PDF Cite

Abstract

Background

This study assessed the consistency and accuracy of responses provided by two artificial intelligence (AI) applications, ChatGPT and Google Bard (Gemini), to questions related to dental trauma.

Materials and Methods

Based on the International Association of Dental Traumatology guidelines, 25 dichotomous (yes/no) questions were posed to ChatGPT and Google Bard over 10 days. The responses were recorded and compared with the correct answers. Statistical analyses, including Fleiss kappa, were conducted to determine the agreement and consistency of the responses.

Results

Analysis of 4500 responses revealed that both applications provided correct answers to 57.5% of the questions. Google Bard demonstrated a moderate level of agreement, with varying rates of incorrect answers and referrals to physicians.

Conclusions

Although ChatGPT and Google Bard are potential knowledge resources, their consistency and accuracy in responding to dental trauma queries remain limited. Further research involving specially trained AI models in endodontics is warranted to assess their suitability for clinical use.

Outline

Assessment of artificial intelligence applications in responding to dental trauma

Abstract

Background

Materials and Methods

Results

Conclusions

More from our Archive