ChatGPT‐4 performance in rhinology: A clinical case series

doi:10.1002/alr.23323

DOI: 10.1002/alr.23323 ISSN: 2042-6976

ChatGPT‐4 performance in rhinology: A clinical case series

Thomas Radulesco, Alberto Maria Saibene, Justin Michel, Luigi Angelo Vaira, Jérôme R. Lechien

Otorhinolaryngology
Immunology and Allergy

Show PDF Cite

Keypoints

Chatbot Generative Pre‐trained Transformer (ChatGPT)‐4 indicated more than twice additional examinations than practitioners in the management of clinical cases in rhinology.

The consistency between ChatGPT‐4 and practitioner in the indication of additional examinations may significantly vary from one examination to another.

The ChatGPT‐4 proposed a plausible and correct primary diagnosis in 62.5% cases, while pertinent and necessary additional examinations and therapeutic regimen were indicated in 7.5%–30.0% and 7.5%–32.5% of cases, respectively.

The stability of ChatGPT‐4 responses is moderate‐to‐high. The performance of ChatGPT‐4 was not influenced by the human‐reported level of difficulty of clinical cases.

Outline

ChatGPT‐4 performance in rhinology: A clinical case series

Keypoints

More from our Archive