Assessing the response consistency in Chat Generative Pretrained Transformer: The case of “What is Ayurveda?”
Janmejaya Samal
INTRODUCTION:
Chat Generative Pretrained Transformer (ChatGPT) is a language-based chatbot launched in November 2022. This study assessed the consistency of ChatGPT's responses to a single question asked 10 consecutive times within a particular time frame.
MATERIALS AND METHODS:
The study followed two steps: in the first step, only one question, “What is
RESULTS:
The median word count was 334 (standard deviation [SD] = 32.87, interquartile range [IQR] = 313–354), the median number of sentences was 18 (SD = 2.36, IQR = 17.0–20.3), the median number of headings was 1.50 (SD = 0.82, IQR = 1–2), and the median number of subheadings was 8 (SD = 2.08, IQR = 7–9). The linear regression between the number of words and the other measures showed a statistically significant association with the number of sentences (
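The summary measures reported in the results (median, SD, IQR, and a simple linear regression between counts) can be illustrated with a minimal sketch. It assumes Python's standard `statistics` module, the sample SD, and the "inclusive" quartile method; the software and quartile convention used in the study are not stated here, so the results may differ slightly. The function names `summarize` and `simple_linear_regression` are hypothetical, and the numbers below are placeholders, not the study's data. Significance testing is not included.

```python
import statistics


def summarize(values):
    """Return the median, sample SD, and IQR bounds (Q1, Q3) of a list of numbers."""
    # Quartile convention is an assumption; other methods give slightly different bounds.
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return {
        "median": statistics.median(values),
        "sd": statistics.stdev(values),  # sample SD (n - 1 denominator)
        "iqr": (q1, q3),
    }


def simple_linear_regression(x, y):
    """Ordinary least-squares fit of y = intercept + slope * x."""
    mean_x, mean_y = statistics.fmean(x), statistics.fmean(y)
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical placeholder counts for 10 responses; NOT the study's data.
word_counts = [300, 310, 320, 330, 340, 350, 360, 370, 380, 390]
sentence_counts = [15, 16, 16, 17, 18, 18, 19, 20, 20, 21]

print(summarize(word_counts))
print(simple_linear_regression(sentence_counts, word_counts))
```

Here the word count is treated as the outcome and the sentence count as the predictor; that direction is also an assumption made for illustration.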
CONCLUSIONS:
ChatGPT can be useful in many ways in education and research. However, its responses should be taken with a pinch of salt; as ChatGPT itself says, “ChatGPT can make mistakes. Check important info.”