Logo do repositório

Chatbot Underperformance in Biology and Image-Based Questions in Medical Education

dc.contributor.authorRizzi, Joyce Santana [UNESP]
dc.contributor.authorRequena, Lorraine Silva [UNESP]
dc.contributor.authorBicudo, Angelica Maria
dc.contributor.authorFilho, Pedro Tadao Hamamoto [UNESP]
dc.contributor.authorFerretti, Renato [UNESP]
dc.date.accessioned2026-04-09T12:45:33Z
dc.date.issued2025-12-04
dc.description.abstractAI chatbots have demonstrated variable performances across biological disciplines in medical education, particularly in multiple-choice and image-based assessments. However, their performance in addressing discipline-specific and image-based questions in biology remains unexamined. This study evaluated the accuracy and reliability of chatbots in answering biological questions from the Progress Test, a medical assessment applied across ten universities. We conducted an observational cross-sectional study by inputting 180 questions into the chatbots and categorising them according to morphology, function, and aggression. Each question was assessed for correctness across multiple chatbot attempts, and logistic regression and hierarchical clustering were applied to identify performance patterns. Although the chatbots answered functional and morphological questions accurately (from 85% (Gemini) to 91.7% (ChatGPT-4)), their accuracy decreased significantly for questions involving biological aggression and visual content. The agreement between chatbot responses remained weak, and Co-pilot displayed the lowest concordance. Chatbot accuracy decreased significantly in aggression-related disciplines and image-based questions. Logistic regression confirmed that the presence of images reduced the odds of correct answers by up to 17.6% (ChatGPT-4). Hierarchical clustering distinguished the two distinct response patterns, further validating these findings. These results highlight the potential of chatbots in medical education while emphasising their limitations in handling image-based and aggression-related content.
dc.description.affiliationLaboratory of Muscle Biology, Department of Structural and Functional Biology, Institute of Bioscience of Botucatu, Sao Paulo State University (UNESP), Botucatu, Brazil
dc.description.affiliationSchool of Medical Sciences, University of Campinas (UNICAMP), Campinas, Brazil
dc.description.affiliationBotucatu Medical School, Department of Neurosciences and Mental Health, São Paulo State University (UNESP), Botucatu, Brazil
dc.description.affiliationUnespLaboratory of Muscle Biology, Department of Structural and Functional Biology, Institute of Bioscience of Botucatu, Sao Paulo State University (UNESP), Botucatu, Brazil
dc.description.affiliationUnespBotucatu Medical School, Department of Neurosciences and Mental Health, São Paulo State University (UNESP), Botucatu, Brazil
dc.identifierhttps://app.dimensions.ai/details/publication/pub.1195793335
dc.identifier.dimensionspub.1195793335
dc.identifier.doi10.1080/28338073.2025.2596550
dc.identifier.issn2161-4083
dc.identifier.issn2833-8073
dc.identifier.pmcidPMC12679838
dc.identifier.urihttps://hdl.handle.net/11449/320929
dc.publisherTaylor & Francis
dc.relation.ispartofJournal of CME; n. 1; v. 14; p. 2596550
dc.rights.accessRightsAcesso abertopt
dc.rights.sourceRightsoa_all
dc.rights.sourceRightsgold
dc.sourceDimensions
dc.titleChatbot Underperformance in Biology and Image-Based Questions in Medical Education
dc.typeArtigopt
dspace.entity.typePublication
relation.isOrgUnitOfPublicationa3cdb24b-db92-40d9-b3af-2eacecf9f2ba
relation.isOrgUnitOfPublicationab63624f-c491-4ac7-bd2c-767f17ac838d
relation.isOrgUnitOfPublication.latestForDiscoverya3cdb24b-db92-40d9-b3af-2eacecf9f2ba
unesp.campusUniversidade Estadual Paulista (UNESP), Instituto de Biociências, Botucatupt
unesp.campusUniversidade Estadual Paulista (UNESP), Faculdade de Medicina, Botucatu

Arquivos

Pacote original

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
Chatbot Underperformance in Biology and Image-Based Questions in Medical Education.pdf
Tamanho:
869.62 KB
Formato:
Adobe Portable Document Format
Descrição: