Mostrar el registro sencillo del ítem

dc.contributor.authorTorres-Zegarra, B.C.es_PE
dc.contributor.authorRios-Garcia, W.es_PE
dc.contributor.authorÑaña-Cordova, A.M.es_PE
dc.contributor.authorArteaga-Cisneros, K.F.es_PE
dc.contributor.authorBenavente-Chalco, X.C.es_PE
dc.contributor.authorBustamante-Ordoñez, M.A.es_PE
dc.contributor.authorGutierrez-Rios, C.J.es_PE
dc.contributor.authorRamos-Godoy, C.A.es_PE
dc.contributor.authorTeresa Panta Quezada, K.L.es_PE
dc.contributor.authorGutiérrez-Arratia, J.D.es_PE
dc.contributor.authorFlores-Cohaila, J.A.es_PE
dc.date.accessioned2026-03-11T17:32:14Z
dc.date.available2026-03-11T17:32:14Z
dc.date.issued2023
dc.identifier.urihttp://hdl.handle.net/20.500.14074/10222
dc.description.abstractPurpose We aimed to describe the performance and evaluate the educational value of justifications provided by artificial intelligence chatbots, including GPT-3.5, GPT-4, Bard, Claude, and Bing, on the Peruvian National Medical Licensing Examination (P-NLME). Methods This was a cross-sectional analytical study. On July 25, 2023, each multiple-choice question (MCQ) from the P-NLME was entered into each chatbot (GPT-3, GPT-4, Bing, Bard, and Claude) 3 times. Then, 4 medical educators categorized the MCQs in terms of medical area, item type, and whether the MCQ required Peru-specific knowledge. They assessed the educational value of the justifications from the 2 top performers (GPT-4 and Bing). Results GPT-4 scored 86.7% and Bing scored 82.2%, followed by Bard and Claude, and the historical performance of Peruvian examinees was 55%. Among the factors associated with correct answers, only MCQs that required Peru-specific knowledge had lower odds (odds ratio, 0.23; 95% confidence interval, 0.09–0.61), whereas the remaining factors showed no associations. In assessing the educational value of justifications provided by GPT-4 and Bing, neither showed any significant differences in certainty, usefulness, or potential use in the classroom. Conclusion Among chatbots, GPT-4 and Bing were the top performers, with Bing performing better at Peru-specific MCQs. Moreover, the educational value of justifications provided by the GPT-4 and Bing could be deemed appropriate. However, it is essential to start addressing the educational value of these chatbots, rather than merely their performance on examinations.es_PE
dc.description.sponsorshipEste trabajo fue financiado por UK Research and Innovation, UKRI, (105173).es_PE
dc.formatapplication/pdfes_PE
dc.language.isoenges_PE
dc.publisherKorea Health Personnel Licensing Examination Institute.es_PE
dc.relation.ispartofhttps://www.scopus.com/pages/publications/85177454993es_PE
dc.relation.ispartofurn:issn:19755937es_PE
dc.relation.ispartofJ. Educ. Eval. Health Prof. 2023; 20: 30es_PE
dc.rightsinfo:eu-repo/semantics/openAccesses_PE
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/es_PE
dc.subjectMedical educationes_PE
dc.subjectEducational measurementes_PE
dc.subjectArtificial intelligencees_PE
dc.subjectPerues_PE
dc.titlePerformance of ChatGPT, Bard, Claude, and Bing on the Peruvian National Licensing Medical Examination: a cross-sectional study.es_PE
dc.typeinfo:eu-repo/semantics/articlees_PE
dc.type.versioninfo:eu-repo/semantics/publishedVersiones_PE
dc.subject.ocdehttps://purl.org/pe-repo/ocde/ford#5.03.01es_PE
dc.identifier.doihttps://doi.org/10.3352/jeehp.2023.20.30es_PE


Ficheros en el ítem

Thumbnail

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem

info:eu-repo/semantics/openAccess
Excepto si se señala otra cosa, la licencia del ítem se describe como info:eu-repo/semantics/openAccess
Universidad Nacional de Cajamarca

Av. Atahualpa 1050, Cajamarca - Perú | Telf. (+51)076-599220

Todos los contenidos de repositorio.unc.edu.pe están bajo la Licencia Creative Commons

repositorio@unc.edu.pe