现在的位置: 首页研究点评, 进展交流>正文
[JAMA Surg发表述评]:仍然需要智能化、以人为中心的方式增强人工智能效果
2025年09月25日 研究点评, 进展交流 [JAMA Surg发表述评]:仍然需要智能化、以人为中心的方式增强人工智能效果已关闭评论

Invited Commentary 

AI in Surgery

Intelligent, Human-Centric Delivery Is Needed to Maximize AI

Carla M. Pugh, Mitra Ann Alikhani, Adam Rodman

JAMA Surg Published Online: August 6, 2025

doi: 10.1001/jamasurg.2025.2550

Artificial intelligence (AI)–augmented learning is here, and many believe it is superior to all past endeavors. AI’s ability to deliver real-time, objective, standardized feedback is an unprecedented game changer. Giglio et al1 investigated whether AI feedback accelerates surgical skill acquisition using a virtual reality simulator. Medical student participants (N = 87) were stratified into 3 groups: (1) computer-generated verbal feedback based on AI performance assessment; (2) human verbal feedback using an AI script identical to that of group 1; or (3) verbal feedback based on human interpretation and personalization of the AI performance assessment. The article emphasizes that group 3 outperformed the others; however, group 2 also outperformed group 1 in trial 5 and the final realistic task.

Beyond skill acquisition, student engagement improved by replacing synthetic, computer-generated speech (group 1) with a human voice (group 2), despite the words being identical. These results underscore that the method of interaction with an AI system is as important as the accuracy and automated performance of the system itself. Other studies, both in clinical diagnosis and radiological interpretation, show that AI outputs alone do not necessarily enhance human performance.2,3 Put simply, delivery matters.

An area rich for further research is the natural depth and nuances of human feedback—timely eye contact mixed with verbal praise, varied tone, and content emphasis. This was not fully explored in the study by Giglio and colleagues, highlighting the need for mixed-methods research that captures instructors’ tone and verbiage. Large language model systems are commonly optimized for positive engagement and sycophancy. Early evidence suggests that these factors decrease cognitive load and critical thinking in students and should be carefully considered as AI is integrated into medical curricula.4 The concept of desirable difficulty has been identified as a key driver of durable learning.5

A quiet success of the study is its objective novice-to-expert scaling. Anchoring forces, path lengths, and speeds to expert benchmarks allow educators to ask, “How close to competent?” rather than merely “How fast?” The race for precision education requires objective, normative data; without this, both humans and models dispense subjective, qualitative feedback that risks perpetuating individual biases.6 While AI cannot yet provide mentorship, emotional framing, or translation to the operating theater, standardized, real-time analytics cannot be delivered at scale by humans. Optimizing algorithms while ignoring point-of-care implementation yields only marginal gains. AI can facilitate learning, but cultivating human expertise still demands a human-centered interface—one that preserves friction, nuance, and the motivation to improve.

抱歉!评论已关闭.

×
腾讯微博