JAMA : the journal of the American Medical Association
The most recent articles from:
JAMA
-
Large language models (LLMs) can assist in various health care activities, but current evaluation approaches may not adequately identify the most useful application areas. ⋯ Existing evaluations of LLMs mostly focus on accuracy of question answering for medical examinations, without consideration of real patient care data. Dimensions such as fairness, bias, and toxicity and deployment considerations received limited attention. Future evaluations should adopt standardized applications and metrics, use clinical data, and broaden focus to include a wider range of tasks and specialties.