Explore our publications and preprints advancing healthcare through rigorous AI evaluation.
Clinical evaluation of large language models (LLMs) currently relies on static datasets and isolated scenarios that fail to capture the […]
The large volume of abdominal computed tomography (CT) scans coupled with the shortage of radiologists have intensified the need for […]
General-purpose large language models (LLMs) are now commonplace throughout society, becoming de facto health advisors for millions worldwide1. The public […]
mportance: High-quality discharge summaries are essential for safe care transitions but contribute substantially to clinician documentation burden and burnout. While […]
Medical artificial intelligence (AI) tools, including clinical language models, vision–language models and multimodal health record models, are used to summarize […]
Large language model (LLM) chat tools have the potential to transform healthcare workflows by improving efficiency and reducing administrative burdens. […]
While large language models (LLMs) can support clinical documentation needs, standalone tools struggle with “workflow friction” from manual data entry. […]
AI chatbots are proliferating in healthcare systems. It is essential to explore how physicians use these tools in order to […]
AI systems are rapidly approaching expert-level diagnostic reasoning; however, management reasoning —the art of translating diagnoses into personalized care—remains distinctly […]
While large language models (LLMs) achieve near-perfect scores on medical licensing exams, these evaluations inadequately reflect the complexity and diversity […]
Get the latest on our studies, grant awards, and media coverage.