Publication
Systematic benchmarking demonstrates large language models have not reached the diagnostic accuracy of traditional rare-disease decision support tools
Justin T Reese, Leonardo Chimirri, Yasemin Bridges, Daniel Danis, J Harry Caufield, Michael A Gargano, Carlo Kroll, Andrew Schmeder, Fengchen Liu, Kyran Wissink, Julie A McMurry, Adam SL Graefe, Enock Niyonkuru, Daniel R Korn, Elena Casiraghi, Giorgio Valentini, Julius OB Jacobsen, Melissa Haendel, Damian Smedley, Christopher J Mungall, Peter N Robinson
July 2024, Cold Spring Harbor Laboratory Press
DOI: 10.1101/2024.07.22.24310816