Systematic benchmarking demonstrates large language models have not reached the diagnostic accuracy of traditional rare-disease decision support tools

Justin T Reese; Leonardo Chimirri; Yasemin Bridges; Daniel Danis; J Harry Caufield; Michael A. Gargano; Carlo Kroll; Andrew Schmeder; Fengchen Liu; Kyran Wissink; Julie A McMurry; Adam SL Graefe; Enock Niyonkuru; Daniel R Korn; Elena Casiraghi; Giorgio Valentini; Julius OB Jacobsen; Melissa Haendel; Damian Smedley; Christopher J Mungall; Peter N Robinson

doi:10.1101/2024.07.22.24310816

This page is a summary of: Systematic benchmarking demonstrates large language models have not reached the diagnostic accuracy of traditional rare-disease decision support tools, July 2024, Cold Spring Harbor Laboratory Press, DOI: 10.1101/2024.07.22.24310816.

Contributors

The following have contributed to this page

Professor Giorgio Valentini
Università degli Studi di Milano
