What is it about?
Medical and clinical knowledge is mostly written as free text in articles, patents, electronic health records, and even social media posts. This makes it difficult to search, reuse, and connect information, because the same disease, drug, or gene can be described with many different names, abbreviations, or acronyms. Our article reviews how computer systems tackle this problem using a task called named entity linking (NEL). NEL systems first find important terms in text, such as diseases, chemicals, or genes, and then link each term to a precise entry in a biomedical knowledge organisation system (KOS) like UMLS, MeSH, or SNOMED CT. We analysed 102 research papers published between 2013 and 2024 to understand how these systems are built, what knowledge sources they use, and how the field has changed over the last decade. We describe the main families of methods (rule‑based, dictionary‑based, graph‑based, machine learning and deep learning, and more recent large language model approaches) and how they are evaluated on shared datasets. We also map which biomedical and clinical vocabularies and ontologies are most used, and highlight gaps, such as limited coverage for some sub‑domains and languages beyond English. Finally, we summarise common challenges (for example ambiguous abbreviations, missing concepts in ontologies, and noisy training data) and point to promising research directions.
Featured Image
Photo by Tim Wildsmith on Unsplash
Why is it important?
Reliable linking between clinical or biomedical text and standardised vocabularies is essential for many downstream applications, from clinical decision support and safety monitoring to large‑scale research using electronic health records and scientific literature. If a system picks the wrong concept for a term, it can mislead later analyses, bias models, and ultimately affect patient‑facing tools. By systematically organising the last decade of research on biomedical named entity linking and its underlying knowledge organisation systems, our survey gives practitioners a clear overview of available methods, resources, and evaluation datasets. This can help researchers choose appropriate techniques, identify suitable ontologies and benchmarks, and avoid repeating known pitfalls. For ontology and vocabulary developers, our analysis highlights where current KOS fall short for NEL (for example in coverage, structure, and multilingual support), suggesting how they can evolve to better support text‑mining and AI applications. Ultimately, improving NEL helps turn unstructured biomedical text into high‑quality, computable data that can be shared, reused, and trusted.
Perspectives
This work sits at the intersection of natural language processing and biomedical knowledge representation. Rather than proposing yet another algorithm, we step back and look at the ecosystem as a whole: methods, ontologies, datasets, and open problems. We hope it will serve as a reference point for both AI researchers and domain experts who need to connect clinical or biomedical text to structured knowledge in a robust and transparent way.
Francisco Couto
Universidade de Lisboa
Read the Original
This page is a summary of: Systematic Review of Named Entity Linking and Knowledge Organisation Systems in Biomedical and Clinical Domains, ACM Computing Surveys, February 2026, ACM (Association for Computing Machinery),
DOI: 10.1145/3796222.
You can read the full text:
Resources
Book: Data and Text Processing for Health and Life Sciences
Open‑access book for beginners on data and text processing for biomedical named entity linking and knowledge organisation systems.
MER: Biomedical Entity Recognition and Linking (GitHub)
Open‑source biomedical named entity recognition and normalization tool (MER) that links text mentions to multiple knowledge organisation systems, supporting applications related to this systematic review.
BENT: Biomedical Entity Normalization Toolkit
Open‑source toolkit for biomedical entity normalization (BENT), providing models and resources to map textual mentions to standardised concepts, complementing the methods surveyed in this systematic review.
Contributors
The following have contributed to this page







