What is it about?

Pathologists examine tissue slides to find cancer that has spread to lymph nodes (metastases). AI models — convolutional neural networks (CNNs) — can now do this with very high accuracy, but they are "black boxes": they don't reveal why they flag a given image as cancerous, which makes clinicians hesitant to trust them. A popular way to look inside is LIME, which highlights the image regions that drove a decision as a heatmap. The catch is that LIME's output depends heavily on how the image is first broken into segments, and choosing good settings by hand is slow, subjective, and the results wobble from one run to the next. We developed EvEx ("Evolved Explanations"), which uses a multi-objective genetic algorithm to automatically search for the best segmentation settings — optimising three quality goals at once — and then averages the best solutions into a single explanation. We also controlled the underlying randomness so the explanations become reproducible. Tested on a public pathology dataset (Patch-Camelyon) for lymph-node metastasis detection, the automatically generated heatmaps matched expert pathologists' own tumour markings — even though those markings were never shown to the algorithm during the search.

Featured Image

Why is it important?

The work removes a stubborn, manual bottleneck in explainable AI: instead of an expert hand-tuning parameters for hours and judging heatmaps by eye, EvEx uses objective metrics and an evolutionary search to do it automatically. Crucially, it makes the explanations reproducible — deterministic for a given random seed, and in close agreement across different seeds, especially in the most relevant regions — which matters for trust and auditing in medicine. Because EvEx builds on LIME, which only needs a model's outputs, it is model-agnostic: it works on any classifier, including ensembles and non-CNN models, unlike gradient- or CAM-based methods that require access to the network's internals. And the fact that the evolved heatmaps coincide with expert segmentations, without ever using them in the process, is strong evidence that the explanations are genuinely meaningful — pointing to real value in cases where expert annotations don't exist, or where it isn't yet known which image regions should matter. This is, to the authors' knowledge, the first in-depth attempt to make LIME viable for realistic clinical scenarios. The honest limitation is computational cost: each patch takes hours on a CPU, dropping to roughly 20 minutes with GPU acceleration.

Perspectives

What I find elegant about this work is using one classic idea — evolutionary optimisation — to tame another — a powerful but finicky explanation method. The result that genuinely stayed with me is that the algorithm, guided only by abstract quality metrics, kept converging on the same tissue regions that expert pathologists had independently marked as tumour, without ever being told where to look. To me that is the heart of trustworthy medical AI: not just a confident prediction, but an explanation we can reproduce, trace, and check against human expertise. This sits squarely in the line of work I continue to pursue in explainable AI for digital pathology, and the directions I'm most keen on are cutting the computational cost down to clinically practical times and generalising the same automated-tuning idea to explanation methods beyond LIME.

Prof. Dr. Eduardo Costa da Silva
Pontificia Universidade Catolica do Rio de Janeiro

Read the Original

This page is a summary of: Evolved explainable classifications for lymph node metastases, Neural Networks, April 2022, Elsevier,
DOI: 10.1016/j.neunet.2021.12.014.
You can read the full text:

Read

Contributors

The following have contributed to this page