Benchmarking Dimensionality Reduction Techniques for Spatial Transcriptomics

Md Ishtyaq Mahmud; Veena Kochat; Suresh Satpati; Jagan Mohan Reddy Dwarampudi; Kunal Rai; Tania Banerjee

doi:10.1145/3765612.3767237

What is it about?

This paper systematically compares widely used dimensionality reduction methods (such as PCA, non-negative matrix factorization (NMF), and deep learning–based autoencoders) for analyzing spatial transcriptomics data. Using a large cholangiocarcinoma tissue dataset, the authors evaluate how different techniques influence downstream clustering, biological interpretation, and marker-gene fidelity. In addition to benchmarking existing methods, the study introduces two biologically motivated evaluation metrics (1) Cluster Marker Coherence (CMC) and (2) Marker Exclusion Rate (MER), and a simple post-processing strategy that reassigns cells to more biologically appropriate clusters based on marker expression.

Photo by ANIRUDH on Unsplash

Why is it important?

Dimensionality reduction is a foundational step in nearly all spatial transcriptomics pipelines, yet method choice is often arbitrary or driven by convenience. This work shows that different methods can lead to meaningfully different biological conclusions, even when applied to the same data. By providing a unified, quantitative benchmarking framework and introducing interpretable biological metrics, the study enables researchers to make principled, application-specific choices rather than relying on default tools. The proposed MER-guided reassignment further demonstrates that biological accuracy can be improved without redesigning entire pipelines, making the findings immediately actionable for the community.

Perspectives

For computational biologists: The study highlights that no single dimensionality reduction method is universally optimal; method selection should depend on whether geometric separation, reconstruction accuracy, or biological marker fidelity is the primary goal. For experimental and translational researchers: The results emphasize that clustering artifacts can arise purely from computational choices, underscoring the need for biologically grounded validation metrics when interpreting spatial domains or cell types. For method developers: The work opens avenues for integrating marker-aware objectives directly into dimensionality reduction and clustering algorithms, particularly for spatially aware models.
Tania Banerjee
University of Houston

This page is a summary of: Benchmarking Dimensionality Reduction Techniques for Spatial Transcriptomics, October 2025, ACM (Association for Computing Machinery),
DOI: 10.1145/3765612.3767237.
You can read the full text:

Read

Contributors

The following have contributed to this page

Tania Banerjee
University of Houston

Benchmarking Dimensionality Reduction Techniques for Spatial Transcriptomics

What is it about?

Why is it important?

Perspectives

Contributors

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

Benchmarking Dimensionality Reduction Techniques for Spatial Transcriptomics

What is it about?

Featured Image

Why is it important?

Perspectives

Read the Original

Contributors

Share this page:

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management