HAYSTAC: A Bayesian framework for robust and rapid species identification in high-throughput sequencing data

Evangelos A. Dimopoulos; Alberto Carmagnini; Irina M. Velsko; Christina Warinner; Greger Larson; Laurent A. F. Frantz; Evan K. Irving-Pease

doi:10.1371/journal.pcbi.1010493

What is it about?

Identification of specific species in samples that contain DNA from multiple organisms is critical, yet many taxonomic identification tools available are often prone to false positive identifications. HAYSTAC is a user-friendly and computationally scalable bioinformatic tool that can robustly identify species present in low abundances from DNA sequencing data (e.g. pathogens).

Photo by Ion Fet on Unsplash

Why is it important?

HAYSTAC is a program that is specifically designed to efficiently handle both ancient and modern DNA data, as well as incomplete reference databases. Thus, it becomes the ideal tool for running highly accurate hypothesis-driven analyses (i.e., assessing the presence of a specific species) on variably sized reference databases.

Perspectives

Developing HAYSTAC and writing this article has been a great learning experience and pleasure, as the co-authors are excellent scientists with whom I have had long standing collaborations. I hope our methodological approach further encourages scientists to use metagenomics in their research to answer novel questions and trust more the resulting metagenomic identifications.
Evangelos Dimopoulos
University of Oxford

This page is a summary of: HAYSTAC: A Bayesian framework for robust and rapid species identification in high-throughput sequencing data, PLoS Computational Biology, September 2022, PLOS,
DOI: 10.1371/journal.pcbi.1010493.
You can read the full text:

Read

Contributors

The following have contributed to this page

Evangelos Dimopoulos
University of Oxford

A rapid species identification method from DNA sequencing data

What is it about?

Why is it important?

Perspectives

Contributors

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

A rapid species identification method from DNA sequencing data

What is it about?

Featured Image

Why is it important?

Perspectives

Read the Original

Contributors

Share this page:

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management