School of Medicine Publications and Presentations

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program

Daniel Taliun
Daniel N. Harris
Michael D. Kessler
Jedidiah Carlson
John Blangero, The University of Texas Rio Grande Valley
Joanne E. Curran, The University of Texas Rio Grande ValleyFollow
Michael C. Mahaney, The University of Texas Rio Grande Valley
Harald H. H. Goring, The University of Texas Rio Grande Valley
Ravindranath Duggirala, The University of Texas Rio Grande Valley
Juan M. Peralta, The University of Texas Rio Grande Valley

Document Type

Article

Publication Date

2-10-2021

Abstract

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.

Recommended Citation

Taliun, D., Harris, D.N., Kessler, M.D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021). https://doi.org/10.1038/s41586-021-03205-y

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Publication Title

Nature

DOI

10.1038/s41586-021-03205-y

Academic Level

faculty

Mentor/PI Department

Office of Human Genetics

Download

Included in

Medical Genetics Commons

COinS

School of Medicine Publications and Presentations

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program

Document Type

Publication Date

Abstract

Recommended Citation

Creative Commons License

Publication Title

DOI

Academic Level

Mentor/PI Department

Included in

Browse

Search

Author Corner

Links

School of Medicine Publications and Presentations

Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program

Authors

Document Type

Publication Date

Abstract

Recommended Citation

Creative Commons License

Publication Title

DOI

Academic Level

Mentor/PI Department

Included in

Share

Browse

Search

Author Corner

Links