School of Medicine Publications and Presentations

Generating human papillomavirus (HPV) reference databases to maximize genomic mapping

Victor Trevino, Tecnologico de Monterrey
Mariel Aracely Oyervides-Muñoz, Universidad Autonoma de Nuevo Leon
Genaro Antonio Ramirez-Correa, The University of Texas Rio Grande Valley
Lourdes Garza

Document Type

Article

Publication Date

1-2022

Abstract

Genomic experiments analyzing human papillomaviruses (HPVs) require a carefully selected list of sequences as a reference database to map millions of reads. The available sources, such as the Papillomavirus Episteme (PaVE), are organized based on variations in the L1 gene rather than the whole HPV sequence. Moreover, the PaVE process uses complex multiple sequence alignments containing hundreds or thousands of sequences. These issues complicate the generation of a reference database for genomics, leading to the generation of per-analysis-defined databases. Here, we propose a de novo strategy considering all HPV sequences reported in the NCBI database to define a subset of highly representative HPV sequences. The strategy is based on oligonucleotide frequency profiling of the whole sequence followed by hierarchical clustering. Using data from HPV capture experiments, we demonstrate that this strategy selects suitable sequences as a reference database to map most mappable reads unambiguously. We provide some recommendations to improve HPV mapping. The generated .fasta files can be accessed at https://github.com/vtrevino/HPV-Ref-Genomes.
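The strategy named in the abstract (oligonucleotide frequency profiling of whole sequences, then hierarchical clustering) can be illustrated with a minimal sketch. The abstract does not state the oligonucleotide length, the distance measure, the linkage rule, or how a representative is picked from each cluster, so everything here is an assumption made for illustration: k = 3, Euclidean distance, average linkage, and the cluster medoid as representative. The function names are hypothetical and are not taken from the authors' repository.

```python
from collections import Counter
from itertools import product
from math import sqrt


def kmer_profile(seq, k=3):
    """Relative frequency of each of the 4**k DNA k-mers in seq (k is assumed)."""
    seq = seq.upper()
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    # k-mers containing ambiguous bases (e.g. N) are ignored in the total.
    total = sum(counts[km] for km in kmers) or 1
    return [counts[km] / total for km in kmers]


def euclidean(a, b):
    """Euclidean distance between two profiles (assumed metric)."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cluster(profiles, n_clusters):
    """Naive average-linkage agglomerative clustering over profile indices."""
    clusters = [[i] for i in range(len(profiles))]
    dist = [[euclidean(p, q) for q in profiles] for p in profiles]

    def linkage(c1, c2):
        return sum(dist[i][j] for i in c1 for j in c2) / (len(c1) * len(c2))

    # Repeatedly merge the two closest clusters until n_clusters remain.
    while len(clusters) > n_clusters:
        pairs = ((i, j) for i in range(len(clusters))
                 for j in range(i + 1, len(clusters)))
        a, b = min(pairs, key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]  # b > a, so index a is unaffected
    return clusters, dist


def representatives(names, seqs, n_clusters, k=3):
    """Return one medoid sequence name per cluster (assumed selection rule)."""
    profiles = [kmer_profile(s, k) for s in seqs]
    clusters, dist = cluster(profiles, n_clusters)
    reps = []
    for members in clusters:
        medoid = min(members, key=lambda i: sum(dist[i][j] for j in members))
        reps.append(names[medoid])
    return reps


if __name__ == "__main__":
    # Toy sequences, not real HPV genomes.
    names = ["a1", "a2", "b1", "b2"]
    seqs = ["ACACACACACAC", "ACACACACACAT", "GGGTTTGGGTTT", "GGGTTTGGGTTA"]
    print(representatives(names, seqs, n_clusters=2))
```

The naive clustering loop is meant only to make the steps visible and scales poorly. For the thousands of sequences mentioned in the abstract, an optimized clustering library would be the more practical choice.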

Comments

Reprints and Permissions

https://rdcu.be/dokTF

Recommended Citation

Trevino, V., Oyervides, M., Ramírez-Correa, G.A. et al. Generating human papillomavirus (HPV) reference databases to maximize genomic mapping. Arch Virol 167, 57–65 (2022). https://doi.org/10.1007/s00705-021-05256-y

Publication Title

Archives of Virology

DOI

10.1007/s00705-021-05256-y

Academic Level

faculty

Mentor/PI Department

Molecular Science

