Keywords
Haplotyping, Phasing, Trios, Genomics, Next-generation sequencing
Abstract
Background: When analyzing DNA sequence data of an individual, knowing which nucleotide was inherited from each parent can be benefcial when trying to identify certain types of DNA variants. Mendelian inheritance logic can be used to accurately phase (haplotype) the majority (67–83%) of an individual’s heterozygous nucleotide positions when genotypes are available for both parents (trio). However, when all members of a trio are heterozygous at a position, Mendelian inheritance logic cannot be used to phase. For such positions, a computational phasing algorithm can be used. Existing phasing algorithms use a haplotype reference panel, sequencing reads, and/or parental genotypes to phase an individual; however, they are limited in that they can only phase certain types of variants, require a specifc genotype build, require large amounts of storage capacity, and/or require long run times. We created trioPhaser to address these challenges
Results: trioPhaser uses gVCF fles from an individual and their parents as initial input, and then outputs a phased VCF fle. Input trio data are frst phased using Mendelian inheritance logic. Then, the positions that cannot be phased using inheritance information alone are phased by the SHAPEIT4 phasing algorithm. Using whole-genome sequencing data of 52 trios, we show that trioPhaser, on average, increases the total number of phased positions by 21.0% and 10.5%, respectively, when compared to the number of positions that SHAPEIT4 or Mendelian inheritance logic can phase when either is used alone. In addition, we show that the accuracy of the phased calls output by trioPhaser are similar to linked-read and read-backed phasing.
Conclusion: trioPhaser is a containerized software tool that uses both Mendelian inheritance logic and SHAPEIT4 to phase trios when gVCF fles are available. By implementing both phasing methods, more variant positions are phased compared to what either method is able to phase alone.
Original Publication Citation
Miller DB‡ and Piccolo SR* . trioPhaser: using Mendelian inheritance logic to improve genomic phasing of trios. BMC Bioinformatics, 22, Article number: 559 (2021)
BYU ScholarsArchive Citation
Miller, Dustin B. and Piccolo, Stephen R., "trioPhaser: Using Mendelian Inheritance Logic to Improve Genomic Phasing of Trios" (2021). Faculty Publications. 7482.
https://scholarsarchive.byu.edu/facpub/7482
Document Type
Peer-Reviewed Article
Publication Date
2021-11-22
Publisher
BioMed Central
Language
English
College
Life Sciences
Department
Biology
Copyright Use Information
https://lib.byu.edu/about/copyright/