domingo, 25 de septiembre de 2011

PLoS Genetics: Phased Whole-Genome Genetic Risk in a Family Quartet Using a Major Allele Reference Sequence

Phased Whole-Genome Genetic Risk in a Family Quartet Using a Major Allele Reference Sequence

An individual's genetic profile plays an important role in determining risk for disease and response to medical therapy. The development of technologies that facilitate rapid whole-genome sequencing will provide unprecedented power in the estimation of disease risk. Here we develop methods to characterize genetic determinants of disease risk and response to medical therapy in a nuclear family of four, leveraging population genetic profiles from recent large scale sequencing projects. We identify the way in which genetic information flows through the family to identify sequencing errors and inheritance patterns of genes contributing to disease risk. In doing so we identify genetic risk factors associated with an inherited predisposition to blood clot formation and response to blood thinning medications. We find that this aligns precisely with the most significant disease to occur to date in the family, namely pulmonary embolism, a blood clot in the lung. These ethnicity-specific, family-based approaches to interpretation of individual genetic profiles are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.

Frederick E. Dewey1, Rong Chen2, Sergio P. Cordero3, Kelly E. Ormond4,5, Colleen Caleshu1, Konrad J. Karczewski3,4, Michelle Whirl-Carrillo4, Matthew T. Wheeler1, Joel T. Dudley2,3, Jake K. Byrnes4, Omar E. Cornejo4, Joshua W. Knowles1, Mark Woon4, Katrin Sangkuhl4, Li Gong4, Caroline F. Thorn4, Joan M. Hebert4, Emidio Capriotti4, Sean P. David4, Aleksandra Pavlovic1, Anne West6, Joseph V. Thakuria7, Madeleine P. Ball8, Alexander W. Zaranek8, Heidi L. Rehm9, George M. Church8, John S. West10, Carlos D. Bustamante4, Michael Snyder4, Russ B. Altman4,11, Teri E. Klein4, Atul J. Butte2, Euan A. Ashley1*
1 Center for Inherited Cardiovascular Disease, Division of Cardiovascular Medicine, Stanford University, Stanford, California, United States of America, 2 Division of Systems Medicine, Department of Pediatrics, Stanford University School of Medicine, Stanford, California, United States of America, 3 Biomedical Informatics Graduate Training Program, Stanford University School of Medicine, Stanford, California, United States of America, 4 Department of Genetics, Stanford University School of Medicine, Stanford, California, United States of America, 5 Center for Biomedical Ethics, Stanford University, Stanford, California, United States of America, 6 Wellesley College, Wellesley, Massachusetts, United States of America, 7 Division of Genetics, Massachusetts General Hospital, Boston, Massachusetts, United States of America, 8 Department of Genetics, Harvard Medical School, Boston, Massachusetts, United States of America, 9 Department of Pathology, Harvard Medical School, Boston, Massachusetts, United States of America, 10 Personalis, Palo Alto, California, United States of America, 11 Department of Bioengineering, Stanford University, Stanford, California, United States of America

Abstract 

Whole-genome sequencing harbors unprecedented potential for characterization of individual and family genetic variation. Here, we develop a novel synthetic human reference sequence that is ethnically concordant and use it for the analysis of genomes from a nuclear family with history of familial thrombophilia. We demonstrate that the use of the major allele reference sequence results in improved genotype accuracy for disease-associated variant loci. We infer recombination sites to the lowest median resolution demonstrated to date (<1,000 base pairs). We use family inheritance state analysis to control sequencing error and inform family-wide haplotype phasing, allowing quantification of genome-wide compound heterozygosity. We develop a sequence-based methodology for Human Leukocyte Antigen typing that contributes to disease risk prediction. Finally, we advance methods for analysis of disease and pharmacogenomic risk across the coding and non-coding genome that incorporate phased variant data. We show these methods are capable of identifying multigenic risk for inherited thrombophilia and informing the appropriate pharmacological therapy. These ethnicity-specific, family-based approaches to interpretation of genetic variation are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.

Author Summary 

An individual's genetic profile plays an important role in determining risk for disease and response to medical therapy. The development of technologies that facilitate rapid whole-genome sequencing will provide unprecedented power in the estimation of disease risk. Here we develop methods to characterize genetic determinants of disease risk and response to medical therapy in a nuclear family of four, leveraging population genetic profiles from recent large scale sequencing projects. We identify the way in which genetic information flows through the family to identify sequencing errors and inheritance patterns of genes contributing to disease risk. In doing so we identify genetic risk factors associated with an inherited predisposition to blood clot formation and response to blood thinning medications. We find that this aligns precisely with the most significant disease to occur to date in the family, namely pulmonary embolism, a blood clot in the lung. These ethnicity-specific, family-based approaches to interpretation of individual genetic profiles are emblematic of the next generation of genetic risk assessment using whole-genome sequencing.

full-text:
PLoS Genetics: Phased Whole-Genome Genetic Risk in a Family Quartet Using a Major Allele Reference Sequence

No hay comentarios:

Publicar un comentario