A public resource facilitating clinical use of genomes
- Madeleine P. Balla,1,
- Joseph V. Thakuriaa,b,c,1,
- Alexander Wait Zaraneka,c,1,
- Tom Cleggc,
- Abraham M. Rosenbauma,d,
- Xiaodi Wua,e,
- Misha Angristf,
- Jong Bhakg,h,
- Jason Bobei,
- Matthew J. Callowj,
- Carlos Canok,
- Michael F. Choua,
- Wendy K. Chungl,
- Shawn M. Douglasa,
- Preston W. Estepi,m,
- Athurva Goren,
- Peter Hulicko,
- Alberto Labargak,
- Je-Hyuk Leea,
- Jeantine E. Lunshofp,q,
- Byung Chul Kimh,
- Jong-Il Kimr,s,
- Zhe Lin,
- Michael F. Murrayt,
- Geoffrey B. Nilsenj,
- Brock A. Petersj,
- Anugraha M. Ramana,
- Hugh Y. Rienhoffu,
- Kimberly Robaskya,v,
- Matthew T. Wheelerw,
- Ward Vandewegec,
- Daniel B. Vorhausx,
- Joyce L. Yanga,
- Luhan Yanga,
- John Aacha,
- Euan A. Ashleyw,y,
- Radoje Drmanacj,
- Seong-Jin Kimz,
- Jin Billy Lia,aa,
- Leonid Peshkinbb,
- Christine E. Seidmancc,
- Jeong-Sun Seor,dd,
- Kun Zhangn,
- Heidi L. Rehmee, and
- George M. Churcha,2
+ Author Affiliations
aDepartment of Genetics, Harvard Medical School, Boston, MA 02115;
bDivision of Medical Genetics, Massachusetts General Hospital, Boston, MA 02114;
cClinical Future Inc., Cambridge, MA 02142;
dIon Torrent by Life Technologies, Guilford, CT 06437;
eDepartment of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO 63110;
fDuke University Institute for Genome Sciences and Policy, Durham, NC 27708-0141;
gTheragen BiO Institute, TheragenEtex Inc., Suwon, 443-270, Korea;
hGenomics Department, Personal Genomics Institute, Suwon 443-766, Korea;
iPersonalGenomes.org, Boston, MA 02215;
jComplete Genomics, Inc., Mountain View, CA 94043;
kDepartment of Computer Science and A.I., University of Granada, 18071 Granada, Spain;
lDepartments of Pediatrics and Medicine, Columbia University, New York, NY 10032;
mTeloMe, Inc., Waltham, MA 02451;
nDepartment of Bioengineering, University of California at San Diego, La Jolla, CA 92093;
oDivision of Genetics, NorthShore University HealthSystem, Evanston, IL 60201;
pFaculty of Earth and Life Sciences, Department of Molecular Cell Physiology, VU University Amsterdam, 1081 HV Amsterdam, The Netherlands;
qFaculty of Health, Medicine and Life Sciences, Maastricht University, 6200 MD Maastricht, The Netherlands;
rGenomic Medicine Institute, Medical Research Center, College of Medicine, Seoul National University, Seoul, Korea;
sPsoma Therapeutics Inc., Gasan-dong, Kumchun-gu, Seoul 153-781, Korea;
tDivision of Genetics, Brigham and Women’s Hospital, Boston, MA 02115;
uwww.MyDaughtersDNA.org, San Carlos, California 94070;
vBioinformatics Program, Boston University, Boston, MA 02215;
wStanford Center for Inherited Cardiovascular Disease, Stanford University School of Medicine, Stanford, CA 94305;
xRobinson Bradshaw & Hinson, P.A., Chapel Hill, NC 27517;
yPersonalis, Inc., Palo Alto, CA 94301;
zCha Cancer Institute, Cha University of Medicine and Science, Seoul 135-081, Korea;
aaDepartment of Genetics, Stanford University, Stanford, CA 94305;
bbDepartment of Systems Biology, Harvard Medical School, Boston, MA 02115;
ccDepartment of Genetics, Harvard Medical School and Howard Hughes Medical Institute, Boston, MA 02115;
ddMacrogen, Seoul, Korea; and
eeDepartment of Pathology, Harvard Medical School, Boston, MA 02115
Edited by C. Thomas Caskey, Baylor College of Medicine, Houston, TX, and approved June 11, 2012 (received for review February 1, 2012)
Rapid advances in DNA sequencing promise to enable new diagnostics and individualized therapies. Achieving personalized medicine, however, will require extensive research on highly reidentifiable, integrated datasets of genomic and health information. To assist with this, participants in the Personal Genome Project choose to forgo privacy via our institutional review board- approved “open consent” process. The contribution of public data and samples facilitates both scientific discovery and standardization of methods. We present our findings after enrollment of more than 1,800 participants, including whole-genome sequencing of 10 pilot participant genomes (the PGP-10). We introduce the Genome-Environment-Trait Evidence (GET-Evidence) system. This tool automatically processes genomes and prioritizes both published and novel variants for interpretation. In the process of reviewing the presumed healthy PGP-10 genomes, we find numerous literature references implying serious disease. Although it is sometimes impossible to rule out a late-onset effect, stringent evidence requirements can address the high rate of incidental findings. To that end we develop a peer production system for recording and organizing variant evaluations according to standard evidence guidelines, creating a public forum for reaching consensus on interpretation of clinically relevant variants. Genome analysis becomes a two-step process: using a prioritized list to record variant evaluations, then automatically sorting reviewed variants using these annotations. Genome data, health and trait information, participant samples, and variant interpretations are all shared in the public domain—we invite others to review our results using our participant samples and contribute to our interpretations. We offer our public resource and methods to further personalized medical research.
↵1M.P.B., J.V.T., and A.W.Z. contributed equally to this work.
- ↵2To whom correspondence should be addressed. E-mail: firstname.lastname@example.org.
Author contributions: M.P.B., J.V.T., A.W.Z., M.A., J. Bobe, M.F.C., S.M.D., P.W.E., J.E.L., D.B.V., H.L.R., and G.M.C. designed research; M.P.B., J.V.T., A.W.Z., T.C., A. M. Rosenbaum, X.W., W.K.C., P.W.E., A. M. Raman, K.R., C.E.S., and H.L.R. performed research; M.P.B., J.V.T., A.W.Z., T.C., A. M. Rosenbaum, X.W., J. Bhak, C.C., A.G., A.L., J.-H.L., B.C.K., Z.L., A. M. Raman, W.V., J.L.Y., L.Y., S.-J.K., J.B.L., L.P., and K.Z. contributed new reagents/analytic tools; M.P.B., J.V.T., A.W.Z., T.C., A. M. Rosenbaum, X.W., M.J.C., P.H., J.-I.K., M.F.M., G.B.N., B.A.P., H.Y.R., K.R., M.T.W., W.V., J.A., E.A.A., R.D., and J.-S.S. analyzed data; and M.P.B., J.V.T., A.W.Z., and G.M.C. wrote the paper.
Conflict of interest statement: G.M.C. has advisory roles in and research sponsorships from several companies involved in genome sequencing technology and personal genomics (http://arep.med.harvard.edu/gmc/tech.html).
This article is a PNAS Direct Submission.
This article is part of the special series of Inaugural Articles by members of the National Academy of Sciences elected in 2011.
See Profile on page 11893.
Data deposition: The sequences reported in this paper are made available through the Personal Genome Project (http://www.personalgenomes.org/data/PGP12.05/).
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1201904109/-/DCSupplemental.
Freely available online through the PNAS open access option.