Analysis of Somatic Mutations in Longitudinal Whole-genome Sequencing Data

Grants and Contracts Details


Abstract While recent efforts have focused on better understanding the germline mutation rate of different types of DNA variants, the dynamics of somatic mutations in the human genome are poorly understood. Somatic mutations are a well-known cause of cancer and may play a causative role in other diseases and aging. Analyses of somatic mutations have generally been limited to single-cell or low-coverage studies. To accurately assess the rate of somatic mutations over time would require a longitudinal, high coverage dataset. For ~450 individuals in the Utah CEPH (Centre d''Etude du Polymorphism Humain) pedigrees, blood was drawn at two timepoints approximately 15 years apart. This proposal describes experiments to determine the rate of somatic mutations throughout the human genome by utilizing high-coverage (720x) whole- genome sequencing data at both of these timepoints in a subset of CEPH individuals. The genomes included in this study will be used to determine the rate of somatic mutations and improve our understanding of differences in this rate between individuals. Building on previous work performed at the University of Utah, we will determine the relationship between the germline and somatic mutation rate. Further, using detailed phenotypic data collected for these individuals, experiments will be performed to determine how this rate is correlated with disease outcomes such as cancer and cardiovascular disease. From this work, a series of filtering algorithms will be produced to allow for comprehensive analysis of somatic mutations occurring between two timepoints in the same individual. This dataset provides me with a unique opportunity to study somatic mutations over time using high-quality data. This proposal builds upon my experience in analyzing mobile element-mediated somatic mutations in longitudinal whole-genome data of cancer patients, and my recent work analyzing germline short tandem repeat mutations in the CEPH pedigrees.
Effective start/end date7/1/236/30/26


  • National Human Genome Research Institute: $249,000.00


Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.