Abstract
Background: Analysis of fungal genome sequence assemblies reveals that telomeres are poorly represented even though telomeric reads tend to be superabundant. We surmised that the problem might lie in the DNA shearing conditions used to create clone libraries for genome sequencing.Results: A shotgun strategy was used to sequence and assemble circular and linear cosmid DNAs sheared using conditions typical for a genome project. The DNA sheared in circular form assembled into a single sequence contig. However, the linearized cosmid produced an incomplete assembly because the two DNA termini, though greatly overrepresented in the clone library used for sequencing, were separated from neighboring sequences by gaps of ~1.4 and 1.8 kb. These gap sizes were reduced, but not eliminated, by shearing the linear cosmid into smaller fragments. Mapping of shearing breakpoints revealed a paucity of breaks in the subterminal regions of the linearized cosmid and also near chromosome ends of the fungus Neurospora crassa.Conclusion: Together, our data indicate that the ends of linear DNA molecules are recalcitrant to hydrodynamic shearing. We propose that this causes DNA termini to be overrepresented in the resulting fragment population but ultimately prevents their incorporation into sequence assemblies.
Original language | English |
---|---|
Article number | 87 |
Journal | BMC Genomics |
Volume | 11 |
Issue number | 1 |
DOIs | |
State | Published - Feb 2 2010 |
Bibliographical note
Funding Information:The authors would like to thank Dr. David Thornbury and Brian King for technical assistance, and Chris Schardl for helpful discussions. This work was supported by National Science Foundation grant MCB-0135462 and a United States Department of Agriculture special grant, USDA-CREES 2001-34457-10343. This is Kentucky Agricultural Experiment Station publication no. 09-12-110 and is published with the permission of the director.
ASJC Scopus subject areas
- Biotechnology
- Genetics