Construction and evaluation of a new rat reference genome assembly, GRCr8, from long reads and long-range scaffolding

Kai Li, Melissa L. Smith, J. Chris Blazier, Kelli J. Kochan, Jonathan M.D. Wood, Kerstin Howe, Anne E. Kwitek, Melinda R. Dwinell, Hao Chen, Julia L. Ciosek, Patrick Masterson, Terence D. Murphy, Theodore S. Kalbfleisch, Peter A. Doris

Research output: Contribution to journalArticlepeer-review

1 Citation (SciVal)

Abstract

We report the construction and analysis of a new reference genome assembly for Rattus norvegicus, the laboratory rat, a widely used experimental animal model organism. The assembly has been adopted as the rat reference assembly by the Genome Reference Consortium and is named GRCr8. The assembly has employed 40× Pacific Biosciences (PacBio) HiFi sequencing coverage and scaffolding using optical mapping and Hi-C. We used genomic DNA from a male BN/NHsdMcwi (BN) rat of the same strain and from the same colony as the prior reference assembly, mRatBN7.2. The assembly is at chromosome level with 98.7% of the sequence assigned to chromosomes. All chromosomes have increased in size compared with the prior assembly and k-mer analysis indicates that the subject animal is fully inbred and that the genome is represented as a single haploid assembly. Notable increases are observed in Chromosomes 3, 11, and 12 in the prospective rDNA regions. In addition, Chr Y has increased threefold in size and is more consistent with the rat karyotype than previous assemblies. Several other chromosomes have grown by the incorporation of sizable discrete new blocks. These contain highly repetitive sequences and encode numerous previously unannotated genes. In addition, centromeric sequences are incorporated in most chromosomes. Genome annotation has been performed by NCBI RefSeq, which confirms improvement in assembly quality and adds more than 1100 new protein coding genes. PacBio Iso-Seq data have been acquired from multiple tissues of the subject animal and are released concurrently with the new assembly to aid further analyses.

Original languageEnglish
Pages (from-to)2081-2093
Number of pages13
JournalGenome Research
Volume34
Issue number11
DOIs
StatePublished - Nov 2024

Bibliographical note

Publisher Copyright:
© 2024 Li et al.

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Fingerprint

Dive into the research topics of 'Construction and evaluation of a new rat reference genome assembly, GRCr8, from long reads and long-range scaffolding'. Together they form a unique fingerprint.

Cite this