Cytosine deamination and selection of CpG suppressed clones are the two major independent biological forces that shape codon usage bias in coronaviruses

Patrick C.Y. Woo, Beatrice H.L. Wong, Yi Huang, Susanna K.P. Lau, Kwok Yung Yuen

Research output: Contribution to journalArticlepeer-review

70 Citations (Scopus)

Abstract

Using the complete genome sequences of 19 coronavirus genomes, we analyzed the codon usage bias, dinucleotide relative abundance and cytosine deamination in coronavirus genomes. Of the eight codons that contain CpG, six were markedly suppressed. The mean NNU/NNC ratio of the six amino acids using either NNC or NNU as codon is 3.262, suggesting cytosine deamination. Among the 16 dinucleotides, CpG was most markedly suppressed (mean relative abundance 0.509). No correlation was observed between CpG abundance and mean NNU/NNC ratio. Among the 19 coronaviruses, CoV-HKU1 showed the most extreme codon usage bias and extremely high NNU/NNC ratio of 8.835. Cytosine deamination and selection of CpG suppressed clones by the immune system are the two major independent biochemical and biological selective forces that shape codon usage bias in coronavirus genomes. The underlying mechanism for the extreme codon usage bias, cytosine deamination and G + C content in CoV-HKU1 warrants further studies.

Original languageEnglish
Pages (from-to)431-442
Number of pages12
JournalVirology
Volume369
Issue number2
DOIs
Publication statusPublished - Dec 20 2007
Externally publishedYes

ASJC Scopus Subject Areas

  • Virology

Keywords

  • Codon usage bias
  • Coronavirus
  • CpG suppression
  • Cytosine deamination

Cite this