Repository logo
 

Mapping overlapping functional elements embedded within the protein-coding regions of RNA viruses.


Change log

Authors

Firth, Andrew E 

Abstract

Identification of the full complement of genes and other functional elements in any virus is crucial to fully understand its molecular biology and guide the development of effective control strategies. RNA viruses have compact multifunctional genomes that frequently contain overlapping genes and non-coding functional elements embedded within protein-coding sequences. Overlapping features often escape detection because it can be difficult to disentangle the multiple roles of the constituent nucleotides via mutational analyses, while high-throughput experimental techniques are often unable to distinguish functional elements from incidental features. However, RNA viruses evolve very rapidly so that, even within a single species, substitutions rapidly accumulate at neutral or near-neutral sites providing great potential for comparative genomics to distinguish the signature of purifying selection. Computationally identified features can then be efficiently targeted for experimental analysis. Here we analyze alignments of protein-coding virus sequences to identify regions where there is a statistically significant reduction in the degree of variability at synonymous sites, a characteristic signature of overlapping functional elements. Having previously tested this technique by experimental verification of discoveries in selected viruses, we now analyze sequence alignments for ∼700 RNA virus species to identify hundreds of such regions, many of which have not been previously described.

Description

Keywords

Algorithms, Codon, Genetic Variation, Genome, Viral, Phylogeny, RNA Viruses, Recombination, Genetic, Sequence Alignment, Viral Proteins

Journal Title

Nucleic Acids Res

Conference Name

Journal ISSN

0305-1048
1362-4962

Volume Title

42

Publisher

Oxford University Press (OUP)
Sponsorship
Biotechnology and Biological Sciences Research Council (BB/J007072/1)
Biotechnology and Biological Sciences Research Council (BB/J015652/1)
Wellcome Trust (088789/Z/09/Z)
This work was supported by the Wellcome Trust [grant number 088789] and the U.K. Biotechnology and Biological Sciences Research Council [grant numbers BB/J007072/1 and BB/J015652/1].