Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Cross-species analysis of genic GC3 content and DNA methylation patterns

Tatarinova, Tatiana ; Elhaik, Eran LU orcid and Pellegrini, Matteo (2013) In Genome Biology and Evolution 5(8). p.56-1443
Abstract

The GC content in the third codon position (GC(3)) exhibits a unimodal distribution in many plant and animal genomes. Interestingly, grasses and homeotherm vertebrates exhibit a unique bimodal distribution. High GC(3) was previously found to be associated with variable expression, higher frequency of upstream TATA boxes, and an increase of GC(3) from 5' to 3'. Moreover, GC(3)-rich genes are predominant in certain gene classes and are enriched in CpG dinucleotides that are potential targets for methylation. Based on the GC(3) bimodal distribution we hypothesize that GC(3) has a regulatory role involving methylation and gene expression. To test that hypothesis, we selected diverse taxa (rice, thale cress, bee, and human) that varied in... (More)

The GC content in the third codon position (GC(3)) exhibits a unimodal distribution in many plant and animal genomes. Interestingly, grasses and homeotherm vertebrates exhibit a unique bimodal distribution. High GC(3) was previously found to be associated with variable expression, higher frequency of upstream TATA boxes, and an increase of GC(3) from 5' to 3'. Moreover, GC(3)-rich genes are predominant in certain gene classes and are enriched in CpG dinucleotides that are potential targets for methylation. Based on the GC(3) bimodal distribution we hypothesize that GC(3) has a regulatory role involving methylation and gene expression. To test that hypothesis, we selected diverse taxa (rice, thale cress, bee, and human) that varied in the modality of their GC(3) distribution and tested the association between GC(3), DNA methylation, and gene expression. We examine the relationship between cytosine methylation levels and GC(3), gene expression, genome signature, gene length, and other gene compositional features. We find a strong negative correlation (Pearson's correlation coefficient r = -0.67, P value < 0.0001) between GC(3) and genic CpG methylation. The comparison between 5'-3' gradients of CG(3)-skew and genic methylation for the taxa in the study suggests interplay between gene-body methylation and transcription-coupled cytosine deamination effect. Compositional features are correlated with methylation levels of genes in rice, thale cress, human, bee, and fruit fly (which acts as an unmethylated control). These patterns allow us to generate evolutionary hypotheses about the relationships between GC(3) and methylation and how these affect expression patterns. Specifically, we propose that the opposite effects of methylation and compositional gradients along coding regions of GC(3)-poor and GC(3)-rich genes are the products of several competing processes.

(Less)
Please use this url to cite or link to this publication:
author
; and
publishing date
type
Contribution to journal
publication status
published
keywords
Animals, Arabidopsis/genetics, Base Composition/genetics, Bees/genetics, DNA Methylation, Drosophila melanogaster/genetics, Evolution, Molecular, Gene Expression, Genes, Plant, Humans, Models, Genetic, Oryza/genetics, Plants/genetics, Species Specificity
in
Genome Biology and Evolution
volume
5
issue
8
pages
56 - 1443
publisher
Oxford University Press
external identifiers
  • scopus:84891655850
  • pmid:23833164
ISSN
1759-6653
DOI
10.1093/gbe/evt103
language
English
LU publication?
no
id
5ec0d660-0550-46a9-b9c3-0252fbcc4afb
date added to LUP
2019-11-10 16:57:25
date last changed
2024-09-18 13:35:13
@article{5ec0d660-0550-46a9-b9c3-0252fbcc4afb,
  abstract     = {{<p>The GC content in the third codon position (GC(3)) exhibits a unimodal distribution in many plant and animal genomes. Interestingly, grasses and homeotherm vertebrates exhibit a unique bimodal distribution. High GC(3) was previously found to be associated with variable expression, higher frequency of upstream TATA boxes, and an increase of GC(3) from 5' to 3'. Moreover, GC(3)-rich genes are predominant in certain gene classes and are enriched in CpG dinucleotides that are potential targets for methylation. Based on the GC(3) bimodal distribution we hypothesize that GC(3) has a regulatory role involving methylation and gene expression. To test that hypothesis, we selected diverse taxa (rice, thale cress, bee, and human) that varied in the modality of their GC(3) distribution and tested the association between GC(3), DNA methylation, and gene expression. We examine the relationship between cytosine methylation levels and GC(3), gene expression, genome signature, gene length, and other gene compositional features. We find a strong negative correlation (Pearson's correlation coefficient r = -0.67, P value &lt; 0.0001) between GC(3) and genic CpG methylation. The comparison between 5'-3' gradients of CG(3)-skew and genic methylation for the taxa in the study suggests interplay between gene-body methylation and transcription-coupled cytosine deamination effect. Compositional features are correlated with methylation levels of genes in rice, thale cress, human, bee, and fruit fly (which acts as an unmethylated control). These patterns allow us to generate evolutionary hypotheses about the relationships between GC(3) and methylation and how these affect expression patterns. Specifically, we propose that the opposite effects of methylation and compositional gradients along coding regions of GC(3)-poor and GC(3)-rich genes are the products of several competing processes.</p>}},
  author       = {{Tatarinova, Tatiana and Elhaik, Eran and Pellegrini, Matteo}},
  issn         = {{1759-6653}},
  keywords     = {{Animals; Arabidopsis/genetics; Base Composition/genetics; Bees/genetics; DNA Methylation; Drosophila melanogaster/genetics; Evolution, Molecular; Gene Expression; Genes, Plant; Humans; Models, Genetic; Oryza/genetics; Plants/genetics; Species Specificity}},
  language     = {{eng}},
  number       = {{8}},
  pages        = {{56--1443}},
  publisher    = {{Oxford University Press}},
  series       = {{Genome Biology and Evolution}},
  title        = {{Cross-species analysis of genic GC3 content and DNA methylation patterns}},
  url          = {{http://dx.doi.org/10.1093/gbe/evt103}},
  doi          = {{10.1093/gbe/evt103}},
  volume       = {{5}},
  year         = {{2013}},
}