Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Gene expression and nucleotide composition are associated with genic methylation level in Oryza sativa

Elhaik, Eran LU orcid ; Pellegrini, Matteo and Tatarinova, Tatiana V (2014) In BMC Bioinformatics 15.
Abstract

BACKGROUND: The methylation of cytosines at CpG dinucleotides, which plays an important role in gene expression regulation, is one of the most studied epigenetic modifications. Thus far, the detection of DNA methylation has been determined mostly by experimental methods, which are not only prone to bench effects and artifacts but are also time-consuming, expensive, and cannot be easily scaled up to many samples. It is therefore useful to develop computational prediction methods for DNA methylation. Our previous studies highlighted the existence of correlations between the GC content of the third codon position (GC₃), methylation, and gene expression. We thus designed a model to predict methylation in Oryza sativa based on genomic... (More)

BACKGROUND: The methylation of cytosines at CpG dinucleotides, which plays an important role in gene expression regulation, is one of the most studied epigenetic modifications. Thus far, the detection of DNA methylation has been determined mostly by experimental methods, which are not only prone to bench effects and artifacts but are also time-consuming, expensive, and cannot be easily scaled up to many samples. It is therefore useful to develop computational prediction methods for DNA methylation. Our previous studies highlighted the existence of correlations between the GC content of the third codon position (GC₃), methylation, and gene expression. We thus designed a model to predict methylation in Oryza sativa based on genomic sequence features and gene expression data.

RESULTS: We first derive equations to describe the relationship between gene methylation levels, GC₃, expression, length, and other gene compositional features. We next assess gene compositional features involving sixmers and their association with methylation levels and other gene level properties. By applying our sixmer-based approach on rice gene expression data we show that it can accurately predict methylation (Pearson's correlation coefficient r = 0.79) for the majority (79%) of the genes. Matlab code with our model is included.

CONCLUSIONS: Gene expression variation can be used as predictors of gene methylation levels.

(Less)
Please use this url to cite or link to this publication:
author
; and
publishing date
type
Contribution to journal
publication status
published
keywords
Base Composition/genetics, Codon/genetics, Computational Biology, Cytosine/metabolism, DNA Methylation, Gene Expression Regulation, Plant, Humans, Oryza/genetics, Reproducibility of Results, Sequence Analysis, DNA
in
BMC Bioinformatics
volume
15
article number
23
pages
7 pages
publisher
BioMed Central (BMC)
external identifiers
  • pmid:24447369
  • scopus:84892621417
ISSN
1471-2105
DOI
10.1186/1471-2105-15-23
language
English
LU publication?
no
id
f3283567-b845-4755-afe2-88d782e427be
date added to LUP
2019-11-10 16:57:41
date last changed
2024-09-19 12:50:52
@article{f3283567-b845-4755-afe2-88d782e427be,
  abstract     = {{<p>BACKGROUND: The methylation of cytosines at CpG dinucleotides, which plays an important role in gene expression regulation, is one of the most studied epigenetic modifications. Thus far, the detection of DNA methylation has been determined mostly by experimental methods, which are not only prone to bench effects and artifacts but are also time-consuming, expensive, and cannot be easily scaled up to many samples. It is therefore useful to develop computational prediction methods for DNA methylation. Our previous studies highlighted the existence of correlations between the GC content of the third codon position (GC₃), methylation, and gene expression. We thus designed a model to predict methylation in Oryza sativa based on genomic sequence features and gene expression data.</p><p>RESULTS: We first derive equations to describe the relationship between gene methylation levels, GC₃, expression, length, and other gene compositional features. We next assess gene compositional features involving sixmers and their association with methylation levels and other gene level properties. By applying our sixmer-based approach on rice gene expression data we show that it can accurately predict methylation (Pearson's correlation coefficient r = 0.79) for the majority (79%) of the genes. Matlab code with our model is included.</p><p>CONCLUSIONS: Gene expression variation can be used as predictors of gene methylation levels.</p>}},
  author       = {{Elhaik, Eran and Pellegrini, Matteo and Tatarinova, Tatiana V}},
  issn         = {{1471-2105}},
  keywords     = {{Base Composition/genetics; Codon/genetics; Computational Biology; Cytosine/metabolism; DNA Methylation; Gene Expression Regulation, Plant; Humans; Oryza/genetics; Reproducibility of Results; Sequence Analysis, DNA}},
  language     = {{eng}},
  month        = {{01}},
  publisher    = {{BioMed Central (BMC)}},
  series       = {{BMC Bioinformatics}},
  title        = {{Gene expression and nucleotide composition are associated with genic methylation level in Oryza sativa}},
  url          = {{http://dx.doi.org/10.1186/1471-2105-15-23}},
  doi          = {{10.1186/1471-2105-15-23}},
  volume       = {{15}},
  year         = {{2014}},
}