Statistical methods for identifying conserved residues in multiple sequence alignment.

Ahola, Virpi; Aittokallio, Tero; Uusipaikka, Esa; Vihinen, Mauno

Statistical methods for identifying conserved residues in multiple sequence alignment.

Mark

Ahola, Virpi ; Aittokallio, Tero ; Uusipaikka, Esa and Vihinen, Mauno ^LU

(2004) In Statistical Applications in Genetics and Molecular Biology 3. p.28-28

Abstract: The assessment of residue conservation in a multiple sequence alignment is a central issue in bioinformatics. Conserved residues and regions are used to determine structural and functional motifs or evolutionary relationships between the sequences of a multiple sequence alignment. For this reason, residue conservation is a valuable measure for database and motif search or for estimating the quality of alignments. In this paper, we present statistical methods for identifying conserved residues in multiple sequence alignments. While most earlier studies examine the positional conservation of the alignment, we focus on the detection of individual conserved residues at a position. The major advantages of multiple comparison methods originate... (More); The assessment of residue conservation in a multiple sequence alignment is a central issue in bioinformatics. Conserved residues and regions are used to determine structural and functional motifs or evolutionary relationships between the sequences of a multiple sequence alignment. For this reason, residue conservation is a valuable measure for database and motif search or for estimating the quality of alignments. In this paper, we present statistical methods for identifying conserved residues in multiple sequence alignments. While most earlier studies examine the positional conservation of the alignment, we focus on the detection of individual conserved residues at a position. The major advantages of multiple comparison methods originate from their ability to select conserved residues simultaneously and to consider the variability of the residue estimates. Large-scale simulations were used for the comparative analysis of the methods. Practical performance was studied by comparing the structurally and functionally important residues of Src homology 2 (SH2) domains to the assignments of the conservation indices. The applicability of the indices was also compared in three additional protein families comprising different degrees of entropy and variability in alignment positions. The results indicate that statistical multiple comparison methods are sensitive and reliable in identifying conserved residues. (Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/3635364

author

Ahola, Virpi ; Aittokallio, Tero ; Uusipaikka, Esa and Vihinen, Mauno ^LU

publishing date

2004

type

Contribution to journal

publication status

published

subject

Medical Genetics

in

Statistical Applications in Genetics and Molecular Biology

volume

3

pages

28 - 28

publisher

Berkeley Electronic Press

external identifiers

pmid:16646807
scopus:18544390359

ISSN

2194-6302

DOI

10.2202/1544-6115.1074

language

English

LU publication?

no

id

7ffbeb8b-a721-4337-9f23-93c531caa4cd (old id 3635364)

alternative location

http://www.ncbi.nlm.nih.gov/pubmed/16646807?dopt=Abstract

date added to LUP

2016-04-04 07:23:37

date last changed

2022-04-07 22:41:09

@article{7ffbeb8b-a721-4337-9f23-93c531caa4cd,
  abstract     = {{The assessment of residue conservation in a multiple sequence alignment is a central issue in bioinformatics. Conserved residues and regions are used to determine structural and functional motifs or evolutionary relationships between the sequences of a multiple sequence alignment. For this reason, residue conservation is a valuable measure for database and motif search or for estimating the quality of alignments. In this paper, we present statistical methods for identifying conserved residues in multiple sequence alignments. While most earlier studies examine the positional conservation of the alignment, we focus on the detection of individual conserved residues at a position. The major advantages of multiple comparison methods originate from their ability to select conserved residues simultaneously and to consider the variability of the residue estimates. Large-scale simulations were used for the comparative analysis of the methods. Practical performance was studied by comparing the structurally and functionally important residues of Src homology 2 (SH2) domains to the assignments of the conservation indices. The applicability of the indices was also compared in three additional protein families comprising different degrees of entropy and variability in alignment positions. The results indicate that statistical multiple comparison methods are sensitive and reliable in identifying conserved residues.}},
  author       = {{Ahola, Virpi and Aittokallio, Tero and Uusipaikka, Esa and Vihinen, Mauno}},
  issn         = {{2194-6302}},
  language     = {{eng}},
  pages        = {{28--28}},
  publisher    = {{Berkeley Electronic Press}},
  series       = {{Statistical Applications in Genetics and Molecular Biology}},
  title        = {{Statistical methods for identifying conserved residues in multiple sequence alignment.}},
  url          = {{http://dx.doi.org/10.2202/1544-6115.1074}},
  doi          = {{10.2202/1544-6115.1074}},
  volume       = {{3}},
  year         = {{2004}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Statistical methods for identifying conserved residues in multiple sequence alignment.