New methods for protein structure threading and domain identification

Lindstam, Mats

New methods for protein structure threading and domain identification

Mark

Lindstam, Mats ^LU (2005)

Abstract: The availability of three-dimensional structures of proteins has provided enormous insight into the detailed molecular machinery of the living cell. Since the late 1950s and early 1960s, experimental methods for deriving protein structures have contributed more than 30,000 entries to the Protein Data Bank, representing the collected knowledge of protein and other macromolecular structures. However, sequencing techniques developed since the 1970s have identified numbers of protein sequences that far exceed the current number of experimentally solved structures. Providing alternative methods for deducing structural information of proteins has been one of the goals in the field of structural bioinformatics, in an effort to relieve the... (More); The availability of three-dimensional structures of proteins has provided enormous insight into the detailed molecular machinery of the living cell. Since the late 1950s and early 1960s, experimental methods for deriving protein structures have contributed more than 30,000 entries to the Protein Data Bank, representing the collected knowledge of protein and other macromolecular structures. However, sequencing techniques developed since the 1970s have identified numbers of protein sequences that far exceed the current number of experimentally solved structures. Providing alternative methods for deducing structural information of proteins has been one of the goals in the field of structural bioinformatics, in an effort to relieve the pressure put upon experimentalists by using computational methods for protein structure prediction, functional annotation and classification. We have addressed some of these issues and present new methods used primarily for protein structure prediction via homology modelling.

Current protein databases classify protein domains into groups based on similar structural features, but this work is manually intense and is not able to keep up with rapid growth in available structural data. Therefore, automated algorithms used to consistently define and systematically identify protein domains have been of increasing importance. Proteins domains are often preferred as templates over intact multi-domain proteins in fold recognition and in homology modelling calculations. A single protein can consist of several domains that may appear in a unique combination compared to proteins in the structure database, therefore sequence-based searches against individual domains more likely to be successful. Therefore we have developed a method of identifying protein domains DOMID.

We have approached the structure modelling challenge by deriving a method to perform multiple-template sequence to structure alignment for proteins which is particularly effective for target proteins with low sequence identify to known structures using conserved structural features and sequences similarities. (Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/544964

author

Lindstam, Mats ^LU

supervisor

Anders Liljas ^LU

opponent

Associate Professor Ph.D. Nielsen, Morten, BioCentrum-DTU, Technical University of Denmark

organization

Biochemistry and Structural Biology

publishing date

2005

type

Thesis

publication status

published

subject

Biological Sciences

keywords

biomatematik, Molecular biophysics, Molekylär biofysik, medicinsk informatik, Bioinformatik, biomathematics biometrics, medical informatics, structure prediction, Bioinformatics, domain identification, Structural bioinformatics, threading

pages

145 pages

publisher

Molecular Biophysics, Lund University

defense location

Kemicentrum, sölvegatan 39, Lund Lecture room C

defense date

2005-06-01 13:15:00

ISBN

91-7422-083-7

language

English

LU publication?

yes

id

194ea937-d2b1-4725-bb17-954383f9215d (old id 544964)

date added to LUP

2016-04-04 10:15:40

date last changed

2025-04-04 13:54:22

@phdthesis{194ea937-d2b1-4725-bb17-954383f9215d,
  abstract     = {{The availability of three-dimensional structures of proteins has provided enormous insight into the detailed molecular machinery of the living cell. Since the late 1950s and early 1960s, experimental methods for deriving protein structures have contributed more than 30,000 entries to the Protein Data Bank, representing the collected knowledge of protein and other macromolecular structures. However, sequencing techniques developed since the 1970s have identified numbers of protein sequences that far exceed the current number of experimentally solved structures. Providing alternative methods for deducing structural information of proteins has been one of the goals in the field of structural bioinformatics, in an effort to relieve the pressure put upon experimentalists by using computational methods for protein structure prediction, functional annotation and classification. We have addressed some of these issues and present new methods used primarily for protein structure prediction via homology modelling.<br/><br>
<br/><br>
Current protein databases classify protein domains into groups based on similar structural features, but this work is manually intense and is not able to keep up with rapid growth in available structural data. Therefore, automated algorithms used to consistently define and systematically identify protein domains have been of increasing importance. Proteins domains are often preferred as templates over intact multi-domain proteins in fold recognition and in homology modelling calculations. A single protein can consist of several domains that may appear in a unique combination compared to proteins in the structure database, therefore sequence-based searches against individual domains more likely to be successful. Therefore we have developed a method of identifying protein domains DOMID.<br/><br>
<br/><br>
We have approached the structure modelling challenge by deriving a method to perform multiple-template sequence to structure alignment for proteins which is particularly effective for target proteins with low sequence identify to known structures using conserved structural features and sequences similarities.}},
  author       = {{Lindstam, Mats}},
  isbn         = {{91-7422-083-7}},
  keywords     = {{biomatematik; Molecular biophysics; Molekylär biofysik; medicinsk informatik; Bioinformatik; biomathematics biometrics; medical informatics; structure prediction; Bioinformatics; domain identification; Structural bioinformatics; threading}},
  language     = {{eng}},
  publisher    = {{Molecular Biophysics, Lund University}},
  school       = {{Lund University}},
  title        = {{New methods for protein structure threading and domain identification}},
  year         = {{2005}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

New methods for protein structure threading and domain identification