New methods for protein structure threading and domain identification
(2005)- Abstract
- The availability of three-dimensional structures of proteins has provided enormous insight into the detailed molecular machinery of the living cell. Since the late 1950s and early 1960s, experimental methods for deriving protein structures have contributed more than 30,000 entries to the Protein Data Bank, representing the collected knowledge of protein and other macromolecular structures. However, sequencing techniques developed since the 1970s have identified numbers of protein sequences that far exceed the current number of experimentally solved structures. Providing alternative methods for deducing structural information of proteins has been one of the goals in the field of structural bioinformatics, in an effort to relieve the... (More)
- The availability of three-dimensional structures of proteins has provided enormous insight into the detailed molecular machinery of the living cell. Since the late 1950s and early 1960s, experimental methods for deriving protein structures have contributed more than 30,000 entries to the Protein Data Bank, representing the collected knowledge of protein and other macromolecular structures. However, sequencing techniques developed since the 1970s have identified numbers of protein sequences that far exceed the current number of experimentally solved structures. Providing alternative methods for deducing structural information of proteins has been one of the goals in the field of structural bioinformatics, in an effort to relieve the pressure put upon experimentalists by using computational methods for protein structure prediction, functional annotation and classification. We have addressed some of these issues and present new methods used primarily for protein structure prediction via homology modelling.
Current protein databases classify protein domains into groups based on similar structural features, but this work is manually intense and is not able to keep up with rapid growth in available structural data. Therefore, automated algorithms used to consistently define and systematically identify protein domains have been of increasing importance. Proteins domains are often preferred as templates over intact multi-domain proteins in fold recognition and in homology modelling calculations. A single protein can consist of several domains that may appear in a unique combination compared to proteins in the structure database, therefore sequence-based searches against individual domains more likely to be successful. Therefore we have developed a method of identifying protein domains DOMID.
We have approached the structure modelling challenge by deriving a method to perform multiple-template sequence to structure alignment for proteins which is particularly effective for target proteins with low sequence identify to known structures using conserved structural features and sequences similarities. (Less)
Please use this url to cite or link to this publication:
https://lup.lub.lu.se/record/544964
- author
- Lindstam, Mats LU
- supervisor
- opponent
-
- Associate Professor Ph.D. Nielsen, Morten, BioCentrum-DTU, Technical University of Denmark
- organization
- publishing date
- 2005
- type
- Thesis
- publication status
- published
- subject
- keywords
- biomatematik, Molecular biophysics, Molekylär biofysik, medicinsk informatik, Bioinformatik, biomathematics biometrics, medical informatics, structure prediction, Bioinformatics, domain identification, Structural bioinformatics, threading
- pages
- 145 pages
- publisher
- Molecular Biophysics, Lund University
- defense location
- Kemicentrum, sölvegatan 39, Lund Lecture room C
- defense date
- 2005-06-01 13:15:00
- ISBN
- 91-7422-083-7
- language
- English
- LU publication?
- yes
- id
- 194ea937-d2b1-4725-bb17-954383f9215d (old id 544964)
- date added to LUP
- 2016-04-04 10:15:40
- date last changed
- 2018-11-21 20:57:44
@phdthesis{194ea937-d2b1-4725-bb17-954383f9215d, abstract = {{The availability of three-dimensional structures of proteins has provided enormous insight into the detailed molecular machinery of the living cell. Since the late 1950s and early 1960s, experimental methods for deriving protein structures have contributed more than 30,000 entries to the Protein Data Bank, representing the collected knowledge of protein and other macromolecular structures. However, sequencing techniques developed since the 1970s have identified numbers of protein sequences that far exceed the current number of experimentally solved structures. Providing alternative methods for deducing structural information of proteins has been one of the goals in the field of structural bioinformatics, in an effort to relieve the pressure put upon experimentalists by using computational methods for protein structure prediction, functional annotation and classification. We have addressed some of these issues and present new methods used primarily for protein structure prediction via homology modelling.<br/><br> <br/><br> Current protein databases classify protein domains into groups based on similar structural features, but this work is manually intense and is not able to keep up with rapid growth in available structural data. Therefore, automated algorithms used to consistently define and systematically identify protein domains have been of increasing importance. Proteins domains are often preferred as templates over intact multi-domain proteins in fold recognition and in homology modelling calculations. A single protein can consist of several domains that may appear in a unique combination compared to proteins in the structure database, therefore sequence-based searches against individual domains more likely to be successful. Therefore we have developed a method of identifying protein domains DOMID.<br/><br> <br/><br> We have approached the structure modelling challenge by deriving a method to perform multiple-template sequence to structure alignment for proteins which is particularly effective for target proteins with low sequence identify to known structures using conserved structural features and sequences similarities.}}, author = {{Lindstam, Mats}}, isbn = {{91-7422-083-7}}, keywords = {{biomatematik; Molecular biophysics; Molekylär biofysik; medicinsk informatik; Bioinformatik; biomathematics biometrics; medical informatics; structure prediction; Bioinformatics; domain identification; Structural bioinformatics; threading}}, language = {{eng}}, publisher = {{Molecular Biophysics, Lund University}}, school = {{Lund University}}, title = {{New methods for protein structure threading and domain identification}}, year = {{2005}}, }