Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

Najibi, Seyed Morteza; Maadooliat, Mehdi; Zhou, Lan; Huang, Jianhua Z.; Gao, Xin

Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

Mark

; Maadooliat, Mehdi ; Zhou, Lan ; Huang, Jianhua Z. and Gao, Xin (2017) In Computational and Structural Biotechnology Journal 15. p.243-254

Abstract: Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes... (More); Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.
(Less)

Please use this url to cite or link to this publication: https://lup.lub.lu.se/record/4fdac188-493b-4a88-af32-1bfd66541c1d

author

Najibi, Seyed Morteza ^LU

; Maadooliat, Mehdi ; Zhou, Lan ; Huang, Jianhua Z. and Gao, Xin

publishing date

2017

type

Contribution to journal

publication status

published

keywords

Bivariate splines, Log-spline density estimation, Protein classification, Protein structure, Ramachandran distribution, Roughness penalty, SCOP, Trigonometric B-spline

in

Computational and Structural Biotechnology Journal

volume

15

pages

12 pages

publisher

Research Network of Computational and Structural Biotechnology

external identifiers

scopus:85013872144
pmid:28280526

DOI

10.1016/j.csbj.2017.01.011

language

English

LU publication?

no

id

4fdac188-493b-4a88-af32-1bfd66541c1d

date added to LUP

2019-07-12 01:43:47

date last changed

2026-02-06 05:46:43

@article{4fdac188-493b-4a88-af32-1bfd66541c1d,
  abstract     = {{<p>Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.</p>}},
  author       = {{Najibi, Seyed Morteza and Maadooliat, Mehdi and Zhou, Lan and Huang, Jianhua Z. and Gao, Xin}},
  keywords     = {{Bivariate splines; Log-spline density estimation; Protein classification; Protein structure; Ramachandran distribution; Roughness penalty; SCOP; Trigonometric B-spline}},
  language     = {{eng}},
  pages        = {{243--254}},
  publisher    = {{Research Network of Computational and Structural Biotechnology}},
  series       = {{Computational and Structural Biotechnology Journal}},
  title        = {{Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions}},
  url          = {{http://dx.doi.org/10.1016/j.csbj.2017.01.011}},
  doi          = {{10.1016/j.csbj.2017.01.011}},
  volume       = {{15}},
  year         = {{2017}},
}

Lund University Publications

LUND UNIVERSITY LIBRARIES

Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions