Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Protein family neighborhood analyzer-ProFaNA

Baranowski, Bartosz and Pawłowski, Krzysztof LU (2023) In PeerJ 11.
Abstract

Background: Functionally related genes are well known to be often grouped in close vicinity in the genomes, particularly in prokaryotes. Notwithstanding the diverse evolutionary mechanisms leading to this phenomenon, it can be used to predict functions of uncharacterized genes. Methods: Here, we provide a simple but robust statistical approach that leverages the vast amounts of genomic data available today. Considering a protein domain as a functional unit, one can explore other functional units (domains) that significantly often occur within the genomic neighborhoods of the queried domain. This analysis can be performed across different taxonomic levels. Provisions can also be made to correct for the uneven sampling of the taxonomic... (More)

Background: Functionally related genes are well known to be often grouped in close vicinity in the genomes, particularly in prokaryotes. Notwithstanding the diverse evolutionary mechanisms leading to this phenomenon, it can be used to predict functions of uncharacterized genes. Methods: Here, we provide a simple but robust statistical approach that leverages the vast amounts of genomic data available today. Considering a protein domain as a functional unit, one can explore other functional units (domains) that significantly often occur within the genomic neighborhoods of the queried domain. This analysis can be performed across different taxonomic levels. Provisions can also be made to correct for the uneven sampling of the taxonomic space by genomic sequencing projects that often focus on large numbers of very closely related strains, e.g., pathogenic ones. To this end, an optional procedure for averaging occurrences within subtaxa is available. Results: Several examples show this approach can provide useful functional predictions for uncharacterized gene families, and how to combine this information with other approaches. The method is made available as a web server at http://bioinfo.sggw.edu.pl/neighborhood_analysis.

(Less)
Please use this url to cite or link to this publication:
author
and
organization
publishing date
type
Contribution to journal
publication status
published
subject
keywords
Comparative genomics, Gene function prediction, Genomic neighborhoods, Protein domains
in
PeerJ
volume
11
article number
e15715
publisher
PeerJ
external identifiers
  • pmid:37492397
  • scopus:85170025741
ISSN
2167-8359
DOI
10.7717/peerj.15715
language
English
LU publication?
yes
id
11892088-271f-4b11-871b-6639d80cf13c
date added to LUP
2023-11-03 10:55:45
date last changed
2024-04-19 03:31:44
@article{11892088-271f-4b11-871b-6639d80cf13c,
  abstract     = {{<p>Background: Functionally related genes are well known to be often grouped in close vicinity in the genomes, particularly in prokaryotes. Notwithstanding the diverse evolutionary mechanisms leading to this phenomenon, it can be used to predict functions of uncharacterized genes. Methods: Here, we provide a simple but robust statistical approach that leverages the vast amounts of genomic data available today. Considering a protein domain as a functional unit, one can explore other functional units (domains) that significantly often occur within the genomic neighborhoods of the queried domain. This analysis can be performed across different taxonomic levels. Provisions can also be made to correct for the uneven sampling of the taxonomic space by genomic sequencing projects that often focus on large numbers of very closely related strains, e.g., pathogenic ones. To this end, an optional procedure for averaging occurrences within subtaxa is available. Results: Several examples show this approach can provide useful functional predictions for uncharacterized gene families, and how to combine this information with other approaches. The method is made available as a web server at http://bioinfo.sggw.edu.pl/neighborhood_analysis.</p>}},
  author       = {{Baranowski, Bartosz and Pawłowski, Krzysztof}},
  issn         = {{2167-8359}},
  keywords     = {{Comparative genomics; Gene function prediction; Genomic neighborhoods; Protein domains}},
  language     = {{eng}},
  publisher    = {{PeerJ}},
  series       = {{PeerJ}},
  title        = {{Protein family neighborhood analyzer-ProFaNA}},
  url          = {{http://dx.doi.org/10.7717/peerj.15715}},
  doi          = {{10.7717/peerj.15715}},
  volume       = {{11}},
  year         = {{2023}},
}