Skip to main content

Lund University Publications

LUND UNIVERSITY LIBRARIES

Using standard microbiome reference groups to simplify beta-diversity analyses and facilitate independent validation

Maziarz, Marlena LU ; Pfeiffer, Ruth M. ; Wan, Yunhu and Gail, Mitchell H. (2018) In Bioinformatics 34(19). p.3249-3257
Abstract

Motivation: Comparisons of microbiome communities across populations are often based on pairwise distance measures (beta-diversity). Standard analyses (principal coordinate plots, permutation tests, kernel methods) require access to primary data if another investigator wants to add or compare independent data. We propose using standard reference measurements to simplify microbiome beta-diversity analyses, to make them more transparent, and to facilitate independent validation and comparisons across studies. Results: Using stool and nasal reference sets from the Human Microbiome Project (HMP), we computed mean distances (actually Bray-Curtis or Pearson correlation dissimilarities) to each reference set for each new sample. Thus, each new... (More)

Motivation: Comparisons of microbiome communities across populations are often based on pairwise distance measures (beta-diversity). Standard analyses (principal coordinate plots, permutation tests, kernel methods) require access to primary data if another investigator wants to add or compare independent data. We propose using standard reference measurements to simplify microbiome beta-diversity analyses, to make them more transparent, and to facilitate independent validation and comparisons across studies. Results: Using stool and nasal reference sets from the Human Microbiome Project (HMP), we computed mean distances (actually Bray-Curtis or Pearson correlation dissimilarities) to each reference set for each new sample. Thus, each new sample has two mean distances that can be plotted and analyzed with classical statistical methods. To test the approach, we studied independent (not reference) HMP subjects. Simple Hotelling tests demonstrated statistically significant differences in mean distances to reference sets between all pairs of body sites (stool, skin,nasal, saliva and vagina) at the phylum, class, order, family and genus levels. Using the distance to a single reference set was usually sufficient, but using both reference sets always worked well. The use of reference sets simplifies standard analyses of beta-diversity and facilitates the independent validation and combining of such data because others can compute distances to the same reference sets. Moreover, standard statistical methods for survival analysis, logistic regression and other procedures can be applied to vectors of mean distances to reference sets, thereby greatly expanding the potential uses of beta-diversity information. More work is needed to identify the best reference sets for particular applications.

(Less)
Please use this url to cite or link to this publication:
author
; ; and
publishing date
type
Contribution to journal
publication status
published
subject
in
Bioinformatics
volume
34
issue
19
pages
9 pages
publisher
Oxford University Press
external identifiers
  • scopus:85054062103
  • pmid:29668831
ISSN
1367-4803
DOI
10.1093/bioinformatics/bty297
language
English
LU publication?
no
id
32d28a01-3a10-47c1-b9be-0d4bed6560fb
date added to LUP
2019-08-05 11:47:02
date last changed
2024-04-16 17:17:40
@article{32d28a01-3a10-47c1-b9be-0d4bed6560fb,
  abstract     = {{<p>Motivation: Comparisons of microbiome communities across populations are often based on pairwise distance measures (beta-diversity). Standard analyses (principal coordinate plots, permutation tests, kernel methods) require access to primary data if another investigator wants to add or compare independent data. We propose using standard reference measurements to simplify microbiome beta-diversity analyses, to make them more transparent, and to facilitate independent validation and comparisons across studies. Results: Using stool and nasal reference sets from the Human Microbiome Project (HMP), we computed mean distances (actually Bray-Curtis or Pearson correlation dissimilarities) to each reference set for each new sample. Thus, each new sample has two mean distances that can be plotted and analyzed with classical statistical methods. To test the approach, we studied independent (not reference) HMP subjects. Simple Hotelling tests demonstrated statistically significant differences in mean distances to reference sets between all pairs of body sites (stool, skin,nasal, saliva and vagina) at the phylum, class, order, family and genus levels. Using the distance to a single reference set was usually sufficient, but using both reference sets always worked well. The use of reference sets simplifies standard analyses of beta-diversity and facilitates the independent validation and combining of such data because others can compute distances to the same reference sets. Moreover, standard statistical methods for survival analysis, logistic regression and other procedures can be applied to vectors of mean distances to reference sets, thereby greatly expanding the potential uses of beta-diversity information. More work is needed to identify the best reference sets for particular applications.</p>}},
  author       = {{Maziarz, Marlena and Pfeiffer, Ruth M. and Wan, Yunhu and Gail, Mitchell H.}},
  issn         = {{1367-4803}},
  language     = {{eng}},
  month        = {{10}},
  number       = {{19}},
  pages        = {{3249--3257}},
  publisher    = {{Oxford University Press}},
  series       = {{Bioinformatics}},
  title        = {{Using standard microbiome reference groups to simplify beta-diversity analyses and facilitate independent validation}},
  url          = {{http://dx.doi.org/10.1093/bioinformatics/bty297}},
  doi          = {{10.1093/bioinformatics/bty297}},
  volume       = {{34}},
  year         = {{2018}},
}