dc.contributor.author |
Ren, B |
en |
dc.contributor.author |
Bacallado, S |
en |
dc.contributor.author |
Favaro, S |
en |
dc.contributor.author |
Vatanen, Tommi |
en |
dc.contributor.author |
Huttenhower, C |
en |
dc.contributor.author |
Trippa, L |
en |
dc.date.accessioned |
2020-06-11T23:41:18Z |
en |
dc.date.issued |
2020-04-16 |
en |
dc.identifier.citation |
Annals of Applied Probability : An official journal of the Institute of Mathematical Statistics 14(1):494-517 16 Apr 2020 |
en |
dc.identifier.issn |
1050-5164 |
en |
dc.identifier.uri |
http://hdl.handle.net/2292/51505 |
en |
dc.description.abstract |
Detecting associations between microbial compositions and sample characteristics is one of the most important tasks in microbiome studies. Most of the existing methods apply univariate models to single microbial species separately, with adjustments for multiple hypothesis testing. We propose a Bayesian analysis for a generalized mixed effects linear model tailored to this application. The marginal prior on each microbial composition is a Dirichlet process, and dependence across compositions is induced through a linear combination of individual covariates, such as disease biomarkers or the subject’s age, and latent factors. The latent factors capture residual variability and their dimensionality is learned from the data in a fully Bayesian procedure. The proposed model is tested in data analyses and simulation studies with zero-inflated compositions. In these settings and within each sample, a large proportion of counts per microbial species are equal to zero. In our Bayesian model a priori the probability of compositions with absent microbial species is strictly positive. We propose an efficient algorithm to sample from the posterior and visualizations of model parameters which reveal associations between covariates and microbial compositions. We evaluate the proposed method in simulation studies, and then analyze a microbiome dataset for infants with type 1 diabetes which contains a large proportion of zeros in the sample-specific microbial compositions. |
en |
dc.publisher |
Institute of Mathematical Statistics |
en |
dc.relation.ispartofseries |
The annals of applied probability : an official journal of the Institute of Mathematical Statistics |
en |
dc.rights |
Items in ResearchSpace are protected by copyright, with all rights reserved, unless otherwise indicated. Previously published items are made available in accordance with the copyright policy of the publisher. |
en |
dc.rights.uri |
https://researchspace.auckland.ac.nz/docs/uoa-docs/rights.htm |
en |
dc.rights.uri |
https://imstat.org/journals-and-publications/annals-of-applied-probability/annals-of-applied-probability-manuscript-submission/ |
en |
dc.title |
Bayesian mixed effects models for zero-inflated compositions in microbiome data analysis |
en |
dc.type |
Journal Article |
en |
dc.identifier.doi |
10.1214/19-AOAS1295 |
en |
pubs.issue |
1 |
en |
pubs.begin-page |
494 |
en |
pubs.volume |
14 |
en |
dc.rights.holder |
Copyright: Institute of Mathematical Statistics |
en |
pubs.author-url |
https://projecteuclid.org/euclid.aoas/1587002684 |
en |
pubs.end-page |
517 |
en |
dc.rights.accessrights |
http://purl.org/eprint/accessRights/OpenAccess |
en |
pubs.subtype |
Article |
en |
pubs.elements-id |
800844 |
en |
pubs.org-id |
Liggins Institute |
en |
pubs.record-created-at-source-date |
2020-05-07 |
en |