Determining significance in metagenomic samples

Gregory Ditzler, Robi Polikar, Gail Rosen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Many ecology studies employ ordination methods to visually inspect metagenomic data sets, which initially may contain thousands of dimensions that represent operational taxonomic units (OTUs) of a sample. Many times, MANOVA (applied to a pairwise distance set) is applied to determine how different the groups in the study are from one another. It is convenient to have a p-value that allows us to interpret if two or more groups are different than one another with statistical confidence, where the null hypothesis is that the two populations are not different than the other. With MANOVA all groups are tested under the hypothesis that they are equal. In this work, we present a statistical framework for obtaining a p-value to compare multiple groups that is derived from a non-parametric statistical test, which uses data derived from the OTU features. The result is a matrix of p-values for the comparison on multiple groups in a metagenomic data set. We test our approach on a real-world database using several variations of ordination techniques.

Original languageEnglish (US)
Title of host publication2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012
Pages143-144
Number of pages2
DOIs
StatePublished - Jun 29 2012
Event38th Annual Northeast Bioengineering Conference, NEBEC 2012 - Philadelphia, PA, United States
Duration: Mar 16 2012Mar 18 2012

Publication series

Name2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012

Other

Other38th Annual Northeast Bioengineering Conference, NEBEC 2012
CountryUnited States
CityPhiladelphia, PA
Period3/16/123/18/12

Fingerprint

Statistical tests
Ecology

All Science Journal Classification (ASJC) codes

  • Bioengineering

Cite this

Ditzler, G., Polikar, R., & Rosen, G. (2012). Determining significance in metagenomic samples. In 2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012 (pp. 143-144). [6207004] (2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012). https://doi.org/10.1109/NEBC.2012.6207004
Ditzler, Gregory ; Polikar, Robi ; Rosen, Gail. / Determining significance in metagenomic samples. 2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012. 2012. pp. 143-144 (2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012).
@inproceedings{b7b48b8debfb493e9ab4278068d308f7,
title = "Determining significance in metagenomic samples",
abstract = "Many ecology studies employ ordination methods to visually inspect metagenomic data sets, which initially may contain thousands of dimensions that represent operational taxonomic units (OTUs) of a sample. Many times, MANOVA (applied to a pairwise distance set) is applied to determine how different the groups in the study are from one another. It is convenient to have a p-value that allows us to interpret if two or more groups are different than one another with statistical confidence, where the null hypothesis is that the two populations are not different than the other. With MANOVA all groups are tested under the hypothesis that they are equal. In this work, we present a statistical framework for obtaining a p-value to compare multiple groups that is derived from a non-parametric statistical test, which uses data derived from the OTU features. The result is a matrix of p-values for the comparison on multiple groups in a metagenomic data set. We test our approach on a real-world database using several variations of ordination techniques.",
author = "Gregory Ditzler and Robi Polikar and Gail Rosen",
year = "2012",
month = "6",
day = "29",
doi = "https://doi.org/10.1109/NEBC.2012.6207004",
language = "English (US)",
isbn = "9781467311410",
series = "2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012",
pages = "143--144",
booktitle = "2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012",

}

Ditzler, G, Polikar, R & Rosen, G 2012, Determining significance in metagenomic samples. in 2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012., 6207004, 2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012, pp. 143-144, 38th Annual Northeast Bioengineering Conference, NEBEC 2012, Philadelphia, PA, United States, 3/16/12. https://doi.org/10.1109/NEBC.2012.6207004

Determining significance in metagenomic samples. / Ditzler, Gregory; Polikar, Robi; Rosen, Gail.

2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012. 2012. p. 143-144 6207004 (2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Determining significance in metagenomic samples

AU - Ditzler, Gregory

AU - Polikar, Robi

AU - Rosen, Gail

PY - 2012/6/29

Y1 - 2012/6/29

N2 - Many ecology studies employ ordination methods to visually inspect metagenomic data sets, which initially may contain thousands of dimensions that represent operational taxonomic units (OTUs) of a sample. Many times, MANOVA (applied to a pairwise distance set) is applied to determine how different the groups in the study are from one another. It is convenient to have a p-value that allows us to interpret if two or more groups are different than one another with statistical confidence, where the null hypothesis is that the two populations are not different than the other. With MANOVA all groups are tested under the hypothesis that they are equal. In this work, we present a statistical framework for obtaining a p-value to compare multiple groups that is derived from a non-parametric statistical test, which uses data derived from the OTU features. The result is a matrix of p-values for the comparison on multiple groups in a metagenomic data set. We test our approach on a real-world database using several variations of ordination techniques.

AB - Many ecology studies employ ordination methods to visually inspect metagenomic data sets, which initially may contain thousands of dimensions that represent operational taxonomic units (OTUs) of a sample. Many times, MANOVA (applied to a pairwise distance set) is applied to determine how different the groups in the study are from one another. It is convenient to have a p-value that allows us to interpret if two or more groups are different than one another with statistical confidence, where the null hypothesis is that the two populations are not different than the other. With MANOVA all groups are tested under the hypothesis that they are equal. In this work, we present a statistical framework for obtaining a p-value to compare multiple groups that is derived from a non-parametric statistical test, which uses data derived from the OTU features. The result is a matrix of p-values for the comparison on multiple groups in a metagenomic data set. We test our approach on a real-world database using several variations of ordination techniques.

UR - http://www.scopus.com/inward/record.url?scp=84862729706&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84862729706&partnerID=8YFLogxK

U2 - https://doi.org/10.1109/NEBC.2012.6207004

DO - https://doi.org/10.1109/NEBC.2012.6207004

M3 - Conference contribution

SN - 9781467311410

T3 - 2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012

SP - 143

EP - 144

BT - 2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012

ER -

Ditzler G, Polikar R, Rosen G. Determining significance in metagenomic samples. In 2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012. 2012. p. 143-144. 6207004. (2012 38th Annual Northeast Bioengineering Conference, NEBEC 2012). https://doi.org/10.1109/NEBC.2012.6207004