Inter-functional analysis of high-throughput phenotype data by non-parametric clustering and its application to photosynthesis

  • Qiaozi Gao
  • , Elisabeth Ostendorf
  • , Jeffrey A. Cruz
  • , Rong Jin
  • , David M. Kramer
  • , Jin Chen

Producción científica: Articlerevisión exhaustiva

6 Citas (Scopus)

Resumen

Motivation: Phenomics is the study of the properties and behaviors of organisms (i.e. their phenotypes) on a high-throughput scale. New computational tools are needed to analyze complex phenomics data, which consists of multiple traits/behaviors that interact with each other and are dependent on external factors, such as genotype and environmental conditions, in a way that has not been well studied. Results: We deployed an efficient framework for partitioning complex and high dimensional phenotype data into distinct functional groups. To achieve this, we represented measured phenotype data from each genotype as a cloud-of-points, and developed a novel non-parametric clustering algorithm to cluster all the genotypes. When compared with conventional clustering approaches, the new method is advantageous in that it makes no assumption about the parametric form of the underlying data distribution and is thus particularly suitable for phenotype data analysis. We demonstrated the utility of the new clustering technique by distinguishing novel phenotypic patterns in both synthetic data and a high-throughput plant photosynthetic phenotype dataset. We biologically verified the clustering results using four Arabidopsis chloroplast mutant lines. Availability and implementation: Software is available at www.msu.edu/-jinchen/NPM.

Idioma originalEnglish
Páginas (desde-hasta)67-76
Número de páginas10
PublicaciónBioinformatics
Volumen32
N.º1
DOI
EstadoPublished - ene 1 2016

Nota bibliográfica

Publisher Copyright:
© 2015 The Author 2015. Published by Oxford University Press. All rights reserved.

Financiación

This research was supported by the U.S. Department of Energy, Office of Science, Basic Energy Sciences [DE-FG02-91ER20021] for work at the PRL by JC and DMK on data collections and analyses, DOE [DE-AR000202] for experimental work at PRL by EO, the National Science Foundation [award number 1458556] for work by JC on algorithm development, and the MSU Center for Advanced Algal and Plant Phenotyping for development and use of phenotyping tools.

FinanciadoresNúmero del financiador
MSU Center for Advanced Algal and Plant Phenotyping
National Science Foundation (NSF)1458556
Michigan State University-U.S. Department of Energy (MSU-DOE) Plant Research LaboratoryDE-AR000202
Office of Science Programs
Office of Basic Energy SciencesDE-FG02-91ER20021

    ASJC Scopus subject areas

    • Statistics and Probability
    • Biochemistry
    • Molecular Biology
    • Computer Science Applications
    • Computational Theory and Mathematics
    • Computational Mathematics

    Huella

    Profundice en los temas de investigación de 'Inter-functional analysis of high-throughput phenotype data by non-parametric clustering and its application to photosynthesis'. En conjunto forman una huella única.

    Citar esto