PhenStat: A Tool Kit for Standardized Analysis of High Throughput Phenotypic Data

PLoS One. 2015 Jul 6;10(7):e0131274. doi: 10.1371/journal.pone.0131274. eCollection 2015.

Abstract

The lack of reproducibility with animal phenotyping experiments is a growing concern among the biomedical community. One contributing factor is the inadequate description of statistical analysis methods that prevents researchers from replicating results even when the original data are provided. Here we present PhenStat--a freely available R package that provides a variety of statistical methods for the identification of phenotypic associations. The methods have been developed for high throughput phenotyping pipelines implemented across various experimental designs with an emphasis on managing temporal variation. PhenStat is targeted to two user groups: small-scale users who wish to interact and test data from large resources and large-scale users who require an automated statistical analysis pipeline. The software provides guidance to the user for selecting appropriate analysis methods based on the dataset and is designed to allow for additions and modifications as needed. The package was tested on mouse and rat data and is used by the International Mouse Phenotyping Consortium (IMPC). By providing raw data and the version of PhenStat used, resources like the IMPC give users the ability to replicate and explore results within their own computing environment.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Datasets as Topic / standards
  • Datasets as Topic / statistics & numerical data
  • Female
  • High-Throughput Screening Assays / methods
  • High-Throughput Screening Assays / standards*
  • High-Throughput Screening Assays / statistics & numerical data
  • Linear Models
  • Male
  • Mice
  • Phenotype*
  • Rats
  • Reference Standards
  • Reproducibility of Results*
  • Software*