Improving the Identification of Phenotypic Abnormalities and Sexual Dimorphism in Mice When Studying Rare Event Categorical Characteristics.

Genetics 2017 02 7;205(2):491-501. Epub 2016 Dec 7.

Department of Statistics and Operations Research, School of Mathematical Sciences, Tel Aviv University, Israel.

Biological research frequently involves the study of phenotyping data. Many of these studies focus on rare event categorical data, and functional genomics studies typically study the presence or absence of an abnormal phenotype. With the growing interest in the role of sex, there is a need to assess the phenotype for sexual dimorphism. The identification of abnormal phenotypes for downstream research is challenged by the small sample size, the rare event nature, and the multiple testing problem, as many variables are monitored simultaneously. Here, we develop a statistical pipeline to assess statistical and biological significance while managing the multiple testing problem. We propose a two-step pipeline to initially assess for a treatment effect, in our case example genotype, and then test for an interaction with sex. We compare multiple statistical methods and use simulations to investigate the control of the type-one error rate and power. To maximize the power while addressing the multiple testing issue, we implement filters to remove data sets where the hypotheses to be tested cannot achieve significance. A motivating case study utilizing a large scale high-throughput mouse phenotyping data set from the Wellcome Trust Sanger Institute Mouse Genetics Project, where the treatment is a gene ablation, demonstrates the benefits of the new pipeline on the downstream biological calls.

Download full-text PDF

Source
http://dx.doi.org/10.1534/genetics.116.195388DOI Listing
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5289831PMC
February 2017
16 Reads

Publication Analysis

Top Keywords

rare event
12
multiple testing
12
event categorical
8
phenotyping data
8
testing problem
8
sexual dimorphism
8
sex compare
4
interaction sex
4
genotype test
4
treatment case
4
case example
4
example genotype
4
compare multiple
4
test interaction
4
statistical methods
4
type-one error
4
error rate
4
rate power
4
control type-one
4
investigate control
4

References

(Supplied by CrossRef)
Selective inference on multiple families of hypotheses.
Benjamini et al.
J. R. Stat. Soc. Series B Stat. Methodol. 2014
Controlling the false discovery rate - a practical and powerful approach to multiple testing.
Benjamini et al.
J. R. Stat. Soc. B 1995

Similar Publications