Japanese Journal of Biometrics (計量生物学)
Online ISSN : 2185-6494
Print ISSN : 0918-4430
ISSN-L : 0918-4430
Original Article
The Q-Q Plot of p-values for Predicting Outcomes with the Gene Expression Data
Yoichi M. Ito, Yasuhiro Fujiwara, Yasuo Ohashi
Journal, free access

2007, Volume 28, Issue 1, pp. 37-46

Abstract

Michiels et al. (2005) showed that a list of genes identified as predictors of prognosis via a non-repeated training-validation approach is unstable, and advocated validation by repeated random sampling. They regarded genes that were selected among the top 50 genes in more than half of their jackknife samples as stable for prediction. However, there is no rationale for the chosen length of the gene list or for the stability threshold. Since a stability assessment essentially requires evaluating the accumulation of low p-values across the repeated random samples, it is better to compare the observed distribution of a gene's p-values directly with the distribution of p-values under the null hypothesis. In this study, we propose the Quantile-Quantile plot (Q-Q plot) of p-values with a null reference for this purpose. We applied the proposed method to clinical data on primary breast cancer. The Q-Q plot approach can reveal that genes with similar p-values in the ordinary analysis have different p-value distributions under repeated random sampling, and that a gene whose low p-values accumulate under repeated random sampling can be evaluated against the reference lines in the Q-Q plot.
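
The idea of collecting one gene's p-values over repeated random samples and plotting them against a null reference can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the 50% subsample fraction, the normal-approximation two-sample test, and the use of permuted outcome labels as the null reference are all assumptions made for illustration, and the data are simulated.

```python
import math
import random
import statistics


def two_sample_p(x, y):
    """Two-sided p-value from a Welch-type statistic, using a normal
    approximation (an assumption; the abstract does not name the test)."""
    se = math.sqrt(statistics.variance(x) / len(x) + statistics.variance(y) / len(y))
    if se == 0.0:
        return 1.0
    z = (statistics.mean(x) - statistics.mean(y)) / se
    return math.erfc(abs(z) / math.sqrt(2.0))


def resampled_p_values(expr, outcome, n_rep=200, frac=0.5, seed=0):
    """Sorted p-values of one gene over repeated random subsamples of patients."""
    rng = random.Random(seed)
    idx = list(range(len(expr)))
    k = int(frac * len(idx))
    pvals = []
    while len(pvals) < n_rep:
        sub = rng.sample(idx, k)
        g0 = [expr[i] for i in sub if outcome[i] == 0]
        g1 = [expr[i] for i in sub if outcome[i] == 1]
        if len(g0) < 2 or len(g1) < 2:
            continue  # skip subsamples too unbalanced to test
        pvals.append(two_sample_p(g0, g1))
    return sorted(pvals)


def qq_points(sorted_p):
    """Pairs (uniform quantile, observed p-value) on the -log10 scale."""
    n = len(sorted_p)
    return [(-math.log10((i + 0.5) / n), -math.log10(max(p, 1e-300)))
            for i, p in enumerate(sorted_p)]


if __name__ == "__main__":
    rng = random.Random(1)
    outcome = [i % 2 for i in range(80)]                 # simulated binary outcome
    gene = [rng.gauss(1.0 * o, 1.0) for o in outcome]    # simulated expression
    observed = resampled_p_values(gene, outcome)
    # Hypothetical null reference: same procedure with permuted outcome labels.
    shuffled = outcome[:]
    rng.shuffle(shuffled)
    null_ref = resampled_p_values(gene, shuffled)
    for (q, obs), (_, ref) in list(zip(qq_points(observed), qq_points(null_ref)))[:5]:
        print(f"expected={q:.2f} observed={obs:.2f} null_reference={ref:.2f}")
```

Because the subsamples overlap, the resulting p-values are correlated, so the uniform quantiles on the horizontal axis serve only as a plotting scale here. The permuted-label curve is one possible stand-in for the reference lines mentioned in the abstract, whose actual construction is not described in this section.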

© 2007 The Biometric Society of Japan