Japanese Journal of Biometrics (計量生物学)
Online ISSN : 2185-6494
Print ISSN : 0918-4430
ISSN-L : 0918-4430
Volume 25, Issue 2
Original Articles
Review Articles
  • 松山 裕
    2004, Volume 25, Issue 2, pp. 89-116
    Published: 2004/12/31
    Released online: 2012/02/08
    Journal, free access
    Missing data are a prevalent complication in the analysis of longitudinal studies and remain an active area of research for biostatisticians and other quantitative methodologists. This paper reviews several statistical methods used to address outcome-related drop-out. First, we review important concepts such as missing data patterns, missing data mechanisms, ignorability, and likelihood-based inference, which were originally proposed by Rubin (1976, Biometrika 63, 581-592). Second, we review simple methods for handling drop-outs, namely complete-case analysis, available-data analysis, and last observation carried forward analysis, and describe their limitations. Third, we review more sophisticated approaches that take the missing data mechanism into account in the analysis: inverse probability weighted methods and multiple imputation methods, which represent two distinct paradigms for handling missing data. We also review methods for non-ignorable drop-outs, presenting three approaches: selection models, pattern mixture models, and latent variable models. We illustrate the techniques using the longitudinal clinical trial of contracepting women reported by Machin et al. (1988, Contraception 38, 165-179). We briefly review methods for analysis in the presence of missing covariates. Finally, we give some cautionary notes on the analysis of missing data.
  • 松浦 正明, 牛嶋 大, 宮田 敏
    2004, Volume 25, Issue 2, pp. 117-134
    Published: 2004/12/31
    Released online: 2012/02/08
    Journal, free access
    Many methods and tools for bioinformatics have been developed rapidly in recent years, driven by technological progress and the success of the Genome Project. However, new methodologies for detecting useful and important biomarkers or causal disease genes still need to be developed in order to establish tailor-made medical treatment, or personalized medicine. This paper discusses high-throughput genome-related data accompanied by clinical information and reviews analysis methods for finding biomarkers or causal genes. We point out problems in the data from the standpoint of statistical analysis and in methods widely used as standards. The data covered include SNP (single nucleotide polymorphism) data, microarray data, and mass-spectrometry proteome data. For the analysis of high-throughput data, we discuss study design issues, problems with multidimensional data, and the false discovery rate (FDR) in multiple testing. For SNP data, we describe how haplotype-block-based research has replaced single-SNP analysis as the main style of association study. For microarray data, we show the usefulness of AdaBoost for finding biomarkers, and likewise for analyzing proteome data.
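
Illustrative note (not part of the listed articles): the second abstract mentions controlling the False Discovery Rate in multiple testing without naming a procedure. As a minimal sketch, assuming the widely used Benjamini-Hochberg step-up procedure is what a reader would try first, FDR control over a list of p-values might look like the following. The function name `benjamini_hochberg` and the example p-values are hypothetical and do not come from the paper.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return booleans marking which hypotheses are rejected at FDR level q.

    Assumption: Benjamini-Hochberg step-up procedure, valid for independent
    (or positively dependent) tests. The abstract does not say which FDR
    procedure the authors discuss.
    """
    m = len(p_values)
    if m == 0:
        return []

    # Indices of the p-values, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k such that p_(k) <= (k / m) * q.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            max_k = rank

    # Reject every hypothesis whose rank is at most max_k (step-up rule).
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_k:
            rejected[idx] = True
    return rejected


if __name__ == "__main__":
    # Hypothetical p-values, e.g. one per gene or SNP.
    pvals = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
    print(benjamini_hochberg(pvals, q=0.05))
```

Because the rule is step-up, a p-value that misses its own threshold can still be rejected when a larger p-value meets a later threshold. This is why the sketch searches for the largest qualifying rank instead of stopping at the first failure.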