2020, Volume 6, Issue 1, pp. 81-85
Background: Sample size estimation is an important and integral part of a research protocol. While “how large a sample?” is a simple question, the answer is only meaningful within the context of the research question.
Methods: Clear definitions of the variable of interest and target population parameters are key to estimating sample size. In turn, the sample must be sized such that it can accurately detect the ‘effect’ of interest, adequately represent the target population and maintain maximum design efficiency. Four basic pieces of information are utilized in most sample size estimation across all clinical research: significance level, power, magnitude of effect and variability of the variable of interest.
Results/Discussion: Preliminary determination of these will greatly facilitate work with a biostatistician or a computer application to create a sample size estimate. While applications can support relatively simple sample size calculations, consultation with a biostatistician is recommended.
In research design, an important (and certainly very frequently asked) question is “how large a sample is needed?” Perhaps this is because it seems like such a simple question, or because it can seem shrouded in mystery, or both. But any discussion of sample size must begin with the research question and then with the sample design, both of which are derived from, and are a critical part of, a full study protocol. This matters because the sampling design and, in turn, the sample size are meaningful only within the context of the research question and of the concern with answering that question effectively and accurately. The goal of this paper is to introduce the basic concepts needed to understand sample size estimation. Our hope is that the basics introduced here will apply, and be useful, across the wide range of designs required for medical research. Although the examples used are drawn from clinical research, the approach is the same in all basic research.
Once the research question and variables (i.e. effects) of interest are clearly defined, the target population, i.e. a group of people who share a common characteristic or condition (the variables of interest) (1), can be described. The sample to be taken should be, as much as possible, representative of this target population (for purposes of this paper we will assume the researcher will work with a sample rather than the entire target population). The researcher is entirely responsible for assuring this representation through the sample design adopted. The sample design has a minimum of four key factors: an effective sample methodology, identification of the sample frame, sampling methods and sample size estimation (2).
So how big should the sample be? The answer has two parts. First: “large enough to detect the key variable, effect or association of interest.” This depends on the magnitude of that variable, effect or association in the target population (3). If the magnitude is large, e.g. in an imaging study where we might wish to show that the baseline brain natriuretic peptide (BNP) measurement of 149.5 units in the granulocyte-colony-stimulating-factor (G-CSF) group differs from the control group measurement of 184.7 units (a difference of 35.2 units) (4), a smaller sample is required. If the magnitude is small, e.g. a difference of 3.1 BNP units at 6 months between the control (106.6) and G-CSF (103.5) groups (4), a larger sample is required. Hence, if the sample is too small, the study can (easily) fail to detect a clinically important effect or association (5). If the sample is too large, the study can be too costly (e.g. a waste of live animals in animal research) or difficult, and may detect an effect that is statistically significant but not clinically important.
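The contrast between the two BNP differences can be made concrete with a small sketch. It assumes the common normal-approximation setting for comparing two means, in which the required sample size per group is proportional to 1/Δ², and it further assumes (hypothetically) that variability, significance level and power are the same for both comparisons. The function name `relative_sample_size` is illustrative only.

```python
def relative_sample_size(large_diff: float, small_diff: float) -> float:
    """Ratio of required sample sizes (small difference vs. large difference).

    Assumes n is proportional to 1 / diff**2, with the same variability,
    significance level and power in both comparisons (an assumption made
    here for illustration, not a statement about the cited study).
    """
    if large_diff == 0 or small_diff == 0:
        raise ValueError("differences must be non-zero")
    return (large_diff / small_diff) ** 2


# Differences quoted in the text: 35.2 BNP units (baseline) and 3.1 units (6 months).
ratio = relative_sample_size(35.2, 3.1)
print(f"The smaller difference needs about {ratio:.0f} times as many subjects per group")
```

Under these assumptions, detecting the 3.1-unit difference would take roughly 129 times as many subjects per group as detecting the 35.2-unit difference.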
Second: “large enough to effectively represent the target population”; this depends in part on the kind of study being designed and on the homogeneity of the target population. Generally, there are two types of clinical research studies: clinical trials (which include the introduction of an intervention) and observational studies. While random assignment is the ideal, in observational studies (arguably the most common type of clinical research study) the investigator does not assign treatment to subjects but instead examines pre-existing treatments, exposures or policies and their effects (6). Because treatment is not randomly assigned, observational studies normally experience greater heterogeneity in the target population; this heterogeneity must be ‘coped with’ in the sampling plan because representativeness is key (1). Heterogeneity in a target population generally requires a larger sample size to accurately capture the variable of interest. In addition, two different samples are required: one for the control (or sometimes alternative treatment) group, which often represents the population from which the cases originated, and another for the case/treatment group, based on diagnosis of the disease or condition of interest (2). The inclusion and exclusion criteria in the study plan should make it very clear how heterogeneous or homogeneous the target population is.
In contrast, clinical trials aim to obtain a homogeneous sample, often those who will benefit from (and will consent to) an intervention (2), and may not always represent a general target population. Multicenter studies are conducted, in part, to offset these limitations.
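Four basic pieces of needed information
So how big should the sample be? In general, there are four basic pieces of information needed to estimate sample size: significance level, power, magnitude of effect (difference), and variability of the variable of interest.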
Estimating a meaningful magnitude of difference may be a challenge. While it is typically estimated as the difference between treatment and control outcomes, observational, epidemiological, and other study designs require alternative methods. In single-sample (one-arm) designs, the magnitude of difference may be estimated from the difference between the treatment arm and a known population proportion, findings from a previous study, or a historical control. The difference may have to be based not on previous data but on an estimate of the ‘minimal clinically important difference (MCID)’, that is, “something significant enough to change patient management” (9). There may also be several ‘variables of interest’, which can be confusing; in this situation the sample size could be based on the one variable that requires the largest sample.
Any one of these four pieces of information can influence the estimated size of the needed sample: selecting a lower level of power, a higher significance level (i.e. a larger α), or a larger magnitude of difference can decrease the needed sample (11). While critical peer review (for a journal article) may ‘catch’ poorly designed and sized samples, it is the responsibility of the researcher to develop meaningful estimates and, in turn, correct sample size estimates.
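The direction of each of these effects can be checked with a short sketch. It assumes a widely used normal-approximation formula for comparing two means, n per group = 2(z₁₋α/₂ + z₁₋β)²σ²/Δ², with two independent groups, equal variances, equal group sizes and a two-sided test. The function name `n_per_group` and all planning values are hypothetical and are not taken from the cited studies.

```python
from math import ceil
from statistics import NormalDist


def n_per_group(alpha: float, power: float, diff: float, sd: float) -> int:
    """Per-group sample size for comparing two means (normal approximation).

    Assumes two independent groups, equal variances, equal group sizes
    and a two-sided test.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # critical value for the significance level
    z_beta = NormalDist().inv_cdf(power)           # quantile matching the desired power
    return ceil(2 * (z_alpha + z_beta) ** 2 * sd ** 2 / diff ** 2)


# Hypothetical planning values, chosen only to show the direction of each change.
baseline = n_per_group(alpha=0.05, power=0.80, diff=5.0, sd=10.0)
lower_power = n_per_group(alpha=0.05, power=0.70, diff=5.0, sd=10.0)
higher_alpha = n_per_group(alpha=0.10, power=0.80, diff=5.0, sd=10.0)
larger_diff = n_per_group(alpha=0.05, power=0.80, diff=10.0, sd=10.0)
larger_sd = n_per_group(alpha=0.05, power=0.80, diff=5.0, sd=15.0)

for label, n in [("baseline", baseline), ("lower power", lower_power),
                 ("higher alpha", higher_alpha), ("larger difference", larger_diff),
                 ("larger SD", larger_sd)]:
    print(f"{label:>18}: {n} per group")
```

Lower power, a larger α and a larger difference each reduce the per-group sample size relative to the baseline, while greater variability increases it.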
Two basic sample size calculations
These four pieces of information, or some portion of them, are generally utilized in most sample size formulations across all fields of clinical research (human, animal and others) (12). They may take on different configurations, but researching them and developing estimates and/or determining standards (e.g. for significance and power levels) before you see the biostatistician (or before you access a sample size application) will result in a far more meaningful and correct estimate of the needed sample. Remember, if you do not calculate the sample size correctly, you may not be able to test your hypothesis adequately.
We present two examples. The first, estimating sample size for comparing two groups on a continuous variable (a two independent samples test) assuming equal variances and equal sample sizes per group, illustrates how these four pieces of information are utilized. The second deals with estimating sample size for two proportions.
Example 1. Two continuous measures
For the first example, a basic formula for calculating sample size (n) is (13, 14):
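A commonly used normal-approximation form for this setting (two independent groups, equal variances, equal group sizes, two-sided test) is sketched below. It is offered as an assumption about the general shape of such formulas, not as the exact expression given in references (13, 14); the symbols are chosen here for illustration.

```latex
\[
  n \;=\; \frac{2\,\bigl(z_{1-\alpha/2} + z_{1-\beta}\bigr)^{2}\,\sigma^{2}}{\Delta^{2}}
\]
% n              : required sample size per group
% z_{1-\alpha/2} : standard normal quantile for the two-sided significance level \alpha
% z_{1-\beta}    : standard normal quantile for the desired power 1-\beta
% \sigma         : common standard deviation of the variable of interest (variability)
% \Delta         : magnitude of the difference between group means to be detected
```

Each of the four pieces of information appears once: significance level and power through the two quantiles, variability through σ, and magnitude of effect through Δ.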
For this example, we will use the statistics from the results section of the Toyama et al. paper in ANC Vol. 5, p. 24 (4) as the basis for calculating the sample size for a new study. If BNP represents our variable of interest and the 6-month control group represents our target population, we would set:
Example 2. Two proportions
For the second example, a basic formula for estimating sample size (n) for two proportions is (15):
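One widely used normal-approximation form for comparing two independent proportions with equal group sizes is sketched below. It is an assumption about the general shape of such formulas rather than the exact expression in reference (15), which may instead use a pooled proportion; the symbols are illustrative.

```latex
\[
  n \;=\; \frac{\bigl(z_{1-\alpha/2} + z_{1-\beta}\bigr)^{2}\,
                \bigl[\,p_{1}(1-p_{1}) + p_{2}(1-p_{2})\,\bigr]}
               {(p_{1} - p_{2})^{2}}
\]
% n              : required sample size per group
% z_{1-\alpha/2} : standard normal quantile for the two-sided significance level \alpha
% z_{1-\beta}    : standard normal quantile for the desired power 1-\beta
% p_1, p_2       : expected proportions in the two groups
```

Here the expected proportions supply both the magnitude of difference, through p₁ − p₂, and the variability, through p(1 − p) in each group.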
For this example, we will use statistics from the results section of the Muramatsu et al. paper, Table 1, in ANC Vol. 5, p. 35 (16). If “B blocker (%)” represents our treatment of interest and Aortic cal +/- our variable of interest, which provides two proportions (for the B blocker and the control groups) for comparison, we would set:
Note that the only information needed that differs from Example 1 is the expected proportion in each of the two groups. These proportions play the role of the minimum magnitude of difference discussed above.
Applying these values to our example formula:
While these examples are drawn from clinical research, it is not difficult to substitute the population, the variable of interest, and the treatment and control groups from a basic science research study into the formulas.
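As an illustration of such a substitution, the sketch below wraps one common normal-approximation formula for two independent proportions in a function, so that any pair of expected proportions can be plugged in. The formula choice (unpooled variances, equal group sizes, two-sided test), the function name `n_per_group_two_proportions` and the example proportions are all assumptions made here; they are not values from the cited papers.

```python
from math import ceil
from statistics import NormalDist


def n_per_group_two_proportions(p1: float, p2: float,
                                alpha: float = 0.05, power: float = 0.80) -> int:
    """Per-group sample size for comparing two proportions (normal approximation).

    Uses n = (z_alpha + z_beta)^2 * [p1(1-p1) + p2(1-p2)] / (p1 - p2)^2,
    assuming independent groups of equal size and a two-sided test.
    """
    if not (0 < p1 < 1 and 0 < p2 < 1):
        raise ValueError("proportions must lie strictly between 0 and 1")
    if p1 == p2:
        raise ValueError("proportions must differ")
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    variance_sum = p1 * (1 - p1) + p2 * (1 - p2)
    return ceil((z_alpha + z_beta) ** 2 * variance_sum / (p1 - p2) ** 2)


# Hypothetical expected proportions for a treatment and a control group.
n_example = n_per_group_two_proportions(p1=0.30, p2=0.50)
print(f"Required sample size: {n_example} per group")
```

Swapping in the expected proportions from a different study, clinical or basic, changes only the two arguments; the significance level and power defaults can be overridden in the same call.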
Support: Biostatisticians and applications
The optimal support you can obtain is to consult with a biostatistician early in your research design. A biostatistician can help you to:
Four basic information pieces commonly needed in sample size estimates.
A number of applications are available online, free of charge or for purchase, which can facilitate sample size estimation as well as estimation of power when the sample size is fixed. They work well for simple estimations such as the examples above. But as the study design and statistical methods become more complex, these applications can quickly reach their limits. Given the importance of sample size as an integral part of the research plan, we strongly advise that an experienced biostatistician be consulted.
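The reverse calculation mentioned above, estimating power when the sample size is fixed, can be sketched in the same simple setting. The code assumes a two-sided comparison of two means with equal variances and equal group sizes, uses a normal approximation, and ignores the negligible probability of rejecting in the wrong direction. The function name `power_two_means` and the input values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def power_two_means(n_per_group: int, alpha: float, diff: float, sd: float) -> float:
    """Approximate power of a two-sided, two-sample comparison of means.

    Normal approximation with equal variances and equal group sizes.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    standard_error = sd * sqrt(2 / n_per_group)  # SE of the difference in means
    return NormalDist().cdf(abs(diff) / standard_error - z_alpha)


# Hypothetical scenario: only 40 subjects per group can be recruited.
achieved = power_two_means(n_per_group=40, alpha=0.05, diff=5.0, sd=10.0)
planned = power_two_means(n_per_group=63, alpha=0.05, diff=5.0, sd=10.0)
print(f"Power with 40 per group: {achieved:.2f}")
print(f"Power with 63 per group: {planned:.2f}")
```

A calculation of this kind shows how much power is given up when recruitment is capped, which is a useful number to bring to a discussion with a biostatistician.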
Sample size is an essential part of a meaningful study protocol. Derived from the research question and the “who” of the target population, how the sample is selected and how large the sample needs to be are equally important components. The four basic pieces of information needed for sample size calculations, and their importance, need to be understood as well. While basic formulas and applications exist, estimation of sample size can become complex with increasing complexity of the study design and the analysis tools used. There are a variety of sample size estimation applications that can adequately facilitate estimation in less complex study designs. It is recommended that, if possible, a biostatistician be consulted throughout the study, and especially for more complex designs.
This paper deals with a limited range of research designs and sample size estimations in order to introduce and address basic sampling concepts. While there are a wide variety of alternative study designs and many approaches other than those discussed here, the basics that are introduced still apply. Specifically addressing that wide variety of approaches, however, is beyond the scope of this paper.