CCSG

Appendix D (Sample Size Determination)

Recommended steps

Define how many nested cells will be relevant for the analysis and what should be the minimal number of cases in each cell allowing for substantial analyses.
Have the survey sponsor specify the desired level of precision.
Convert these 95% confidence intervals into a sampling variance of the mean, [latex]var\left (\bar{y}\right )[/latex].
- Example: the survey sponsor wants a 95% confidence interval of .08 around the statistic of interest. Since the half width of a 95% confidence interval (CI) is [latex]\frac{1}{2}\left(95\% CI\right)=1.96\left ( se(\bar{y})\right )[/latex], this formula can be rearranged with basic algebra to calculate the precision (sampling variance of the mean) from this confidence interval: [latex]var(\bar{y})=\left(se(\bar{y})^2\right)=\left(\frac{.5(95\% CI)}{1.96}\right)^2=\left(\frac{.04}{1.96}\right)^2=.0004165[/latex].
Obtain an estimate of [latex]S^2[/latex] (population element).
If the statistic of interest is not a proportion, find an estimate of [latex]S^2[/latex]from a previous survey on the same target population.
If the statistic of interest is a proportion, the sampler can use the expected value of the proportion ([latex]p[/latex]), even if it is a guess, to estimate [latex]S^2[/latex] by using the formula [latex]S^2=p(1-p)[/latex].
Estimate the needed number of completed interviews for a simple random sample (SRS) by dividing the estimate of [latex]S^2[/latex] by the sampling variance of the mean.
- Example: the obtained estimate of [latex]S^2[/latex] is .6247. Therefore the needed number of completed interviews for an SRS ([latex]n_{srs}[/latex]) is: [latex]n_{srs}=\frac{.6247}{.0004165}=1,499.88\approx1,500[/latex].
- Multiply the number of completed interviews by the design effect to account for a non-SRS design.
  - Example: the design effect of a stratified clustered sample is 1.25. Taking into account the design effect, the number of completed interviews for this complex (i.e., stratified clustered) sample is: [latex]n_{complex}=n_{srs}\times d_{eff}=1,500\times 1.25=1,875[/latex].
- The sample size must account for three additional factors:
- Not all sampled elements will want to participate in the survey (i.e., response rate).
- Not all sampled elements, given the target population, will be eligible to participate (i.e., eligibility rate).
- The frame will likely fail to cover all elements in the survey population coverage rate).
- Calculate the necessary sample size by dividing the number of completed interviews by the expected response rate, eligibility rate, and coverage rate.
- The sampler can estimate these three rates by looking at the rates obtained in previous surveys with the same survey population and survey design.
  - Example: the expected response rate is 75%, the expected eligibility rate is 90%, and the expected coverage rate is 95%. Therefore, the necessary sample size is: [latex]n_{final}=\frac{n_{complex}}{\text{Response rate}\times \text{Eligibility rate}\times \text{Coverage rate}}=\frac{1,875}{.75\times .9\times .95}=2,923.97\approx2924[/latex].