Solved – How to compute margin of error with a given confidence interval

samplingself-studysurvey

I was given the following question:

A survey found that 89% of a random sample of 1024 American adults approved of cloning endangered animals. Find the margin of error for this survey if we want 90% confidence in our estimate of the percent of American adults who approve of cloning endangered animals.

I know that for 90% Confidence, $\text{ME}\sim 0.82/\sqrt{n}$.

I attempted using this formula with n equal to both 1024 and (.89)1024. I got 0.025625 and 0.02716, respectively. The answer given for the problem is 1.61%. I do not understand where I went wrong. Perhaps I am using the Margin of Error formula incorrectly?

Thanks. 🙂

Best Answer

Because you are dealing with proportions, the variance is given by:

$$\frac{p(1-p)}{n}$$

And so the 90% CI ME is equal to $1.645\times \sqrt{\frac{p(1-p)}{n}}=1.645\times \sqrt{\frac{0.89(1-0.89)}{1024}}=0.016$

Related Solutions

Confidence Interval – Determining Sample Sizes for Binomial Confidence Intervals

(1) Yes.

(2) Yes. There are only $n+1$ possible outcomes for a binomial random variable, so it is possible to look at what happens for each possible outcome - in fact this is faster than simulating lots and lots of outcomes!

Let $X$ be the number of "successes" among the $n$ customers and let $\hat{p}=X/n$. The confidence interval is $\hat{p}\pm z_{\alpha/2}\sqrt{\hat{p}(1-\hat{p})/n}$, so the halfwidth is $z_{\alpha/2}\sqrt{\hat{p}(1-\hat{p})/n}$. Thus we want to compute $P(z_{\alpha/2}\sqrt{\hat{p}(1-\hat{p})/n}\leq 0.005)$. In R, we can do this as follows:

target.halfWidth<-0.005

p<-0.016 #true proportion
n.vec<-seq(from=1000, to=3000, by=100) #number of samples

# Vector to store results
prob.hw<-rep(NA,length(n.vec))

# Loop through desired sample size options
for (i in 1: length(n.vec))
{
n<-n.vec[i]

# Look at all possible outcomes
x<-0:n
p.est<-x/n

# Compute halfwidth for each option
halfWidth<-qnorm(0.95)*sqrt(p.est*(1-p.est)/n)

# What is the probability that the halfwidth is less than 0.005?
prob.hw[i]<-sum({halfWidth<=target.halfWidth}*dbinom(x,n,p))
}

# Plot results
plot(n.vec,prob.hw,type="b")
abline(0.95,0,col=2)

# Get the minimal n required
n.vec[min(which(prob.hw>=0.95))]

The answer is $n=2200$ in this case as well.

Finally, it is usually a good idea to verify that the asymptotic normal approximation interval actually gives the desired coverage. In R, we can compute the coverage probability (i.e. the actual confidence level) as:

p<-0.016
n<-2200
x<-0:n
p.est<-x/n
halfWidth<-qnorm(0.95)*sqrt(p.est*(1-p.est)/n)
# Coverage probability
sum({abs(p-p.est)<=halfWidth}*dbinom(x,n,p))

Different $p$ give different coverages. For $p$ around $0.015$, the actual confidence level of the nominal $90\%$ interval seems to be about $89\%$ in general, which I presume is fine for your purposes.

(3) When you sample from a finite population, the number of successes is not binomial but hypergeometric. If the population is large compared to your sample size, the binomial works just fine as an approximation. If you sample 1000 out of 5000, say, it does not. Have a look at confidence intervals for proportions based on the hypergeometric distribution!

Answers to additional questions:

Let $(p_L,p_U)$ be the confidence interval.

1) In that case you are no longer computing $P(p_L-p_U\leq0.01)$ but $$P\Big(p_L-p_U\leq0.01~\mbox{and}~p\in(p_L,p_U)\Big),$$ i.e. the probability that the length of intervals that actually contain $p$ is at most 0.01. This may be an interesting quantity, depending on what you're interested in...

2) Maybe, but probably not. If the population size is large compared to the sample size you don't need it, and if it's not then the binomial distribution is not appropriate to begin with!

3) Sprop seems to contain confidence intervals based on the hypergeometric intervals, so that should work just fine.

Solved – How does one interpret a single margin of error value for a survey consisting of many questions

Technically, you are quite correct. The confidence interval depends on both N and p. Furthermore, the classical equation for the confidence interval of a binomial proportion isn't always all that good. There is quite a literature on this subject, a good starting point is this article by Agresti and Coull. `R' has implemented various of the suggested alternatives to the simple formula.

However, in practice, it is quite common to cite one CI for all the questions. Often the CI for the different questions do not vary that much. The question then becomes which CI to use as the "general" one. I would hope that they chose one calculated with a relatively small N.

Best Answer

Related Solutions

Confidence Interval – Determining Sample Sizes for Binomial Confidence Intervals

Solved – How does one interpret a single margin of error value for a survey consisting of many questions

Related Question