Solved – How to get any quantiles given median value and margin of error

inferencequantilessample-sizesampling

I am trying to get the values of the 25th and 75th quantile of the population based on two values that summarizes the samples:

median value
90 percent margin of error

I don't have any other information including the sample size, standard error, etc.. And I think it 's safe to assume the samples were drawn from a normal distribution.

The 90 percent margin of error in the original document is described as follows:

The degree of uncertainty for an estimate arising from sampling variability is represented through the use of a margin of error. The value shown here is the 90 percent margin of error. The margin of error can be interpreted roughly as providing a 90 percent probability that the interval defined by the estimate minus the margin of error and the estimate plus the margin of error (the lower and upper confidence bounds) contains the true value.

Edit: added the description of the margin of error to clarify the question.

Best Answer

Assume a sample size of 200, with mean (mu) = 20, and standard deviation (sigma) = 10.

import numpy as np

mu, sigma = 20, 10 # mean and standard deviation
s = np.random.normal(mu, sigma, 200)

np.quantile(s, 0.25)
np.quantile(s, 0.75)

I'm using Python for this example, but you can see that we are:

1) generating an array of 200 normally distributed random numbers

2) Obtaining the 25th and 75th quantile.

>>> np.quantile(s, 0.25)
11.700325588242732
>>> np.quantile(s, 0.75)
26.11671871467393

Now, when you say "90% margin of error", I am assuming you mean a 90% "confidence interval". In this case, your margin of error is 10%.

Using the scipy library (also from Python), we can obtain a 90% confidence interval as follows:

from scipy import stats
stats.norm.interval(0.90, loc=mu, scale=s)

More detail can be found on the above here.

You can now see that we generate an array where the values would fall within the 90% confidence interval:

>>> stats.norm.interval(0.90, loc=mu, scale=s)
(array([-20.1017426 , -50.41395259, -15.74140484, -34.9162548 ,
       -14.55505407, -26.20186343,  -8.38349335, -28.15329328,
............
         0.3405667 ,  14.1913693 , -44.18605464, -18.30478346]), 
array([60.1017426 , 90.41395259, 55.74140484, 74.9162548 , 54.55505407,
       66.20186343, 48.38349335, 68.15329328, 42.42820445, 70.17147704,
............
       55.23983044, 41.10373296, 51.30638793, 57.20990033, 47.99641712]))

The above is obviously dependent on which software you are using and what dataset you are working with, but hopefully you might find these guidelines useful.

Related Solutions

Sampling – Determining Sample Size When Given Confidence Interval and Margin of Error in a Finite Population

The formulas concerning the calculation of the sample size to estimate a proportion $p$ in for finite populations are provided on this website. It also contains the derivations.

In short, the sample size necessary for estimating a population proportion $p$ of a finite population with $(1-\alpha)100\%$ confidence and a margin of error no larger than $\epsilon$ is:

$$ n = \frac{m}{1+\frac{m-1}{N}} $$ where $$ m=\frac{z_{1-\alpha/2}^{2}\hat{p}(1-\hat{p})}{\epsilon^{2}}. $$ $N$ denotes the population size, $z_{1-\alpha/2}$ the $(1-\alpha/2)$-quantile of the standard normal distribution and $\hat{p}$ the estimated proportion.

For $N=580, \alpha=0.05, \epsilon=0.1, \hat{p}=0.5$ (i.e. 95% confidence), we get $n\approx 83$. If we take $\alpha = 0.1$ which corresponds to 90% confidence, we get $n\approx 61$.

The more precise we want our estimate of the popultion proportion to be, the higher our sample size needs to be.

This means that the lower $\alpha$, the higher the necessary sample size will be. The following graph illustrates this (for $N=580, \epsilon=0.1, \hat{p}=0.5$): Sample size alpha

The necessary sample size will also increase with decreasing margin of error $\epsilon$ (note the reversed $x$-axis; graph for $N=580, \alpha=0.05, \hat{p}=0.5$):

Sample size margin of error

Sample Size Calculation – Determining Sample Size for Given Confidence Interval and Margin of Error

In order to find the required sample size $n,$ you need a confidence level (such as $.95 = 95\%)$ and a margin of error (such as $\pm .03 = \pm 3\%).$

So that an explicit answer to your question doesn't get lost in a longer explanation of confidence intervals: No. There's no restriction that the confidence level and the margin of error must add to $1.$

The calculator in the link also asks for a population size, but that is not important unless you're thinking you might sample more than 10% of the population. So if this is for a nationwide poll in a large country with millions of eligible subjects, you can ignore that part. (If you're using the calculator in the link, you'd enter something like $10\,000\,000).$

The margin of error for a 95% confidence interval from a poll is $\pm 1.96\sqrt{\frac{p(1-p)}{n}},$ where $n$ is the sample size and $p$ is the true population proportion with the relevant attribute (such as favoring Proposition A on in an upcoming election).

The margin of error is the proportion (percentage in your link) that determines the width of your confidence interval. Maybe you'd like to say that the true proportion is $0.55 \pm 0.03$ or $55\% \pm 3\%.$ Then $E = .03 = 3\%.$

Not knowing $p,$ you could either guess what $p$ might be, or take the worst case, which is $p = 1/2$ (giving the largest possible margin of error). Then for a 95% confidence interval (CI), you'd have a CI of the form $\hat p \pm E.$ So $E=1.96\sqrt{\frac{p(1-p)}{n}}.$ If you're taking $p = 1/2,$ then you have $E = 1.96\sqrt{.25/n} \approx 1/\sqrt{n}.$ So, if $E = 3\%,$ then $n \approx 1/(.03^2) = 1111$ subjects.

Note: Here's why I say that $p = 1/2$ is the 'worst case', leading to the largest margin of error. The factor $Q = p(1 - p)$ in the margin of error reaches its maximum when $p = 1/2.$ So the margin of error $E$ is maximized when $p = 1/2$ and for a fixed value of $E$ that leads to the largest required $n.$

plot(p, Q, type="l", lwd=2)
 abline(v = 1/2, col="green2")

Best Answer

Related Solutions

Sampling – Determining Sample Size When Given Confidence Interval and Margin of Error in a Finite Population

Sample Size Calculation – Determining Sample Size for Given Confidence Interval and Margin of Error

Related Question