sampling distribution of difference between two proportions worksheet

0 These values for z* denote the portion of the standard normal distribution where exactly C percent of the distribution is between -z* and z*. Center: Mean of the differences in sample proportions is, Spread: The large samples will produce a standard error that is very small. There is no difference between the sample and the population. Normal Probability Calculator for Sampling Distributions statistical calculator - Population Proportion - Sample Size. { "9.01:_Why_It_Matters-_Inference_for_Two_Proportions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.02:_Assignment-_A_Statistical_Investigation_using_Software" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.03:_Introduction_to_Distribution_of_Differences_in_Sample_Proportions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.04:_Distribution_of_Differences_in_Sample_Proportions_(1_of_5)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.05:_Distribution_of_Differences_in_Sample_Proportions_(2_of_5)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.06:_Distribution_of_Differences_in_Sample_Proportions_(3_of_5)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.07:_Distribution_of_Differences_in_Sample_Proportions_(4_of_5)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.08:_Distribution_of_Differences_in_Sample_Proportions_(5_of_5)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.09:_Introduction_to_Estimate_the_Difference_Between_Population_Proportions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.10:_Estimate_the_Difference_between_Population_Proportions_(1_of_3)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.11:_Estimate_the_Difference_between_Population_Proportions_(2_of_3)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.12:_Estimate_the_Difference_between_Population_Proportions_(3_of_3)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.13:_Introduction_to_Hypothesis_Test_for_Difference_in_Two_Population_Proportions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.14:_Hypothesis_Test_for_Difference_in_Two_Population_Proportions_(1_of_6)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.15:_Hypothesis_Test_for_Difference_in_Two_Population_Proportions_(2_of_6)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.16:_Hypothesis_Test_for_Difference_in_Two_Population_Proportions_(3_of_6)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.17:_Hypothesis_Test_for_Difference_in_Two_Population_Proportions_(4_of_6)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.18:_Hypothesis_Test_for_Difference_in_Two_Population_Proportions_(5_of_6)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.19:_Hypothesis_Test_for_Difference_in_Two_Population_Proportions_(6_of_6)" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "9.20:_Putting_It_Together-_Inference_for_Two_Proportions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()" }, { "00:_Front_Matter" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "01:_Types_of_Statistical_Studies_and_Producing_Data" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "02:_Summarizing_Data_Graphically_and_Numerically" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "03:_Examining_Relationships-_Quantitative_Data" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "04:_Nonlinear_Models" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "05:_Relationships_in_Categorical_Data_with_Intro_to_Probability" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "06:_Probability_and_Probability_Distributions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "07:_Linking_Probability_to_Statistical_Inference" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "08:_Inference_for_One_Proportion" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "09:_Inference_for_Two_Proportions" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "10:_Inference_for_Means" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "11:_Chi-Square_Tests" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "12:_Appendix" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()", "zz:_Back_Matter" : "property get [Map MindTouch.Deki.Logic.ExtensionProcessorQueryProvider+<>c__DisplayClass228_0.b__1]()" }, 9.4: Distribution of Differences in Sample Proportions (1 of 5), https://stats.libretexts.org/@app/auth/3/login?returnto=https%3A%2F%2Fstats.libretexts.org%2FCourses%2FLumen_Learning%2FBook%253A_Concepts_in_Statistics_(Lumen)%2F09%253A_Inference_for_Two_Proportions%2F9.04%253A_Distribution_of_Differences_in_Sample_Proportions_(1_of_5), $ \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}}}$ $ \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{#1}}} $$\newcommand{\id}{\mathrm{id}}$ $ \newcommand{\Span}{\mathrm{span}}$ $ \newcommand{\kernel}{\mathrm{null}\,}$ $ \newcommand{\range}{\mathrm{range}\,}$ $ \newcommand{\RealPart}{\mathrm{Re}}$ $ \newcommand{\ImaginaryPart}{\mathrm{Im}}$ $ \newcommand{\Argument}{\mathrm{Arg}}$ $ \newcommand{\norm}[1]{\| #1 \|}$ $ \newcommand{\inner}[2]{\langle #1, #2 \rangle}$ $ \newcommand{\Span}{\mathrm{span}}$ $\newcommand{\id}{\mathrm{id}}$ $ \newcommand{\Span}{\mathrm{span}}$ $ \newcommand{\kernel}{\mathrm{null}\,}$ $ \newcommand{\range}{\mathrm{range}\,}$ $ \newcommand{\RealPart}{\mathrm{Re}}$ $ \newcommand{\ImaginaryPart}{\mathrm{Im}}$ $ \newcommand{\Argument}{\mathrm{Arg}}$ $ \newcommand{\norm}[1]{\| #1 \|}$ $ \newcommand{\inner}[2]{\langle #1, #2 \rangle}$ $ \newcommand{\Span}{\mathrm{span}}$$\newcommand{\AA}{\unicode[.8,0]{x212B}}$. hb```f``@Y8DX$38O?H[@A/D!,,`m0?\q0~g u', % |4oMYixf45AZ2EjV9 The value z* is the appropriate value from the standard normal distribution for your desired confidence level. We write this with symbols as follows: pf pm = 0.140.08 =0.06 p f p m = 0.14 0.08 = 0.06. Research question example. . m1 and m2 are the population means. Suppose we want to see if this difference reflects insurance coverage for workers in our community. We also acknowledge previous National Science Foundation support under grant numbers 1246120, 1525057, and 1413739. The variances of the sampling distributions of sample proportion are. The difference between the female and male sample proportions is 0.06, as reported by Kilpatrick and colleagues. In Inference for One Proportion, we learned to estimate and test hypotheses regarding the value of a single population proportion. When testing a hypothesis made about two population proportions, the null hypothesis is p 1 = p 2. Data Distribution vs. Sampling Distribution: What You Need to Know 3.2.2 Using t-test for difference of the means between two samples. Distribution of Differences in Sample Proportions (1 of 5) That is, we assume that a high-quality prechool experience will produce a 25% increase in college enrollment. Predictor variable. The sampling distribution of averages or proportions from a large number of independent trials approximately follows the normal curve. We cannot make judgments about whether the female and male depression rates are 0.26 and 0.10 respectively. When we calculate the z -score, we get approximately 1.39. If we are estimating a parameter with a confidence interval, we want to state a level of confidence. common core mathematics: the statistics journey Here is an excerpt from the article: According to an article by Elizabeth Rosenthal, Drug Makers Push Leads to Cancer Vaccines Rise (New York Times, August 19, 2008), the FDA and CDC said that with millions of vaccinations, by chance alone some serious adverse effects and deaths will occur in the time period following vaccination, but have nothing to do with the vaccine. The article stated that the FDA and CDC monitor data to determine if more serious effects occur than would be expected from chance alone. Scientists and other healthcare professionals immediately produced evidence to refute this claim. where p 1 and p 2 are the sample proportions, n 1 and n 2 are the sample sizes, and where p is the total pooled proportion calculated as: PDF Chapter 22 - Comparing Two Proportions - Chandler Unified School District The simulation will randomly select a sample of 64 female teens from a population in which 26% are depressed and a sample of 100 male teens from a population in which 10% are depressed. For these people, feelings of depression can have a major impact on their lives. When we calculate the z-score, we get approximately 1.39. We use a normal model to estimate this probability. You select samples and calculate their proportions. It is useful to think of a particular point estimate as being drawn from a sampling distribution. We compare these distributions in the following table. The LibreTexts libraries arePowered by NICE CXone Expertand are supported by the Department of Education Open Textbook Pilot Project, the UC Davis Office of the Provost, the UC Davis Library, the California State University Affordable Learning Solutions Program, and Merlot. The parameter of the population, which we know for plant B is 6%, 0.06, and then that gets us a mean of the difference of 0.02 or 2% or 2% difference in defect rate would be the mean. We also acknowledge previous National Science Foundation support under grant numbers 1246120, 1525057, and 1413739. Skip ahead if you want to go straight to some examples. Its not about the values its about how they are related! <> If the sample proportions are different from those specified when running these procedures, the interval width may be narrower or wider than specified. The students can access the various study materials that are available online, which include previous years' question papers, worksheets and sample papers. In other words, there is more variability in the differences. <> Comparing Two Proportions - Sample Size - Select Statistical Consultants xVO0~S$vlGBH$46*);;NiC({/pg]rs;!#qQn0hs\8Gp|z;b8._IJi: e CA)6ciR&%p@yUNJS]7vsF(@It,SH@fBSz3J&s}GL9W}>6_32+u8!p*o80X%CS7_Le&3`F: The company plans on taking separate random samples of, The company wonders how likely it is that the difference between the two samples is greater than, Sampling distributions for differences in sample proportions. The sample sizes will be denoted by n1 and n2. Here we complete the table to compare the individual sampling distributions for sample proportions to the sampling distribution of differences in sample proportions. Under these two conditions, the sampling distribution of $\hat {p}_1 - \hat {p}_2$ may be well approximated using the . Here's a review of how we can think about the shape, center, and variability in the sampling distribution of the difference between two proportions p ^ 1 p ^ 2 \hat{p}_1 - \hat{p}_2 p ^ 1 p ^ 2 p, with, hat, on top, start subscript, 1, end subscript, minus, p, with, hat, on top, start subscript, 2, end subscript: Recall the AFL-CIO press release from a previous activity. Now we focus on the conditions for use of a normal model for the sampling distribution of differences in sample proportions. When I do this I get To estimate the difference between two population proportions with a confidence interval, you can use the Central Limit Theorem when the sample sizes are large . But does the National Survey of Adolescents suggest that our assumption about a 0.16 difference in the populations is wrong? Requirements: Two normally distributed but independent populations, is known. PDF Unit 25 Hypothesis Tests about Proportions That is, the comparison of the number in each group (for example, 25 to 34) If the answer is So simply use no. Identify a sample statistic. We use a normal model for inference because we want to make probability statements without running a simulation. Since we are trying to estimate the difference between population proportions, we choose the difference between sample proportions as the sample statistic. <>>> Sampling distribution for the difference in two proportions Approximately normal Mean is p1 -p2 = true difference in the population proportions Standard deviation of is 1 2 p p 2 2 2 1 1 1 1 2 1 1. Or, the difference between the sample and the population mean is not . Lets suppose a daycare center replicates the Abecedarian project with 70 infants in the treatment group and 100 in the control group. 2.Sample size and skew should not prevent the sampling distribution from being nearly normal. How to Compare Two Distributions in Practice | by Alex Kim | Towards Lets assume that 9 of the females are clinically depressed compared to 8 of the males. Compute a statistic/metric of the drawn sample in Step 1 and save it. We use a simulation of the standard normal curve to find the probability. In one region of the country, the mean length of stay in hospitals is 5.5 days with standard deviation 2.6 days. Then we selected random samples from that population. This is a test of two population proportions. 9.8: Distribution of Differences in Sample Proportions (5 of 5) Large Sample Test for a Proportion c. Large Sample Test for a Difference between two Proportions d. Test for a Mean e. Test for a Difference between two Means (paired and unpaired) f. Chi-Square test for Goodness of Fit, homogeneity of proportions, and independence (one- and two-way tables) g. Test for the Slope of a Least-Squares Regression Line Applications of Confidence Interval Confidence Interval for a Population Proportion Sample Size Calculation Hypothesis Testing, An Introduction WEEK 3 Module . Regardless of shape, the mean of the distribution of sample differences is the difference between the population proportions, p1 p2. . Legal. 7 0 obj That is, lets assume that the proportion of serious health problems in both groups is 0.00003. 2 0 obj Sampling Distribution: Definition, Factors and Types So this is equivalent to the probability that the difference of the sample proportions, so the sample proportion from A minus the sample proportion from B is going to be less than zero. We will introduce the various building blocks for the confidence interval such as the t-distribution, the t-statistic, the z-statistic and their various excel formulas. The LibreTexts libraries arePowered by NICE CXone Expertand are supported by the Department of Education Open Textbook Pilot Project, the UC Davis Office of the Provost, the UC Davis Library, the California State University Affordable Learning Solutions Program, and Merlot. Short Answer. <> This is equivalent to about 4 more cases of serious health problems in 100,000. We discuss conditions for use of a normal model later. When we select independent random samples from the two populations, the sampling distribution of the difference between two sample proportions has the following shape, center, and spread. Notice the relationship between the means: Notice the relationship between standard errors: In this module, we sample from two populations of categorical data, and compute sample proportions from each. hTOO |9j. A quality control manager takes separate random samples of 150 150 cars from each plant. 5 0 obj If we are conducting a hypothesis test, we need a P-value. Graphically, we can compare these proportion using side-by-side ribbon charts: To compare these proportions, we could describe how many times larger one proportion is than the other.