统计学之估计与置信区间(英文版).pptx
本资料来源本资料来源1Estimation and Confidence Intervals(估计与置信区间)(估计与置信区间)Chapter 9GOALS1.Define a point estimate(点估计).2.Define level of confidence(置信水平).3.Construct a confidence interval for the population mean when the population standard deviation is known.3GOALS4.Construct a confidence interval for a population mean when the population standard deviation is unknown.5.Construct a confidence interval for a population proportion(总体比例).6.Determine the sample size for attribute and variable sampling.4Point Estimates and Confidence Intervals for a MeanlWe will consider two cases,where(1)The population standard deviation()is known,(2)The population standard deviation is unknown.In this case we substitute the sample standard deviation(s)for the population standard deviation().5Population Standard Deviation()KnownlA point estimate is a single statistic used to estimate a population parameter.Suppose Best Buy,Inc.,wants to estimate the mean age of buyers of HD plasma televisions.It selects a random sample of 40 recent purchasers,determines the age of each purchaser,and computes the mean age of the buyers in the sample.lThe mean of this sample is a point estimate of the mean of the population.6Population Standard Deviation()KnownlA point estimate is the statistic(single value),computed from sample information,which is used to estimate the population parameter.The sample mean is a point estimate of the population mean,;p,a sample proportion,is a point estimate of,the population proportion;and s,the sample standard deviation,is a point estimate of the population standard deviation.7Population Standard Deviation()KnownlA confidence interval estimate is a range of values constructed from sample data so that the population parameter is likely to occur within that range at a specified probability.The specified probability is called the level of confidence(置信水平).8A confidence interval estimatelFor reasonably large samples,the results of the central limit theorem allow us to state the following:1.Ninety-five percent of the sample means selected from a population will be within1.96 standard deviations of the population mean.2.Ninety-nine percent of the sample means will lie within 2.58 standard deviations of the population mean.9Factors Affecting Confidence Interval EstimatesThe factors that determine the width of a confidence interval are:1.The sample size,n.2.The variability in the population,usually estimated by s.3.The desired level of confidence.10Finding z-value for 95%Confidence IntervalThe area betweenZ=-1.96 and z=+1.96is 0.9511Interval Estimates-InterpretationFor a 95%confidence interval about 95%of the similarly constructed intervals will contain the parameter being estimated.Also 95%of the sample means for a specified sample size will lie within 1.96 standard deviations of the hypothesized population12Point Estimates and Confidence Intervals for a Mean Known13Point Estimates and Confidence Intervals for a Mean Knownla 95 percent confidence interval:la 99 percent confidence interval:14Point Estimates and Confidence Intervals for a Mean Knownla 90 percent confidence interval15Example:Confidence Interval for a Mean KnownThe American Management Association wishes to have information on the mean income of middle managers in the retail industry.A random sample of 256 managers reveals a sample mean of$45,420.The standard deviation of this population is$2,050.The association would like answers to the following questions:1.What is the population mean?What is a reasonable value to use as an estimate of the population mean?2.What is a reasonable range of values for the population mean?3.What do these results mean?16Example:Confidence Interval for a Mean KnownThe American Management Association wishes to have information on the mean income of middle managers in the retail industry.A random sample of 256 managers reveals a sample mean of$45,420.The standard deviation of this population is$2,050.The association would like answers to the following questions:1.What is the population mean?What is a reasonable value to use as an estimate of the population mean?In this case,we do not know.We do know the sample mean is$45,420.Hence,our best estimate of the unknown population value is the corresponding sample statistic.The sample mean of$45,420 is a point estimate of the unknown population mean.17Example:Confidence Interval for a Mean KnownThe American Management Association wishes to have information on the mean income of middle managers in the retail industry.A random sample of 256 managers reveals a sample mean of$45,420.The standard deviation of this population is$2,050.The association would like answers to the following questions:2.What is a reasonable range of values for the population mean?Suppose the association decides to use the 95 percent level of confidence:The confidence limits are$45,169 and$45,67118Example:Confidence Interval for a Mean KnownThe American Management Association wishes to have information on the mean income of middle managers in the retail industry.A random sample of 256 managers reveals a sample mean of$45,420.The standard deviation of this population is$2,050.The association would like answers to the following questions:3.What do these results mean,i.e.what is the interpretation of the confidence limits$45,169 and$45,671?If we select many samples of 256 managers,and for each sample we compute the mean and then construct a 95 percent confidence interval,we could expect about 95 percent of these confidence intervals to contain the population mean.Conversely,about 5 percent of the intervals would not contain the population mean annual income,19Example:Confidence Interval for a Mean Unknown20Example:Confidence Interval for a Mean Unknown21Characteristics of the t-distribution1.It is,like the z distribution,a continuous distribution.2.It is,like the z distribution,bell-shaped and symmetrical.3.There is not one t distribution,but rather a family of t distributions.All t distributions have a mean of 0,but their standard deviations differ according to the sample size,n.4.The t distribution is more spread out and flatter at the center than the standard normal distribution As the sample size increases,however,the t distribution approaches the standard normal distribution,22Comparing the z and t Distributions when n is small,95%Confidence Level23Confidence Interval Estimates for the MeanUse Z-distributionIf the population standard deviation is known or the sample is at least than 30.Use t-distributionIf the population standard deviation is unknown and the sample is less than 30.24When to Use the z or t Distribution for Confidence Interval Computation25Confidence Interval for the Mean Example using the t-distributionA tire manufacturer wishes to investigate the tread life of its tires.A sample of 10 tires driven 50,000 miles revealed a sample mean of 0.32 inch of tread remaining with a standard deviation of 0.09 inch.Construct a 95 percent confidence interval for the population mean.Would it be reasonable for the manufacturer to conclude that after 50,000 miles the population mean amount of tread remaining is 0.30 inches?26Students t-distribution Table27The manager of the Inlet Square Mall,near Ft.Myers,Florida,wants to estimate the mean amount spent per shopping visit by customers.A sample of 20 customers reveals the following amounts spent.Confidence Interval Estimates for the Mean Using Minitab28Confidence Interval Estimates for the Mean By Formula29Confidence Interval Estimates for the Mean Using Minitab30Confidence Interval Estimates for the Mean Using Excel31Using the Normal Distribution to Approximate the Binomial DistributionTo develop a confidence interval for a proportion,we need to meet the following assumptions.1.The binomial conditions,discussed in Chapter 6,have been met.These conditions are:a.The sample data is the result of counts.b.There are only two possible outcomes.c.The probability of a success remains the same from one trial to the next.d.The trials are independent.2.The values n and n(1-)5.This condition allows us to invoke the central limit theorem and employ the standard normal distribution,to compute a confidence interval.32Confidence Interval for a Population ProportionlPROPORTION The fraction,ratio,or percent indicating the part of the sample or the population having a particular trait of interest.lSAMPLE PROPORTION 33Confidence Interval for a Population ProportionlTo develop a confidence interval for a proportion,we need to meet the following assumptions.l1.The binomial conditions,discussed in Chapter 6,have been met.These conditions are:la.The sample data is the result of counts.We count the number of successes in a fixed number of trials.34Confidence Interval for a Population Proportionlb.There are only two possible outcomes.We usually label one of the outcomes a“success”and the other a“failure.”lc.The probability of a success remains the same from one trial to the next.ld.The trials are independent.This means the outcome on one trial does not affect the outcome on another.35Confidence Interval for a Population Proportionl2.The values nand n(1)should both be greater than or equal to 5.This condition allows us to invoke the central limit theorem and employ the standard normal distribution,that is,z,to complete a confidence interval.36Confidence Interval for a Population Proportion The confidence interval for a population proportion is estimated by:37Confidence Interval for a Population Proportion-ExampleThe union representing the Bottle Blowers of America(BBA)is considering a proposal to merge with the Teamsters Union.According to BBA union bylaws,at least three-fourths of the union membership must approve any merger.A random sample of 2,000 current BBA members reveals 1,600 plan to vote for the merger proposal.What is the estimate of the population proportion?Develop a 95 percent confidence interval for the population proportion.Basing your decision on this sample information,can you conclude that the necessary proportion of BBA members favor the merger?Why?38Self-Review 93lA market survey was conducted to estimate the proportion of homemakers who would recognize the brand name of a cleanser based on the shape and the color of the container.Of the 1,400 homemakers sampled,420 were able to identify the brand by name.39Self-Review 93l(a)Estimate the value of the population proportion.l(b)Develop a 99 percent confidence interval for the population proportion.l(c)Interpret your findings.40Finite-Population Correction FactorlThe populations we have sampled so far have been very large or infinite.lWhat if the sampled population is not very large?lWe need to make some adjustments in the way we compute the standard error of the sample means and the standard error of the sample proportions.41Finite-Population Correction FactorlA population that has a fixed upper bound is finite.l A finite population can be rather small;It can also be very large.lFor a finite population,where the total number of objects or individuals is N and the number of objects or individuals in the sample is n,we need to adjust the standard errors in the confidence interval formulas.42Finite-Population Correction FactorlThis adjustment is called the finite-population correction factor.It is often shortened to FPC and is:43Finite-Population Correction FactorlA population that has a fixed upper bound is said to be finite.lFor a finite population,where the total number of objects is N and the size of the sample is n,the following adjustment is made to the standard errors of the sample means and the proportion:lHowever,if n/N 30)Use the formula below to compute the confidence interval:47CI For Mean with FPC-Example48Self-Review 94lThe same study of church contributions in Scandia revealed that 15 of the 40 families sampled attend church regularly.Construct the 95 percent confidence interval for the proportion of families attending church regularly.Should the finite-population correction factor be used?Why or why not?49Self-Review 9450Selecting a Sample Size(n)There are 3 factors that determine the size of a sample,none of which has any direct relationship to the size of the population.They are:lThe degree of confidence selected.lThe maximum allowable error.lThe variation in the population.51Selecting a Sample Size(n)lThe first factor is the level of confidence.Those conducting the study select the level of confidence.The 95 percent and the 99 percent levels of confidence are the most common,but any value between 0 and 100 percent is possible.The 95 percent level of confidence corresponds to a z value of 1.96,and a 99 percent level of confidence corresponds to a z value of 2.58.The higher the level of confidence selected,the larger the size of the corresponding sample.52Selecting a Sample Size(n)lThe second factor is the allowable error.The maximum allowable error,designated as E,is the amount that is added and subtracted to the sample mean(or sample proportion)to determine the endpoints of the confidence interval.It is the amount of error those conducting the study are willing to tolerate.It is also one-half the width of the corresponding confidence interval.A small allowable error will require a larger sample.A large allowable error will permit a smaller sample.53Selecting a Sample Size(n)lThe third factor in determining the size of a sample is the population standard deviation.If the population is widely dispersed,a large sample is required.On the other hand,if the population is concentrated(homogeneous),the required sample size will be smaller.However,it may be necessary to use an estimate for the population standard deviation.54Selecting a Sample Size(n)lthree suggestions for finding that estimate:1.Use a comparable study.2.Use a range-based approach.3.Conduct a pilot study.lWe can express the interaction among these three factors and the sample size in the following formula.55Sample Size Determination for a VariablelTo find the sample size for a variable:56A student in public administration wants to determine the mean amount members of city councils in large cities earn per month as remuneration for being a council member.The error in estimating the mean is to be less than$100 with a 95 percent level of confidence.The student found a report by the Department of Labor that estimated the standard deviation to be$1,000.What is the required sample size?Given in the problem:lE,the maximum allowable error,is$100lThe value of z for a 95 percent level of confidence is 1.96,lThe estimate of the standard deviation is$1,000.Sample Size Determination for a Variable-Example 157A student in public administration wants to determine the mean amount members of city councils in large cities earn per month as remuneration for being a council member.The error in estimating the mean is to be less than$100 with a 99 percent level of confidence.The student found a report by the Department of Labor that estimated the standard deviation to be$1,000.What is