The Chi-Square Distribution

Test of a Single Variance

OpenStaxCollege

[latexpage]

A test of a single variance assumes that the underlying distribution is normal. The null and alternative hypotheses are stated in terms of the population variance (or population standard deviation). The test statistic is:

\(\frac{\left(n-1\right){s}^{2}}{{\sigma }^{2}}\)

where:

  • n = the total number of data
  • s2 = sample variance
  • σ2 = population variance

You may think of s as the random variable in this test. The number of degrees of freedom is df = n – 1. A test of a single variance may be right-tailed, left-tailed, or two-tailed. [link] will show you how to set up the null and alternative hypotheses. The null and alternative hypotheses contain statements about the population variance.

Math instructors are not only interested in how their students do on exams, on average, but how the exam scores vary. To many instructors, the variance (or standard deviation) may be more important than the average.

Suppose a math instructor believes that the standard deviation for his final exam is five points. One of his
best students thinks otherwise. The student claims that the standard deviation is more than five points. If the student were to conduct a hypothesis test, what would the null and alternative hypotheses be?

Even though we are given the population standard deviation, we can set up the test using the population variance as follows.

  • H0: σ2 = 52
  • Ha: σ2 > 52
Try It

A SCUBA instructor wants to record the collective depths each of his students dives during their checkout. He is interested in how the depths vary, even though everyone should have been at the same depth. He believes the standard deviation is three feet. His assistant thinks the standard deviation is less than three feet. If the instructor were to conduct a test, what would the null and alternative hypotheses be?

H0: σ2 = 32

Ha: σ2 < 32

With individual lines at its various windows, a post office finds that the standard deviation for normally distributed waiting times for customers on Friday afternoon is 7.2 minutes. The post office experiments with a single, main waiting line and finds that for a random sample of 25 customers, the waiting times for customers have a standard deviation of 3.5 minutes.

With a significance level of 5%, test the claim that a single line causes lower variation
among waiting times (shorter waiting times) for customers
.

Since the claim is that a single line causes less variation, this is a test of a single variance.
The parameter is the population variance, σ2, or the population standard deviation, σ.

Random Variable: The sample standard deviation, s, is the random variable.
Let s = standard deviation for the waiting times.

  • H0: σ2 = 7.22
  • Ha: σ2 < 7.22

The word “less” tells you this is a left-tailed test.

Distribution for the test:\({\chi }_{24}^{2}\), where:

n = the number of customers sampled
df = n – 1 = 25 – 1 = 24

Calculate the test statistic:

\({\chi }^{2}=\frac{\left(n\text{ }-\text{ }1\right){s}^{2}}{{\sigma }^{2}}=\frac{\left(25\text{ }-\text{ }1\right){\left(3.5\right)}^{2}}{{7.2}^{2}}=5.67\)

where n = 25, s = 3.5, and σ = 7.2.

Graph:

This is a nonsymmetrical chi-square curve with values of 0 and 5.67 labeled on the horizontal axis. The point 5.67 lies to the left of the peak of the curve. A vertical upward line extends from 5.67 to the curve and the region to the left of this line is shaded. The shaded area is equal to the p-value.

Probability statement:p-value = P ( χ2 < 5.67) = 0.000042

Compare α and the p-value:

α = 0.05p-value = 0.000042α > p-value

Make a decision: Since α > p-value, reject H0. This means that you reject σ2 = 7.22. In other words, you do not think the variation in waiting times is 7.2 minutes; you think the variation in waiting times is less.

Conclusion: At a 5% level of significance, from the data, there is sufficient evidence to
conclude that a single line causes a lower variation among the waiting times or with a single
line, the customer waiting times vary less than 7.2 minutes.

In 2nd DISTR, use 7:χ2cdf. The syntax is
(lower, upper, df) for the parameter list.
For [link], χ2cdf(-1E99,5.67,24). The p-value = 0.000042.

Try It

The FCC conducts broadband speed tests to measure how much data per second passes between a consumer’s computer and the internet. As of August of 2012, the standard deviation of Internet speeds across Internet Service Providers (ISPs) was 12.2 percent. Suppose a sample of 15 ISPs is taken, and the standard deviation is 13.2. An analyst claims that the standard deviation of speeds is more than what was reported. State the null and alternative hypotheses, compute the degrees of freedom, the test statistic, sketch the graph of the p-value, and draw a conclusion. Test at the 1% significance level.

H0: σ2 = 12.22

Ha: σ2 > 12.22

df = 14

chi2 test statistic = 16.39

The p-value is 0.2902, so we decline to reject the null hypothesis. There is not enough evidence to suggest that the variance is greater than 12.22.

In 2nd DISTR, use7:χ2cdf. The syntax is (lower, upper, df) for the parameter list. χ2cdf(16.39,10^99,14). The p-value = 0.2902.

References

“AppleInsider Price Guides.” Apple Insider, 2013. Available online at http://appleinsider.com/mac_price_guide (accessed May 14, 2013).

Data from the World Bank, June 5, 2012.

Chapter Review

To test variability, use the chi-square test of a single variance. The test may be left-, right-, or two-tailed, and its hypotheses are always expressed in terms of the variance (or standard deviation).

Formula Review

\({\chi }^{2}=\)\(\frac{\left(n-1\right)\cdot {s}^{2}}{{\sigma }^{2}}\) Test of a single variance statistic where:

n: sample size

s: sample standard deviation

σ: population standard deviation

df = n – 1 Degrees of freedom

Test of a Single Variance
  • Use the test to determine variation.
  • The degrees of freedom is the number of samples – 1.
  • The test statistic is \(\frac{\left(n–1\right)\cdot {s}^{2}}{{\sigma }^{2}}\), where n = the total number of data, s2 = sample variance, and σ2 = population variance.
  • The test may be left-, right-, or two-tailed.

Use the following information to answer the next three exercises: An archer’s standard deviation for his hits is six (data is measured in distance from the center of the target). An observer claims the standard deviation is less.

What type of test should be used?

a test of a single variance

State the null and alternative hypotheses.

Is this a right-tailed, left-tailed, or two-tailed test?

a left-tailed test

Use the following information to answer the next three exercises: The standard deviation of heights for students in a school is 0.81. A random sample of 50 students is taken, and the standard deviation of heights of the sample is 0.96. A researcher in charge of the study believes the standard deviation of heights for the school is greater than 0.81.

What type of test should be used?

State the null and alternative hypotheses.

H0: σ2 = 0.812;

Ha: σ2 > 0.812

df = ________

Use the following information to answer the next four exercises: The average waiting time in a doctor’s office varies. The standard deviation of waiting times in a doctor’s office is 3.4 minutes. A random sample of 30 patients in the doctor’s office has a standard deviation of waiting times of 4.1 minutes. One doctor believes the variance of waiting times is greater than originally thought.

What type of test should be used?

a test of a single variance

What is the test statistic?

What is the p-value?

0.0542

What can you conclude at the 5% significance level?

Homework

Use the following information to answer the next twelve exercises: Suppose an airline claims that its flights are consistently on time with an average delay of at most 15 minutes. It claims that the average delay is so consistent that the variance is no more than 150 minutes. Doubting the consistency part of the claim, a disgruntled traveler calculates the delays for his next 25 flights. The average delay for those 25 flights is 22 minutes with a standard deviation of 15 minutes.

Is the traveler disputing the claim about the average or about the variance?

A sample standard deviation of 15 minutes is the same as a sample variance of __________ minutes.

225

Is this a right-tailed, left-tailed, or two-tailed test?

H0: __________

H0: σ2 ≤ 150

df = ________

chi-square test statistic = ________

36

p-value = ________

Graph the situation. Label and scale the horizontal axis. Mark the mean and test statistic. Shade the p-value.

Check student’s solution.

Let α = 0.05

Decision: ________

Conclusion (write out in a complete sentence.): ________

How did you know to test the variance instead of the mean?

The claim is that the variance is no more than 150 minutes.

If an additional test were done on the claim of the average delay, which distribution would you use?

If an additional test were done on the claim of the average delay, but 45 flights were surveyed, which distribution would you use?

a Student’s t– or normal distribution

For each word problem, use a solution sheet to solve the hypothesis test problem. Go to [link] for the chi-square solution sheet. Round expected frequency to two decimal places.

A plant manager is concerned her equipment may need recalibrating. It seems that the actual weight of the 15 oz. cereal boxes it fills has been fluctuating. The standard deviation should be at most 0.5 oz. In order to determine if the machine needs to be recalibrated, 84 randomly selected boxes of cereal from the next day’s production were weighed. The standard deviation of the 84 boxes was 0.54. Does the machine need to be recalibrated?

Consumers may be interested in whether the cost of a particular calculator varies from store to store. Based on surveying 43 stores, which yielded a sample mean of 💲84 and a sample standard deviation of 💲12, test the claim that the standard deviation is greater than 💲15.

  1. H0: σ = 15
  2. Ha: σ > 15
  3. df = 42
  4. chi-square with df = 42
  5. test statistic = 26.88
  6. p-value = 0.9663
  7. Check student’s solution.
    1. Alpha = 0.05
    2. Decision: Do not reject null hypothesis.
    3. Reason for decision: p-value > alpha
    4. Conclusion: There is insufficient evidence to conclude that the standard deviation is greater than 15.

Isabella, an accomplished Bay to Breakers runner, claims that the standard deviation for her time to run the 7.5 mile race is at most three minutes. To test her claim, Rupinder looks up five of her race times. They are 55 minutes, 61 minutes, 58 minutes, 63 minutes, and 57 minutes.

Airline companies are interested in the consistency of the number of babies on each flight, so that they have adequate safety equipment. They are also interested in the variation of the number of babies. Suppose that an airline executive believes the average number of babies on flights is six with a variance of nine at most. The airline conducts a survey. The results of the 18 flights surveyed give a sample average of 6.4 with a sample standard deviation of 3.9. Conduct a hypothesis test of the airline executive’s belief.

  1. H0: σ ≤ 3
  2. Ha: σ > 3
  3. df = 17
  4. chi-square distribution with df = 17
  5. test statistic = 28.73
  6. p-value = 0.0371
  7. Check student’s solution.
    1. Alpha: 0.05
    2. Decision: Reject the null hypothesis.
    3. Reason for decision: p-value < alpha
    4. Conclusion: There is sufficient evidence to conclude that the standard deviation is greater than three.

The number of births per woman in China is 1.6 down from 5.91 in 1966. This fertility rate has been attributed to the law passed in 1979 restricting births to one per woman. Suppose that a group of students studied whether or not the standard deviation of births per woman was greater than 0.75. They asked 50 women across China the number of births they had had. The results are shown in [link]. Does the students’ survey indicate that the standard deviation is greater than 0.75?

# of births Frequency
0 5
1 30
2 10
3 5

According to an avid aquarist, the average number of fish in a 20-gallon tank is 10, with a standard deviation of two. His friend, also an aquarist, does not believe that the standard deviation is two. She counts the number of fish in 15 other 20-gallon tanks. Based on the results that follow, do you think that the standard deviation is different from two? Data: 11; 10; 9; 10; 10; 11; 11; 10; 12; 9; 7; 9; 11; 10; 11

  1. H0: σ = 2
  2. Ha: σ ≠ 2
  3. df = 14
  4. chi-square distiribution with df = 14
  5. chi-square test statistic = 5.2094
  6. p-value = 0.0346
  7. Check student’s solution.
    1. Alpha = 0.05
    2. Decision: Reject the null hypothesis
    3. Reason for decision: p-value < alpha
    4. Conclusion: There is sufficient evidence to conclude that the standard deviation is different than 2.

The manager of “Frenchies” is concerned that patrons are not consistently receiving the same amount of French fries with each order. The chef claims that the standard deviation for a ten-ounce order of fries is at most 1.5 oz., but the manager thinks that it may be higher. He randomly weighs 49 orders of fries, which yields a mean of 11 oz. and a standard deviation of two oz.

You want to buy a specific computer. A sales representative of the manufacturer claims that retail stores sell this computer at an average price of 💲1,249 with a very narrow standard deviation of 💲25. You find a website that has a price comparison for the same computer at a series of stores as follows: 💲1,299; 💲1,229.99; 💲1,193.08; 💲1,279; 💲1,224.95; 💲1,229.99; 💲1,269.95; 💲1,249. Can you argue that pricing has a larger standard deviation than claimed by the manufacturer? Use the 5% significance level. As a potential buyer, what would be the practical conclusion from your analysis?

The sample standard deviation is 💲34.29.

H0 : σ2 = 252

Ha : σ2 > 252

df = n – 1 = 7.

test statistic: \({x}^{2}= {x}_{7}^{2}= \frac{\left(n–1\right){s}^{2}}{{25}^{2}}= \frac{\left(8–1\right){\left(34.29\right)}^{2}}{{25}^{2}}=13.169\);

p-value: \(P\left({x}_{7}^{2}>13.169\right)=1–P\left({x}_{7}^{2} \le 13.169\right)=0.0681\)

Alpha: 0.05

Decision: Do not reject the null hypothesis.

Reason for decision: p-value > alpha

Conclusion: At the 5% level, there is insufficient evidence to conclude that the variance is more than 625.

A company packages apples by weight. One of the weight grades is Class A apples. Class A apples have a mean weight of 150 g, and there is a maximum allowed weight tolerance of 5% above or below the mean for apples in the same consumer package. A batch of apples is selected to be included in a Class A apple package. Given the following apple weights of the batch, does the fruit comply with the Class A grade weight tolerance requirements. Conduct an appropriate hypothesis test.

(a) at the 5% significance level

(b) at the 1% significance level

Weights in selected apple batch (in grams): 158; 167; 149; 169; 164; 139; 154; 150; 157; 171; 152; 161; 141; 166; 172;

License

Icon for the Creative Commons Attribution 4.0 International License

Test of a Single Variance Copyright © 2013 by OpenStaxCollege is licensed under a Creative Commons Attribution 4.0 International License, except where otherwise noted.