0% found this document useful (0 votes)

10 views

Chi-Square Test

The document provides an overview of the Chi-square test, a non-parametric statistical method used to assess the goodness of fit between observed and expected frequencies, as well as to test the significance of associations between attributes. It outlines the characteristics, assumptions, and degrees of freedom relevant to the test, along with practical applications and examples demonstrating its use in hypothesis testing. The document concludes with specific case studies illustrating how to apply the Chi-square test in various scenarios.

Uploaded by

MD.Kawsar Alam Tasin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views

Chi-Square Test

Uploaded by

MD.Kawsar Alam Tasin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOC, PDF, TXT or read online on Scribd

You are on page 1/ 5

Chi-Square Test

Dr. M. Shamim Uddin Khan

Introduction: Chi-square test is one of the most commonly used tests in statistics. Generally it is used
for testing hypothesis concerning the distribution of a random variable rather than the parameter of the
distribution, for which it referred to as non-parametric test. Chi-square test is applied in statistics to test
the goodness of fit to verify the distribution of observed data with assumed theoretical distribution.
Therefore, it is a measure to study the divergence of actual and expected frequencies. Chi-square tests
enable us to test whether more than two population proportions can be considered equal. In order that
Chi-square test may be applicable, both the frequencies must be grouped in the same way and the
theoretical distribution must be adjusted to give the same total frequency which is equal to that observed
frequencies. The test is, in fact, a technique through the use of which it is possible for all researchers to
(i) test the goodness of fit (ii) test the significance of association between two attributes, and (iii) test the
homogeneity or the significance of population variance. If the calculated value of is greater than the
tabled value of at certain level of significance, we reject the hypothesis. If there is no difference
between the actual and expected frequencies, is zero. If the calculated value of
is less than the tabled value at certain level of significance, it is said to be non-significant. Thus, the
test describes the discrepancy between theory and observation.
Characteristics of Chi-Square Test:
1. Test is based on events or frequencies, whereas in theoretical distribution, the test is based on
mean and standard deviation.
2. To draw inferences, this test is applied specially testing the hypothesis but not useful for
estimation.
3. The test can be used between the entire set of observed and expected frequencies.
4. For every increase in the number of degree of freedom, a new chi-square distribution is formed.
5. It is a general purpose test and as such is highly useful in research.
Assumptions of Chi-square Test:
1. Observations recorded and used are collected on a random basis.
2. All the observations must be independent.
3. All the events must be mutually exclusive.
4. No group should contain very few items say less than 10. (In case where the frequencies are less
than 10, regrouping is done by combining the frequencies of adjoining groups so that the new
frequencies become greater than10. Some statisticians take this number as 5, but 10 is regarded
as better by most of the statisticians.)
5. The overall number of items must also be reasonably large. (It should normally be at least 50,
however small number of groups may be).
6. For comparison purpose, the data must be in original units.
Degree of Freedom: When we compare the computed value of with the table value, the degree of
freedom is evident. The degree of freedom means the number of classes to which values can be assigned
at will, without violating restrictions. For e.g. we choose any four numbers whose total is 50. Here we
have a choice to select any three numbers say 10, 15 and 20 and the fourth number is 5. Thus our choice
of freedom is reduced by is one and degree of freedom is three. As the restrictions increase, the freedom
is reduced.
Thus, , where V is degree of freedom, k = No. of independent constraints, n = No. of frequency
classes.
For contingency table the degree of freedom is
where c means the number of column and r means number of row.

Uses of :
1. as a Test for Comparing Variance: The Chi-square value is often used to judge the
significance of population variance i.e. we can use the test to judge if a random sample has been drawn
from a normal population with mean and with a specified variance.

where variance of the sample, variance of the population, degree of freedom,

n being the number of items in the sample. Then by comparing the calculated value with the table value
of Chi-square for (n-1) degrees of freedom at a given level of significance, we may either accept or
reject the null hypothesis. If the calculated value of is less than the tabled value at certain level of
significance, the null hypothesis is accepted, but if the calculated value of is equal or greater than the
tabled value of at certain level of significance, the hypothesis is rejected.
Problem 1: Weight of 10 students is as follows:
Student No. 1 2 3 4 5 6 7 8 9 10
Weight (kg) 38 40 45 53 47 43 55 48 52 49
Can we say that the variance of the distribution of weight of all students from which the above sample of
10 students was drawn is equal to 20 kgs? Test this as 5 per cent and 1 per cent level of significance.
Solution: First of all we should work out the variance of the sample data or and the same has been
worked out as under:
Student No. (Weight in kgs)
1 38 -9 81
2 40 -7 49
3 45 -2 4
4 53 6 36
5 47 0 0
6 43 -4 16
7 55 8 64
8 48 1 1
9 52 5 25
10 49 2 4

kgs

= 31.11
Let the hypothesis be . In order to test this hypothesis we work out the value as under:

Degree of freedom in the given case is At 5% level of significance the table value of
and at 1% level of significance, it is 21.67 for 9 d.f. and both these are greater than the
calculated value 13.99. Hence we accept the null hypothesis and conclude that the variance of the given
distribution can be taken as 20kgs at 5% and 1% level of significance. In other words, the sample can be
said to have been taken from a population with variance 20kgs.
2. as a Non-Parametric Test: is an important non-parametric test and as such no rigid
assumptions are necessary in respect of the type of population. We require only the degree of freedom
for using this test. As a non-parametric test, can be used for (i) as a test of goodness of fit and (ii) as a
test of independence. It is calculated with the help of the following formula:

Case (i): as a Test Goodness of Fit: Through the test we can find out the deviations between
the observed values and expected values. Here we are not concerned with the parameters but concerned
with the form of distribution. test enables us to see how well does the assumed theoretical distribution
(such as Binomial distribution, Poisson distribution, or normal distribution) fit to the observed data.
When some theoretical distribution is fitted to the given data, we are always interested in knowing as to
how well this distribution fits with the observed data. The Chi-square test can give answer to this. If the
calculated value of Chi-square is less than the table value at a certain level of significance, the fit is
considered to be good one which means that the divergence between the observed and expected
frequencies is attributable to fluctuations of sampling. But if the calculated value of Chi-square is greater
than its table value, the fit is not considered to be good one.
Problem 2: 4 coins were tossed 160 times and the following results were obtained:
No. of heads 0 1 2 3 4
Observed frequencies 17 52 54 31 6
Under the assumptions that coins are balanced, find the expected frequencies of getting 0, 1, 2, 3 or 4
heads and test the goodness of fit.
Solution: Hypothesis is that the coins are unbiased.
x Expected frequency =
0
1
2
3
4

No O E O-E (O-E)2
of
heads
0 17 10 7 49 4.9
1 52 40 12 144 3.6
2 54 60 -6 36 0.6
3 31 40 -9 81 2.025
4 6 10 -4 16 1.6

d.f = 5 – 1 = 4;
Calculated value is greater than the table value. Therefore the fit is poor.

Case (ii): as a Test of Independence: test can be used to find out whether one or more
attributes are associated or not. For example, coaching class and successful candidate, marriage and
failure etc; we can find out whether they are related or independent. We take a hypothesis that the
attributes are independent. If the calculated value of is less than the table value at a certain level of
significance, the hypothesis is correct and vice versa.
Problem 3: A certain drug was administered to 500 people out of a total of 800 included in the sample
to test its efficacy against typhoid. The results are given below:
Typhoid No Typhoid Total
Drug 200 300 500
No Drug 280 20 300
Total 480 320 800
On the basis of these data, can it be concluded that the drug is effective in preventing typhoid.
Solution: Let the hypothesis be ‘the drug is not effective in preventing typhoid’.
Expected cell frequency =
The table of expected frequency is
500

300

480 320 800

O E O-E (O-E)2

200 300 -100 10000 33.33

280 180 100 10000 55.56
300 200 100 10000 50.00
20 120 -100 10000 83.33
800 800

d.f.1,
The computed value of is much greater than the table value. Therefore, the hypothesis –the drug is
not effective – is rejected. Hence we conclude that the drug is effective in preventing typhoid.
Problem 4: In an experiment on the immunization of goats from anthrax the following results were
obtained. Derive your inference on the vaccine.

Died of Anthrax Survived Total

Inoculated with vaccine 2 10 12
Not inoculated 6 6 12
Total 8 16 24
Solution: Let us take the hypothesis that the vaccine is not effective. Both the attributes are independent.
Expected frequency of any cell =
The table of expected frequency is

8 16 24
O E O-E (O-E)2

2 4 -2 4 1.0
10 8 2 4 0.5
6 4 2 4 1.0
6 8 -2 4 0.5
24 24 0 16

d.f.1,
The computed value of is 3 which is less than the table value. Therefore, the null hypothesis may be
accepted. Hence we conclude that the vaccine is ineffective in controlling the disease.

Pilgrim Bank - Case Study
97% (39)
Pilgrim Bank - Case Study
14 pages
CHAPTER-2 The Effects of Financial Problem To The Academic Performance of Students
100% (12)
CHAPTER-2 The Effects of Financial Problem To The Academic Performance of Students
19 pages
Chi Square Test
100% (1)
Chi Square Test
23 pages
Chi-Square Test 10.4.22
No ratings yet
Chi-Square Test 10.4.22
17 pages
Chi Square Test
No ratings yet
Chi Square Test
24 pages
1 STAT511 U4-1
No ratings yet
1 STAT511 U4-1
45 pages
Chi Square Test
No ratings yet
Chi Square Test
16 pages
Non Parametric Tests
No ratings yet
Non Parametric Tests
22 pages
Chapter 6. Chi-Square Test
No ratings yet
Chapter 6. Chi-Square Test
25 pages
Chi Square
No ratings yet
Chi Square
10 pages
Definition of Chi-Square Test
100% (1)
Definition of Chi-Square Test
8 pages
Module 6 Chi-Square T Z Test
100% (1)
Module 6 Chi-Square T Z Test
72 pages
Chapter 9 - Chi-Square Test
No ratings yet
Chapter 9 - Chi-Square Test
3 pages
The Chi-Square ( ) Test: A Test of Significance
No ratings yet
The Chi-Square ( ) Test: A Test of Significance
40 pages
Module 5a Chi Square - Introduction - Goodness of Fit Test
No ratings yet
Module 5a Chi Square - Introduction - Goodness of Fit Test
39 pages
Module 5 Quiz Rev
No ratings yet
Module 5 Quiz Rev
118 pages
Chapter 6
No ratings yet
Chapter 6
10 pages
Chi Square Test 2
No ratings yet
Chi Square Test 2
27 pages
Chi Square Method
No ratings yet
Chi Square Method
34 pages
Chi Square (KI Square) Test
No ratings yet
Chi Square (KI Square) Test
30 pages
Abisola
No ratings yet
Abisola
12 pages
X Test PDF
No ratings yet
X Test PDF
38 pages
Chi-Square Test: DR Ramakanth
No ratings yet
Chi-Square Test: DR Ramakanth
38 pages
Engineering Mathematics 2
No ratings yet
Engineering Mathematics 2
29 pages
1 - CA51018 - Chi Square - Introduction - Goodness of Fit Test - 2
No ratings yet
1 - CA51018 - Chi Square - Introduction - Goodness of Fit Test - 2
36 pages
Chi - Square Test
No ratings yet
Chi - Square Test
12 pages
chisquaretest
No ratings yet
chisquaretest
16 pages
BS IMI U8 Oct23
No ratings yet
BS IMI U8 Oct23
100 pages
Chi-Square Test: Amaresh Baranwal - 2021301073 Pratham Maner - 2021301074
No ratings yet
Chi-Square Test: Amaresh Baranwal - 2021301073 Pratham Maner - 2021301074
14 pages
Chisquare
No ratings yet
Chisquare
10 pages
Chi-Square Distribution
No ratings yet
Chi-Square Distribution
28 pages
Non-Parametric
No ratings yet
Non-Parametric
37 pages
Chapter Four
No ratings yet
Chapter Four
12 pages
Chi Square and ANOVA
No ratings yet
Chi Square and ANOVA
132 pages
Non Parametric Test
No ratings yet
Non Parametric Test
102 pages
7 Chi-Square and F
No ratings yet
7 Chi-Square and F
68 pages
Chi-Square As A Test For Comparing Variance
No ratings yet
Chi-Square As A Test For Comparing Variance
9 pages
Chapter 11
No ratings yet
Chapter 11
6 pages
Sardilla's Report On Advance Statistic
No ratings yet
Sardilla's Report On Advance Statistic
32 pages
When To Use Chi-Square? Sample Problems
No ratings yet
When To Use Chi-Square? Sample Problems
5 pages
Chi-Square by MPH
No ratings yet
Chi-Square by MPH
55 pages
Stat-213-Chapter-7-2
No ratings yet
Stat-213-Chapter-7-2
18 pages
CH 14
No ratings yet
CH 14
13 pages
Chi_Square test
No ratings yet
Chi_Square test
15 pages
Lecture 17- Ch10- ChiSquare Test
No ratings yet
Lecture 17- Ch10- ChiSquare Test
35 pages
Block-3
No ratings yet
Block-3
68 pages
Chisquare Test
No ratings yet
Chisquare Test
14 pages
ChiSquare Examples
No ratings yet
ChiSquare Examples
22 pages
Stats Chp 3 Notes
No ratings yet
Stats Chp 3 Notes
8 pages
chi-square test
No ratings yet
chi-square test
17 pages
Lecture3 - Contingency Analysis
No ratings yet
Lecture3 - Contingency Analysis
16 pages
Chi Square Handaouts
No ratings yet
Chi Square Handaouts
6 pages
Chi Square Test
No ratings yet
Chi Square Test
23 pages
Chi-square-Lesson
No ratings yet
Chi-square-Lesson
11 pages
Chi Square Test
No ratings yet
Chi Square Test
4 pages
Chi Square
No ratings yet
Chi Square
34 pages
Statistical Theory Lecture 5-2025
No ratings yet
Statistical Theory Lecture 5-2025
13 pages
11 12 .Chi-Square Test
No ratings yet
11 12 .Chi-Square Test
29 pages
Measurement 6th Sem (H) DSE4 Lec 4 05 05 2020
No ratings yet
Measurement 6th Sem (H) DSE4 Lec 4 05 05 2020
19 pages
Ppt - 3 (Chi-square Test)
No ratings yet
Ppt - 3 (Chi-square Test)
12 pages
Precalculus: A Self-Teaching Guide
From Everand
Precalculus: A Self-Teaching Guide
Steve Slavin
4.5/5 (5)
Chi Squared for Beginners
From Everand
Chi Squared for Beginners
Stephanie Glen
No ratings yet
Bluman 5th - ch1 Quiz
No ratings yet
Bluman 5th - ch1 Quiz
3 pages
Comparison of Programs - Fall 2023
No ratings yet
Comparison of Programs - Fall 2023
1 page
The Maslach Burnout Inventory: Testing For Factorial Validity and Invariance Across Elementary, Intermediate and Secondary Teachers
No ratings yet
The Maslach Burnout Inventory: Testing For Factorial Validity and Invariance Across Elementary, Intermediate and Secondary Teachers
16 pages
02 Estimation
No ratings yet
02 Estimation
20 pages
Pes Institute of Technology (Bangalore South Campus) : (Iii) Runge-Kutta Method of 4
No ratings yet
Pes Institute of Technology (Bangalore South Campus) : (Iii) Runge-Kutta Method of 4
4 pages
Clustering
No ratings yet
Clustering
4 pages
Estimating Demand: Regression Analysis
No ratings yet
Estimating Demand: Regression Analysis
29 pages
DSL Lab
No ratings yet
DSL Lab
81 pages
Type 1 and Type 2 Errors
No ratings yet
Type 1 and Type 2 Errors
3 pages
Online Correlation and Regression
No ratings yet
Online Correlation and Regression
6 pages
Guidelines On Research Writeup - Luke Biete
No ratings yet
Guidelines On Research Writeup - Luke Biete
6 pages
MScFE 650 MLF - Video - Transcripts - M3
No ratings yet
MScFE 650 MLF - Video - Transcripts - M3
19 pages
Data Scientist: Nanodegree Program Syllabus
No ratings yet
Data Scientist: Nanodegree Program Syllabus
17 pages
I1945 7103 93 6 712
No ratings yet
I1945 7103 93 6 712
9 pages
Naïve Bayes Classifier
No ratings yet
Naïve Bayes Classifier
18 pages
Unit 1 RMIPR Notes
No ratings yet
Unit 1 RMIPR Notes
22 pages
Report On Mobile Usage
100% (1)
Report On Mobile Usage
28 pages
PQT, Anna Coimbatore Unit 3
No ratings yet
PQT, Anna Coimbatore Unit 3
4 pages
Coefficient Stability
No ratings yet
Coefficient Stability
41 pages
In-Class Practices - Session 1 - Answers
No ratings yet
In-Class Practices - Session 1 - Answers
19 pages
ZEISS PiWeb Brochure En
No ratings yet
ZEISS PiWeb Brochure En
11 pages
Vidya Sagar
No ratings yet
Vidya Sagar
3 pages
Lect 1 A
No ratings yet
Lect 1 A
35 pages
Sampling From Finite Populations: Example 7.7
No ratings yet
Sampling From Finite Populations: Example 7.7
2 pages
Metalearning - A Tutorial: Christophe Giraud-Carrier December 2008
No ratings yet
Metalearning - A Tutorial: Christophe Giraud-Carrier December 2008
45 pages
Lesson 6 Formulating The Hypothesis
No ratings yet
Lesson 6 Formulating The Hypothesis
40 pages
(Ebook) Effect Sizes for Research: A Broad Practical Approach by Robert J. Grissom, John J. Kim ISBN 9780805850147, 0805850147 pdf download
100% (1)
(Ebook) Effect Sizes for Research: A Broad Practical Approach by Robert J. Grissom, John J. Kim ISBN 9780805850147, 0805850147 pdf download
56 pages
Sampling Techniques and Sample Size Calculation: How and How Many Participants Should I Select For My Research?
No ratings yet
Sampling Techniques and Sample Size Calculation: How and How Many Participants Should I Select For My Research?
4 pages

Uploaded by

Uploaded by

Chi-Square Test

Dr. M. Shamim Uddin Khan

where variance of the sample, variance of the population, degree of freedom,

480 320 800

200 300 -100 10000 33.33

Died of Anthrax Survived Total

You might also like