UG-PSYCHOLOGY, SEMESTER-3, MJC-3, UNIT-4

UnNoticed Digital College March 2, 2025

0 150 13 minutes read

UNIT-4(4.1) Chi square: Concept.

काइ-स्क्वायर परीक्षण (Chi-Square Test): अवधारणा, प्रकार और अनुप्रयोग

परिचय

काइ-स्क्वायर परीक्षण (Chi-Square Test) एक गैर-पैरामीट्रिक (Non-Parametric) सांख्यिकीय परीक्षण है, जिसका उपयोग यह जांचने के लिए किया जाता है कि क्या दो श्रेणीबद्ध (Categorical) चर के बीच कोई महत्वपूर्ण संबंध है या नहीं। यह मनोविज्ञान, जैव विज्ञान, सामाजिक विज्ञान, और व्यवसाय जैसे क्षेत्रों में आवृत्ति डेटा (Frequency Data) के विश्लेषण और परिकल्पना परीक्षण (Hypothesis Testing) के लिए व्यापक रूप से उपयोग किया जाता है।

यह परीक्षण मुख्य रूप से नाममात्र (Nominal) डेटा के लिए उपयोग किया जाता है, जहाँ चर को विभिन्न श्रेणियों में वर्गीकृत किया जाता है लेकिन उनका कोई निश्चित क्रम नहीं होता।

इस लेख में हम निम्नलिखित बिंदुओं को कवर करेंगे:

काइ-स्क्वायर परीक्षण की अवधारणा
काइ-स्क्वायर परीक्षण के प्रकार
- काइ-स्क्वायर समरूपता परीक्षण (Goodness of Fit)
- काइ-स्क्वायर स्वतंत्रता परीक्षण (Test for Independence)
काइ-स्क्वायर परीक्षण की पूर्व-धारणाएँ
काइ-स्क्वायर परीक्षण की गणना (चरण-दर-चरण प्रक्रिया)
परिणामों की व्याख्या
काइ-स्क्वायर परीक्षण के अनुप्रयोग
काइ-स्क्वायर परीक्षण की सीमाएँ

1. काइ-स्क्वायर परीक्षण की अवधारणा

काइ-स्क्वायर परीक्षण क्या है?

काइ-स्क्वायर परीक्षण (χ² टेस्ट) का उपयोग यह निर्धारित करने के लिए किया जाता है कि क्या दो श्रेणीबद्ध चर के बीच कोई महत्वपूर्ण सांख्यिकीय अंतर है।

यह प्रेक्षित (Observed) और अपेक्षित (Expected) आवृत्तियों के बीच के अंतर को मापता है और यह निर्धारित करता है कि यह अंतर मात्र संयोगवश है या दो चर के बीच वास्तव में कोई संबंध है।

काइ-स्क्वायर परीक्षण का सूत्र

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E}

जहाँ:

χ2\chi^2 = काइ-स्क्वायर सांख्यिकीय मान
OO = प्रेक्षित आवृत्ति (Observed Frequency)
EE = अपेक्षित आवृत्ति (Expected Frequency)

यदि χ² का मान बड़ा होता है, तो यह दर्शाता है कि प्रेक्षित और अपेक्षित आवृत्तियों के बीच बड़ा अंतर है, जिससे यह संभावना बढ़ जाती है कि दो चर एक-दूसरे से संबंधित हैं।

2. काइ-स्क्वायर परीक्षण के प्रकार

A. समरूपता के लिए काइ-स्क्वायर परीक्षण (Goodness of Fit Test)

यह परीक्षण यह जांचने के लिए किया जाता है कि क्या कोई विशेष डेटा सेट किसी दिए गए सैद्धांतिक वितरण (Theoretical Distribution) के अनुरूप है या नहीं।

उदाहरण:
एक शोधकर्ता यह परीक्षण करना चाहता है कि क्या तीन अलग-अलग ब्रांडों (A, B, और C) के ग्राहक समान रूप से वितरित हैं।

ब्रांड	प्रेक्षित आवृत्ति (O)	अपेक्षित आवृत्ति (E)
A	40	50
B	60	50
C	50	50

यह परीक्षण यह जांचेगा कि क्या ब्रांडों के बीच पसंद में महत्वपूर्ण अंतर है या नहीं।

B. स्वतंत्रता के लिए काइ-स्क्वायर परीक्षण (Test for Independence)

इस परीक्षण का उपयोग यह पता लगाने के लिए किया जाता है कि दो श्रेणीबद्ध चर स्वतंत्र हैं या संबंधित हैं।

उदाहरण:
एक शोधकर्ता यह परीक्षण करना चाहता है कि लिंग (पुरुष/महिला) और मतदान वरीयता (पार्टी X / पार्टी Y) के बीच कोई संबंध है या नहीं।

लिंग	पार्टी X	पार्टी Y
पुरुष	30	20
महिला	25	25

यदि काइ-स्क्वायर परीक्षण दर्शाता है कि अंतर महत्वपूर्ण है, तो यह निष्कर्ष निकाला जा सकता है कि लिंग और मतदान वरीयता के बीच संबंध है।

3. काइ-स्क्वायर परीक्षण की पूर्व-धारणाएँ

डेटा श्रेणीबद्ध होना चाहिए (जैसे लिंग, व्यवसाय, पसंद)।
सभी अवलोकन स्वतंत्र होने चाहिए (हर डेटा बिंदु एक अलग विषय का प्रतिनिधित्व करता है)।
हर श्रेणी की अपेक्षित आवृत्ति कम से कम 5 होनी चाहिए।
नमूना आकार पर्याप्त रूप से बड़ा होना चाहिए।

4. काइ-स्क्वायर परीक्षण की गणना (चरण-दर-चरण प्रक्रिया)

उदाहरण समस्या

एक सर्वेक्षण में यह जांचा जाता है कि व्यायाम करने की आदतें (नियमित/अनियमित) और हृदय रोग (हां/नहीं) के बीच कोई संबंध है या नहीं।

व्यायाम आदत	हृदय रोग (हां)	हृदय रोग (नहीं)	कुल
नियमित	30	70	100
अनियमित	50	50	100
कुल	80	120	200

चरण 1: अपेक्षित आवृत्तियाँ निकालें

E=पंक्ति योग×स्तंभ योगकुल योगE = \frac{\text{पंक्ति योग} \times \text{स्तंभ योग}}{\text{कुल योग}}

उदाहरण के लिए, नियमित & हां के लिए अपेक्षित आवृत्ति:

E=100×80200=40E = \frac{100 \times 80}{200} = 40

इसी तरह अन्य श्रेणियों के लिए अपेक्षित आवृत्तियाँ निकाली जाती हैं।

चरण 2: काइ-स्क्वायर सूत्र लागू करें

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E}

गणना करने के बाद, मान χ2=8.34\chi^2 = 8.34 प्राप्त होता है।

चरण 3: महत्वपूर्ण मान (Critical Value) से तुलना करें

यदि df = (पंक्ति – 1) × (स्तंभ – 1) = 1 और महत्वपूर्ण मान 3.84 है, और χ² = 8.34 > 3.84, तो शून्य परिकल्पना को अस्वीकार किया जाता है, जिससे निष्कर्ष निकलता है कि व्यायाम करने की आदतें और हृदय रोग के बीच महत्वपूर्ण संबंध है।

5. परिणामों की व्याख्या

यदि χ² का मान महत्वपूर्ण मान से अधिक है, तो शून्य परिकल्पना को अस्वीकार किया जाता है → दोनों चर संबंधित हैं।
यदि χ² का मान महत्वपूर्ण मान से कम है, तो शून्य परिकल्पना अस्वीकार नहीं की जाती → दोनों चर स्वतंत्र हैं।

6. काइ-स्क्वायर परीक्षण के अनुप्रयोग

मनोविज्ञान: तनाव और मुकाबला करने की रणनीतियों के बीच संबंध की जांच।
शिक्षा: विभिन्न शिक्षण विधियों की तुलना।
विपणन: उपभोक्ता पसंद और ब्रांड चयन का विश्लेषण।
चिकित्सा अनुसंधान: जीवनशैली कारकों और बीमारियों के बीच संबंध की जाँच।

7. काइ-स्क्वायर परीक्षण की सीमाएँ

छोटे नमूना आकार के लिए उपयुक्त नहीं।
कारण-प्रभाव (Cause-Effect) संबंध नहीं दर्शा सकता।
केवल श्रेणीबद्ध डेटा पर लागू होता है।

निष्कर्ष

काइ-स्क्वायर परीक्षण एक महत्वपूर्ण सांख्यिकीय तकनीक है जो यह निर्धारित करने में मदद करता है कि क्या दो श्रेणीबद्ध चर स्वतंत्र हैं या संबंधित। इसका सही उपयोग अनुसंधान निष्कर्षों को अधिक विश्वसनीय बनाता है।

UNIT-4(4.1) Chi square: Concept.

Chi-Square Test: Concept, Types, and Applications

Introduction

The Chi-Square test is a non-parametric statistical test used to determine if there is a significant association between categorical variables. It is widely used in fields such as psychology, biology, social sciences, and business to analyze frequency data and test hypotheses.

The Chi-Square test is particularly useful when dealing with nominal (categorical) data, where variables are classified into different groups without any inherent order.

This article will cover:

Concept of the Chi-Square test
Types of Chi-Square tests
- Chi-Square Test for Goodness of Fit
- Chi-Square Test for Independence
Assumptions of the Chi-Square test
Chi-Square Test Calculation (Step-by-Step)
Interpretation of Results
Applications of the Chi-Square Test
Limitations of the Chi-Square Test

1. Concept of the Chi-Square Test

What is the Chi-Square Test?

The Chi-Square test (χ² test) is used to determine whether there is a statistically significant difference between the expected and observed frequencies in one or more categories.

It helps answer questions such as:

“Is there a relationship between gender and voting preference?”
“Do customer preferences for different brands differ significantly?”

The Chi-Square test is based on comparing observed data with expected data under the assumption that there is no relationship between the variables (null hypothesis).

Formula for the Chi-Square Test

The formula for the Chi-Square statistic is:

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E}

Where:

χ2\chi^2 = Chi-Square statistic
OO = Observed frequency (actual data)
EE = Expected frequency (theoretical data based on null hypothesis)

The larger the Chi-Square value, the greater the difference between observed and expected values, indicating a possible relationship between the variables.

2. Types of Chi-Square Tests

A. Chi-Square Test for Goodness of Fit

This test is used to determine if a sample data set fits a specific distribution. It compares observed frequencies with expected frequencies based on a given theoretical model.

Example:
A researcher wants to test whether the distribution of customers across three different brands (A, B, and C) is equal.

Brand	Observed Frequency (O)	Expected Frequency (E)
A	40	50
B	60	50
C	50	50

The researcher uses the Chi-Square test for goodness of fit to see if the differences in observed frequencies are statistically significant.

B. Chi-Square Test for Independence

This test determines whether two categorical variables are independent or related.

Example:
A researcher wants to test whether gender (Male/Female) is related to voting preference (Party X / Party Y).

Gender	Party X	Party Y
Male	30	20
Female	25	25

The Chi-Square test for independence helps determine if gender influences voting preference.

3. Assumptions of the Chi-Square Test

Before using the Chi-Square test, certain assumptions must be met:

Data should be categorical (e.g., gender, occupation, preferences).
Observations should be independent (each data point represents a separate subject).
Expected frequency should be at least 5 for each category.
Sample size should be sufficiently large to ensure accurate results.

4. Chi-Square Test Calculation (Step-by-Step)

Let’s go through an example of the Chi-Square Test for Independence.

Example Problem

A survey is conducted to see if there is a relationship between exercise habits (Regular/Irregular) and Heart Disease (Yes/No) among 200 people.

Exercise Habit	Heart Disease (Yes)	Heart Disease (No)	Total
Regular	30	70	100
Irregular	50	50	100
Total	80	120	200

Step 1: Calculate Expected Frequencies
The expected frequency for each cell is calculated as:

E=Row Total×Column TotalGrand TotalE = \frac{\text{Row Total} \times \text{Column Total}}{\text{Grand Total}}

For Regular & Yes:

E=100×80200=40E = \frac{100 \times 80}{200} = 40

For Regular & No:

E=100×120200=60E = \frac{100 \times 120}{200} = 60

For Irregular & Yes:

E=100×80200=40E = \frac{100 \times 80}{200} = 40

For Irregular & No:

E=100×120200=60E = \frac{100 \times 120}{200} = 60

Step 2: Apply the Chi-Square Formula

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E} χ2=(30−40)240+(70−60)260+(50−40)240+(50−60)260\chi^2 = \frac{(30-40)^2}{40} + \frac{(70-60)^2}{60} + \frac{(50-40)^2}{40} + \frac{(50-60)^2}{60} =(−10)240+(10)260+(10)240+(−10)260= \frac{(-10)^2}{40} + \frac{(10)^2}{60} + \frac{(10)^2}{40} + \frac{(-10)^2}{60} =10040+10060+10040+10060= \frac{100}{40} + \frac{100}{60} + \frac{100}{40} + \frac{100}{60} =2.5+1.67+2.5+1.67=8.34= 2.5 + 1.67 + 2.5 + 1.67 = 8.34

Step 3: Compare with the Critical Value
For df = (rows – 1) × (columns – 1) = (2-1) × (2-1) = 1, the critical value from the Chi-Square table at α = 0.05 is 3.84.

Since χ² = 8.34 > 3.84, we reject the null hypothesis, meaning there is a significant relationship between exercise habits and heart disease.

5. Interpretation of Results

If χ² is greater than the critical value, reject the null hypothesis → There is a significant relationship.
If χ² is less than the critical value, fail to reject the null hypothesis → No significant relationship.

6. Applications of the Chi-Square Test

Psychology: Examining the relationship between stress levels and coping mechanisms.
Education: Analyzing student preferences for different teaching methods.
Marketing: Studying consumer preferences for brands based on demographic groups.
Medical Research: Investigating the link between lifestyle factors and diseases.

7. Limitations of the Chi-Square Test

Cannot be used for small sample sizes (expected frequency < 5).
Does not indicate cause-and-effect relationships.
Only works for categorical data, not numerical data.

Conclusion

The Chi-Square test is a powerful statistical tool used to analyze relationships between categorical variables. By comparing observed and expected frequencies, it helps researchers determine whether variables are independent or related. However, its proper use requires meeting its assumptions and interpreting results correctly.

UNIT-4(4.2) Computation of Chi-Square: Equal Distribution Hypothesis and Independent Hypothesis.

काइ-स्क्वायर (Chi-Square) की गणना: समान वितरण परिकल्पना और स्वतंत्र परिकल्पना

परिचय

काइ-स्क्वायर (χ²) परीक्षण एक महत्वपूर्ण गैर-पैरामीट्रिक (Non-Parametric) सांख्यिकीय परीक्षण है, जिसका उपयोग यह जांचने के लिए किया जाता है कि क्या दो श्रेणीबद्ध (Categorical) चर के बीच कोई महत्वपूर्ण संबंध है। यह मुख्य रूप से आवृत्ति डेटा (Frequency Data) का विश्लेषण करने और परिकल्पनाओं की जांच करने के लिए उपयोग किया जाता है।

काइ-स्क्वायर परीक्षण के दो मुख्य प्रकार हैं:

काइ-स्क्वायर समरूपता परीक्षण (Equal Distribution Hypothesis या Goodness-of-Fit Test) – यह जांचता है कि क्या प्रेक्षित वितरण (Observed Distribution) अपेक्षित वितरण (Expected Distribution) से मेल खाता है।
काइ-स्क्वायर स्वतंत्रता परीक्षण (Independent Hypothesis या Test of Independence) – यह जांचता है कि क्या दो श्रेणीबद्ध चर एक-दूसरे से स्वतंत्र हैं या उनके बीच कोई संबंध है।

इस लेख में निम्नलिखित विषयों को विस्तार से समझाया जाएगा:

काइ-स्क्वायर परीक्षण का सूत्र और अवधारणा
समान वितरण परिकल्पना (Equal Distribution Hypothesis) के लिए काइ-स्क्वायर की गणना
स्वतंत्र परिकल्पना (Independent Hypothesis) के लिए काइ-स्क्वायर की गणना
परिणामों की व्याख्या
काइ-स्क्वायर परीक्षण के अनुप्रयोग
काइ-स्क्वायर परीक्षण की सीमाएँ

1. काइ-स्क्वायर परीक्षण का सूत्र और अवधारणा

काइ-स्क्वायर का गणितीय सूत्र

काइ-स्क्वायर सांख्यिकीय मान की गणना निम्नलिखित सूत्र से की जाती है:

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E}

जहाँ:

χ2\chi^2 = काइ-स्क्वायर सांख्यिकीय मान
OO = प्रेक्षित आवृत्ति (Observed Frequency)
EE = अपेक्षित आवृत्ति (Expected Frequency)

यदि χ² का मान अधिक होता है, तो इसका अर्थ यह होता है कि प्रेक्षित और अपेक्षित डेटा के बीच बड़ा अंतर है, जो यह दर्शाता है कि दो चर एक-दूसरे से जुड़े हो सकते हैं।

2. समान वितरण परिकल्पना (Equal Distribution Hypothesis) के लिए काइ-स्क्वायर की गणना

अवधारणा

इस परीक्षण का उपयोग यह जांचने के लिए किया जाता है कि क्या किसी डेटा का वितरण किसी सैद्धांतिक (Theoretical) या अनुमानित वितरण से मेल खाता है।

उदाहरण के लिए, यदि यह माना जाए कि तीन अलग-अलग ब्रांडों (A, B, और C) की समान मांग है, तो इनका बिक्री वितरण भी समान होना चाहिए।

उदाहरण समस्या

एक शोधकर्ता यह परीक्षण करना चाहता है कि क्या ग्राहक तीन ब्रांडों (A, B, और C) को समान रूप से पसंद करते हैं। 150 ग्राहकों के सर्वेक्षण के बाद निम्नलिखित डेटा प्राप्त हुआ:

ब्रांड	प्रेक्षित आवृत्ति (O)	अपेक्षित आवृत्ति (E)
A	55	50
B	45	50
C	50	50

चरण 1: काइ-स्क्वायर सूत्र लागू करें

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E} =(55−50)250+(45−50)250+(50−50)250= \frac{(55-50)^2}{50} + \frac{(45-50)^2}{50} + \frac{(50-50)^2}{50} =2550+2550+050= \frac{25}{50} + \frac{25}{50} + \frac{0}{50} =0.5+0.5+0= 0.5 + 0.5 + 0 =1.0= 1.0

चरण 2: डिग्री ऑफ फ्रीडम (df) निकालें

df=(k−1)=(3−1)=2df = (k – 1) = (3 – 1) = 2

जहाँ k = श्रेणियों की संख्या।

चरण 3: महत्वपूर्ण मान (Critical Value) से तुलना करें

df = 2 और α = 0.05 पर, महत्वपूर्ण मान = 5.99।

क्योंकि 1.0 < 5.99, हम शून्य परिकल्पना को अस्वीकार नहीं करते, जिसका अर्थ है कि ब्रांड की पसंद में कोई महत्वपूर्ण अंतर नहीं है।

3. स्वतंत्र परिकल्पना (Independent Hypothesis) के लिए काइ-स्क्वायर की गणना

अवधारणा

इस परीक्षण का उपयोग यह जांचने के लिए किया जाता है कि क्या दो श्रेणीबद्ध चर स्वतंत्र हैं या संबंधित हैं।

उदाहरण: क्या लिंग (पुरुष/महिला) और मतदान वरीयता (पार्टी X/पार्टी Y) के बीच कोई संबंध है?

उदाहरण समस्या

200 मतदाताओं पर एक सर्वेक्षण किया गया और निम्नलिखित डेटा प्राप्त हुआ:

लिंग	पार्टी X	पार्टी Y	कुल
पुरुष	40	60	100
महिला	50	50	100
कुल	90	110	200

चरण 1: अपेक्षित आवृत्तियाँ (Expected Frequency) निकालें

E=Row Total×Column TotalGrand TotalE = \frac{\text{Row Total} \times \text{Column Total}}{\text{Grand Total}}

उदाहरण: पुरुष और पार्टी X के लिए अपेक्षित आवृत्ति

E=100×90200=45E = \frac{100 \times 90}{200} = 45

इसी तरह, अन्य अपेक्षित आवृत्तियाँ निकाली जाती हैं।

लिंग	पार्टी X (O)	पार्टी X (E)	पार्टी Y (O)	पार्टी Y (E)
पुरुष	40	45	60	55
महिला	50	45	50	55

चरण 2: काइ-स्क्वायर सूत्र लागू करें

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E}

गणना करने के बाद,

χ2=2.02\chi^2 = 2.02

चरण 3: डिग्री ऑफ फ्रीडम (df) निकालें

df=(rows−1)×(columns−1)=1df = (rows – 1) \times (columns – 1) = 1

महत्वपूर्ण मान (Critical Value) = 3.84 (α = 0.05, df = 1)।
चूंकि 2.02 < 3.84, हम शून्य परिकल्पना को अस्वीकार नहीं कर सकते, यानी लिंग और मतदान वरीयता स्वतंत्र हैं।

4. काइ-स्क्वायर परीक्षण के अनुप्रयोग

मनोविज्ञान: व्यक्तित्व प्रकार और तनाव प्रतिक्रिया का संबंध।
शिक्षा: अलग-अलग शिक्षण विधियाँ और छात्रों का प्रदर्शन।
स्वास्थ्य: जीवनशैली और बीमारियों के बीच संबंध।

5. निष्कर्ष

काइ-स्क्वायर परीक्षण समान वितरण और स्वतंत्रता की परिकल्पना की जाँच करने के लिए एक महत्वपूर्ण सांख्यिकीय उपकरण है। सही गणना और व्याख्या अनुसंधान निष्कर्षों को अधिक विश्वसनीय बनाती है।

UNIT-4(4.2) Computation of Chi-Square: Equal Distribution Hypothesis and Independent Hypothesis.

Computation of Chi-Square: Equal Distribution Hypothesis and Independent Hypothesis

Introduction

The Chi-Square (χ²) test is a widely used non-parametric statistical test that helps determine whether there is a significant association between two categorical variables. It is particularly useful for analyzing frequency data and testing hypotheses in various research fields, such as psychology, social sciences, medicine, and business.

There are two main types of Chi-Square tests:

Chi-Square Goodness-of-Fit Test (Equal Distribution Hypothesis) – Determines if an observed distribution matches an expected distribution.
Chi-Square Test for Independence (Independent Hypothesis) – Determines if two categorical variables are related or independent.

This article will cover:

Concept and formula of the Chi-Square test
Computation of the Chi-Square test for the Equal Distribution Hypothesis
Computation of the Chi-Square test for the Independent Hypothesis
Interpretation of results
Applications of the Chi-Square test
Limitations of the Chi-Square test

1. Concept and Formula of the Chi-Square Test

Chi-Square Formula

The Chi-Square statistic is calculated using the formula:

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E}

Where:

χ2\chi^2 = Chi-Square statistic
OO = Observed frequency (actual count)
EE = Expected frequency (theoretical count based on the hypothesis)

A large Chi-Square value suggests that the observed data does not fit the expected data, indicating a significant relationship or deviation from the expected distribution.

2. Computation of Chi-Square for Equal Distribution Hypothesis (Goodness-of-Fit Test)

Concept

The Goodness-of-Fit test is used when we want to check whether an observed categorical data distribution follows a theoretically expected distribution.

For example, if we assume that customers choose three brands (A, B, and C) equally, the expected frequency should be the same for each brand.

Example Problem

A researcher wants to test if customers prefer three brands (A, B, and C) equally. A sample of 150 customers was surveyed, and their responses were recorded.

Brand	Observed Frequency (O)	Expected Frequency (E)
A	55	50
B	45	50
C	50	50

Step 1: Calculate the Chi-Square Statistic

Using the Chi-Square formula:

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E} =(55−50)250+(45−50)250+(50−50)250= \frac{(55-50)^2}{50} + \frac{(45-50)^2}{50} + \frac{(50-50)^2}{50} =(5)250+(−5)250+(0)250= \frac{(5)^2}{50} + \frac{(-5)^2}{50} + \frac{(0)^2}{50} =2550+2550+0= \frac{25}{50} + \frac{25}{50} + 0 =0.5+0.5+0= 0.5 + 0.5 + 0 =1.0= 1.0

Step 2: Determine the Degrees of Freedom (df)

df=(k−1)=(3−1)=2df = (k – 1) = (3 – 1) = 2

where k is the number of categories.

Step 3: Compare with the Chi-Square Critical Value
From the Chi-Square table, at df = 2 and α = 0.05, the critical value is 5.99.

Since 1.0 < 5.99, we fail to reject the null hypothesis. This means that there is no significant difference in brand preference, and the assumption of equal distribution is valid.

3. Computation of Chi-Square for Independent Hypothesis (Test of Independence)

Concept

The Chi-Square Test for Independence is used when we want to check whether two categorical variables are related or independent.

For example, we might want to check whether gender (Male/Female) is associated with voting preference (Party X/Party Y).

Example Problem

A political analyst surveys 200 voters to see if gender affects voting preference.

Gender	Party X	Party Y	Total
Male	40	60	100
Female	50	50	100
Total	90	110	200

Step 1: Calculate Expected Frequencies (E)
The expected frequency for each cell is calculated as:

E=Row Total×Column TotalGrand TotalE = \frac{\text{Row Total} \times \text{Column Total}}{\text{Grand Total}}

For Male & Party X:

E=100×90200=45E = \frac{100 \times 90}{200} = 45

For Male & Party Y:

E=100×110200=55E = \frac{100 \times 110}{200} = 55

For Female & Party X:

E=100×90200=45E = \frac{100 \times 90}{200} = 45

For Female & Party Y:

E=100×110200=55E = \frac{100 \times 110}{200} = 55

Gender	Party X (O)	Party X (E)	Party Y (O)	Party Y (E)
Male	40	45	60	55
Female	50	45	50	55

Step 2: Compute the Chi-Square Statistic

χ2=∑(O−E)2E\chi^2 = \sum \frac{(O – E)^2}{E} =(40−45)245+(60−55)255+(50−45)245+(50−55)255= \frac{(40-45)^2}{45} + \frac{(60-55)^2}{55} + \frac{(50-45)^2}{45} + \frac{(50-55)^2}{55} =(−5)245+(5)255+(5)245+(−5)255= \frac{(-5)^2}{45} + \frac{(5)^2}{55} + \frac{(5)^2}{45} + \frac{(-5)^2}{55} =2545+2555+2545+2555= \frac{25}{45} + \frac{25}{55} + \frac{25}{45} + \frac{25}{55} =0.56+0.45+0.56+0.45= 0.56 + 0.45 + 0.56 + 0.45 =2.02= 2.02

Step 3: Determine the Degrees of Freedom (df)

df=(rows−1)×(columns−1)=(2−1)×(2−1)=1df = (rows – 1) \times (columns – 1) = (2 – 1) \times (2 – 1) = 1

Step 4: Compare with the Chi-Square Critical Value
From the Chi-Square table, at df = 1 and α = 0.05, the critical value is 3.84.

Since 2.02 < 3.84, we fail to reject the null hypothesis, meaning gender and voting preference are independent.

4. Interpretation of Results

If χ² > critical value, reject the null hypothesis → There is a relationship between variables.
If χ² < critical value, fail to reject the null hypothesis → Variables are independent.

5. Applications of the Chi-Square Test

Psychology: Relationship between stress and coping mechanisms.
Marketing: Brand preference across age groups.
Education: Student performance based on teaching methods.
Healthcare: Disease prevalence based on lifestyle factors.

Conclusion

The Chi-Square test is a crucial statistical tool for analyzing categorical data. The Goodness-of-Fit test checks if an observed distribution matches an expected one, while the Test for Independence determines whether two categorical variables are related.