Chi-Square Data Analysis
Prior to beginning work on this assignment, review Chapter 10 in your course textbook, pages 109 to 120 in Chapter 7 of the Jarman e-book, and the Week 2 Instructor Guidance. Also, review The Chi-Square Test: Often Used and More Often Misinterpreted and The Chi-Square Test of Independence articles Also, review the Two-Way Tables, Chi-Square: Lecture 11, Chi-Square Tests: Crash Course Statistics #29 (Links to an external site.), and Chi-Square Test for Association (Independence) | AP Statistics | Khan Academy (Links to an external site.) videos; and review the How to Chi-Square Test (Links to an external site.) and How to Interpret Chi-Squared (Links to an external site.) web articles. Also, complete the Week 2 learning activity and Week 2 weekly review.
Your instructor will post an announcement with the scenario and data set for your Week 2 assignment. For the calculations in this assignment, you may use either Excel or the free VassarStats: Website for Statistical Computation (Links to an external site.) program online. Instructions for performing a chi-square test of independence in Excel are included in Section 10.4 of the textbook, accompanied by a screencast demonstrating the process in the electronic version. Screencasts showing how to do a chi-square goodness-of-fit test and a chi-square test of independence in VassarStats are included in the Week 2 learning activity, which you may review at any time. For this test, you may prefer VassarStats (Links to an external site.) because it is more automated than Excel. You also have the option of calculating the chi-square manually but if you choose this option, you must show your work and explain all of the steps taken to solve for the chi-square value and to determine statistical significance.
In your paper, begin with a paragraph introducing the scenario and explaining why a chi-square test is needed for the situation. Then, address the following:
- Identify which chi-square test you used and which program or procedure you used for the analysis.
- Describe the procedure you used to calculate the chi-square.
- If using VassarStats (Links to an external site.), copy and paste the data entry area and output into your paper. If using Excel, submit the Excel spreadsheet separately and mention in your paper that the spreadsheet is attached. If using hand calculations, include a table showing the observed and expected frequencies, along with the row and column totals and chi-square calculations.
- Report the chi-square value, degrees of freedom, and the p
- Explain how you determined the p Is the result statistically significant?
- If the test is a chi-square test of independence with a statistically significant result, report the effect size using Cramér’s V.
- Explain the meaning of the results in terms of the scenario.
- Discuss any assumptions, limitations, and implications associated with the situation and analysis.
- Summarize the main points of the paper in a concluding paragraph.
The Chi-Square Data Analysis paper
- Must be two to three double-spaced pages in length (not including title and references pages) and formatted according to APA Style as outlined in the Ashford Writing Center’s APA Style (Links to an external site.)
- Must include a separate title page with the following:
- Title of paper
- Student’s name
- Course name and number
- Instructor’s name
- Date submitted
For further assistance with the formatting and the title page, refer to APA Formatting for Word 2013 (Links to an external site.).
- Must include an introduction and conclusion paragraph. Your introduction paragraph needs to end with a clear statement that indicates the purpose of your paper, to report and explain your analysis of the scenario and data set.
- For assistance on writing Introductions & Conclusions (Links to an external site.), refer to the Ashford Writing Center resources.
- Must use the course text; and Excel, VassarStats, or detailed documentation of hand calculations. Refer to Tables, Images, & Appendices (Links to an external site.) for assistance with formatting data and results tables.
- Must document any information used from sources in APA Style as outlined in the Ashford Writing Center’s APA: Citing Within Your Paper
Reference:
https://digital-films-com.proxy-library.ashford.edu/p_ViewVideo.aspx?xtid=150111
10Nominal Data and the Chi-Square Tests
Jupiterimages/Stockbyte/Thinkstock
Chapter Learning Objectives After reading this chapter, you should be able to do the following:
1. Describe nominal data.
2. Complete and explain the chi-square goodness-of-fit-test.
3. Complete and explain the chi-square test of independence.
tan82773_10_ch10_295-320.indd 295 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.1 Nominal Data
Introduction When an important development in statistical analysis took place in the early part of the 20th century, more often than not Karl Pearson was associated with it. As the text previously noted, many of those who made important contributions were members of the department that Pear- son founded at University College London. Those who gravitated to Pearson’s department included William Sealy Gosset, who developed the t tests; R. A. Fisher, who developed analysis of variance; and Charles Spearman, who did the early work on factor analysis. Although social relations among these men were not always harmonious, they were enormously productive scholars, and this was particularly true of Pearson. Besides the correlation coefficient named for him, Pearson developed an analytical approach related to Spearman’s factor analysis called principal components analysis, as well as the procedures that are the subjects of this chapter, the chi-square tests. (The Greek letter chi [χ] is pronounced “kye” and rhymes with sky. Chi is the Greek equivalent of the letter c, rather than the letter x, which it resembles.)
10.1 Nominal Data With the exception of Spearman’s rho in Chapter 8, Chapters 1 through 9 have focused on pro- cedures designed for interval or ratio data. Sometimes, however, the data are neither interval scale nor the ordinal-scale data that Spearman’s rho accommodates. When the data are nomi- nal scale, researchers often use one of the chi-square tests.
Because our focus has been so much on interval- and ratio-scale data, it might be helpful to review what makes data nominal scale. Nominal data either fit a category or do not, which is why they are sometimes referred to as “categorical data.” Because of this presence-or- absence quality, analyses of nominal data are based on counting how frequently they occur, and for that reason they are also called “count data.” Compared to ratio, interval, and even ordinal data, nominal data provide relatively little information. They reveal only the presence or absence of a characteristic, not how much of the characteristic, or how the individual’s pos- session of the characteristic compares to others in the category. To illustrate: when people are classified according to whether they are
1. left-handed or right-handed, or 2. Buddhist, Jewish, Muslim, or 3. African American, Hispanic, or Native American, or 4. blue-eyed or brown-eyed, or 5. introverted or extroverted,
then the resulting data are nominal scale.
Parameters and Tests for Nominal Data Because data of different scales provide different kinds of information, the statistical pro- cedure used in their analyses is tailored accordingly. Because nominal data concerns itself with frequency, the related analytical procedures—in this instance, the chi-square tests—are based on how many individuals are in a particular category. To put it simply, the measure- ment procedure for chi-square is counting.
tan82773_10_ch10_295-320.indd 296 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.2 The Chi-Square Tests
Recall from Chapter 8 that tests for nominal data are nonparametric tests. The “no parame- ters” element means that employing these tests does not obligate the researcher to meet most of the traditional parameters, or requirements, for statistical tests. The t tests and ANOVA, for example, require that the dependent variable be normally distributed in its population. The Pearson correlation and ordinary least-squares regression upon which it is based (Chapter 9) also require that the x and y variables be normally distributed. Like Spearman’s rho (Chapter 8), which is also a nonparametric test, the chi-square tests set normality and homogeneity requirements aside; they are “distribution free” tests. However, in the statistical equivalent of no such thing as a free lunch, all of this analytical flexibility has a cost. The chi-square’s draw- back has to do with the power of the test, which the chapter will later discuss.
When working with nominal data, most of the descriptive statistics used to this point are irrelevant. As the most frequently occurring value, the mode, of course, can still be calculated, but the means and medians to which we compared the mode in order to determine skew require at least interval data. Nominal data offer no standard deviation or range values to examine to evaluate kurtosis. It is just as well that the chi-square tests are nonparametric since most of the values needed to determine normality are unavailable in any case.
10.2 The Chi-Square Tests This chapter explains two chi-square tests. The analysis in both tests is based on comparing the frequency (count) with which something actually occurs, compared to the frequency with which it is expected to occur.
The first test is called the 1 3 k (“one by kay”), or the goodness-of-fit chi-square test. Like the independent variable in the one-way ANOVA, this test accommodates just one variable, but that one variable can have any number of categories greater than one. For instance, a psychologist could analyze whether those participating in court-ordered group therapy ses- sions for drug addiction represent some vocations more than others. In that case, the variable is vocation. It can have any number of manifestations (clerical workers, laborers, the unem- ployed, educators, and so on), but the only variable is vocation.
The second chi-square test the chapter takes up is called the r 3 k (“are by kay”), or the chi- square test of independence. This test accommodates two variables. Each of the two vari- ables can be further divided into any number of categories. A researcher might be interested in whether marital status (single never-married, married, divorced) is related to graduating on-time among university students (graduated within four years, did not graduate within four years).
The Goodness-of-Fit or 1 3 k Chi-Square Test This test asks whether an outcome is different enough from an initial hypothesis that research should conclude that the difference is not likely to have occurred by chance. The focus on whether an outcome might be expected to have occurred by chance makes the 1 3 k like all significance tests. The important difference is that it accommodates a nominal-scale, dependent variable. For some illustrations of problems that might involve the 1 3 k chi-square, consider the following:
tan82773_10_ch10_295-320.indd 297 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.2 The Chi-Square Tests
Those responsible for recruitment in the universi- ty’s college of social sciences wonder whether opt- ing for a psychology major relates to the potential students’ gender. The variable is the gender of the student, with two categories: female and male. The research questions whether, in a randomly selected group of psychology majors, male or female stu- dents occur with significantly different frequencies.
This problem is similar to an independent groups t test in that it has two independent categories. The difference in the two tests is whether the count or frequency with which subjects occur in each cate- gory significantly strays from a pre-determined hypothesis, rather than whether the groups’ means,
which nominal data cannot provide, are significantly different from each other.
In a second example, a military psychologist wants to know whether recruits represent urban, suburban, semi-rural, and rural backgrounds in similar proportions. The psychologist selects a random sample of 50 recent recruits and determines their demographic origins. The variable is the population characteristics of the recruits’ origins. In the absence of information to the contrary, the researcher’s hypothesis is probably that recruits come from different areas of the
country in equal proportions. If the psychologist deter- mines that twice as many people live in suburban areas as in semi-rural areas, however, perhaps the correspond- ing hypothesis is that recruits from suburban areas will be twice as numerous as those from rural areas. The psy- chologist might also hypothesize that patriotism, which may affect the individual’s desire to join the military, runs higher in rural than in urban populations, so that
the expectation is that rural recruits will occur in greater proportions than those from urban environments. With multiple groups represented in this hypothetical problem, it bears some similarity to a one-way ANOVA, but without any sums of squares to analyze.
Without wishing to belabor the point, the independent t test and analysis of variance divide subjects into two or more categories, with each category characterized by a different level, or manifestation, of the independent variable. The study analyzes how the different levels affect some other variable, the dependent variable. The chi-square similarly has two or more categories, but it analyzes the frequency with which individuals are distributed into those different categories.
Observed and Expected Frequencies To restate our approach, then, the measurement involved in chi-square analysis is simply counting. Researchers who use this analysis are interested in the frequency with which some- thing occurs in a category. More specifically, rather than comparing sample means to popu- lation means, or sample means to each other, chi square examines differences between the frequency with which individuals occur in a particular category (symbolized by fo), and the frequency with which they were expected to occur (symbolized by fe).
Try It!: #1 How many variables will the 1 3 k chi- square accommodate?
Fuse/Thinkstock
Do women and men pursue psychology majors in equal numbers? A 1 3 k chi- square test will provide an answer.
tan82773_10_ch10_295-320.indd 298 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.2 The Chi-Square Tests
The fe and fo values are simply the number of observations in each category; they are fre- quency counts. When the expected number varies sufficiently from the observed number, the result is statistically significant.
The Chi-Square Test Statistic The test statistic for the chi-square test is as follows:
Formula 10.1
χ2 5 ∑ (fo 2 fe)2
fe
where
χ2 5 the value of the chi-square statistic
fo 5 the frequency observed in the particular category
fe 5 the frequency expected in the particular category
Studying the test statistic for the chi-square test is quite revealing. To calculate the value of this statistic, start with these steps:
1. Count the number in each category (fo). 2. Determine the number expected in each category (fe). When the assumption is that
all categories are equal, this will be the total number of subjects divided by the number of categories.
3. As a quick check before continuing, note that the sum of the fe categories must equal the sum of the fo categories. Then, perform the following mathematical operations:
a. Subtract fe from fo.
b. Square the difference.
c. Divide the squared difference by fe.
d. Sum the squared differences divided by fe across the categories.
e. Compare to the critical value of chi-square for the number of categories, minus 1 degree of freedom. (The critical values of chi-square appear in Table 10.2.)
A Goodness-of-Fit (1 3 k Chi-Square) Problem Using the ethnic diversity of voters as an example, a psychologist who has examined voting patterns and ethnicity perhaps wishes to test the assumption that voting in a general election is unrelated to voters’ ethnic group membership. On election day, the psychologist journeys to a polling place in an ethnically diverse part of the city and administers a brief survey to those who have just voted. One question concerns the respondents’ ethnic group. Figure 10.1 shows the data for the 18 people who completed the survey.
tan82773_10_ch10_295-320.indd 299 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
0
1
2
3
4
5
6
7
8
9
BA C D
N u m
b e r
o f re
sp o n d e n ts
Ethnic group
Section 10.2 The Chi-Square Tests
Figure 10.1: Voter participation data
0
1
2
3
4
5
6
7
8
9
BA C D
N u m
b e r
o f re
sp o n d e n ts
Ethnic group
Although the calculations are not difficult, determining the value of chi-square involves some arithmetic. An easy way to keep track of the calculations is to arrange the data into a table like Table 10.1. The rows are numbered to be consistent with the numbered steps listed after Formula 10.1 for calculating the chi-square statistic. The results from the survey are the frequency-observed values in the first line of the table. The frequency-expected values are n divided by the number of categories: 18 4 4 5 4.50. That value indicates that if the ethnic group membership of the voters in this group is exactly equivalent, 4.50 of the respondents will declare for each group. Do not let the .50 value in each fe distract you. Although the fo numbers have no chance of any such value, that fe value is the same for all groups; the issue is whether the fo 2 fe differences are significantly different from category to category.
Table 10.1: A goodness-of-fit chi-square problem for voting patterns
Value Ethnic group A Ethnic group B Ethnic group C Ethnic group D
1. fo 500 300 200 8.0
2. fe 4.50 4.50 4.50 4.50
3a. fo 2 fe 0.50 21.500 22.50.0 3.50
3b. fo 2 fe2 0.25 2.25 6.25 12.25
3c. fo 2 fe2 / fe 0.06 0.50 1.39 2.72
3d. χ2 5 ∑ (fo 2 fe)2
fe 5 0.06 1 0.50 1 1.39 1 2.72 5 4.67
Determining Significance For this problem, the value of chi-square is χ2 5 4.67. Having calculated the statistic, the researcher needs something with which to compare it, a critical value, and—as with other tests—the critical value is indexed to degrees of freedom for the problem.
tan82773_10_ch10_295-320.indd 300 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.2 The Chi-Square Tests
• The degrees of freedom for a goodness-of-fit problem are the number of categories in the problem, minus 1.
• With subjects in the voting participation prob- lem divided into four different ethnic groups, there are 4 2 1 5 3 df.
The critical values for chi-square in Table 10.2 (also table B.7 in Appendix B) are arranged by degrees of freedom down the left side, and the level at which the test is conducted across the top.
Table 10.2: The critical values of chi-squared
df p 5 0.05 p 5 0.01 p 5 0.001
1 3.84 6.64 10.83 2 5.99 9.21 13.82 3 7.82 11.35 16.27 4 9.49 13.28 18.47 5 11.07 15.09 20.52 6 12.59 16.81 22.46 7 14.07 18.48 24.32 8 15.51 20.09 26.13 9 16.92 21.67 27.88 10 18.31 23.21 29.59 11 19.68 24.73 31.26 12 21.03 26.22 32.91 13 22.36 27.69 34.53 14 23.69 29.14 36.12 15 25.00 30.58 37.70 16 26.30 32.00 39.25 17 27.59 33.41 40.79 18 28.87 34.81 42.31 19 30.14 36.19 43.82 20 31.41 37.57 45.32 21 32.67 38.93 46.80 22 33.92 40.29 48.27 23 35.17 41.64 49.73 24 36.42 42.98 51.18 25 37.65 44.31 52.62 26 38.89 45.64 54.05 27 40.11 46.96 55.48 28 41.34 48.28 56.89 29 42.56 49.59 58.30 30 43.77 50.89 59.70
Source: Virginia Tech, Quantitative Population Ecology. (n.d.). Table of chi-square statistics. Retrieved from https://web.archive.org/web/20150930232540/http://alexei.nfshost.com/PopEcol/tables/chisq.html
Try It!: #2 Why can chi-square values never be negative?
tan82773_10_ch10_295-320.indd 301 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.2 The Chi-Square Tests
To keep the size of the table manageable, the values are carried to just two decimals. For con- sistency, the final values of chi-square will be also rounded to two decimal places.
The critical value for a chi-square problem with df 5 3 and p 5 0.05 is 7.82. To distinguish the calculated value of chi-square from the critical value, follow the same pattern adopted for the other tests. First, calculate the value from the test results:
χ2 5 4.667 for the calculated value
This value is compared to the critical value, which is indicated by the subscripts for the level of probability of alpha error for the test (0.05) and its degrees of freedom.
χ20.05(3) 5 7.82
With a calculated value less than the critical value from the table, the differences in the ethnic- ity of the voters in these four groups are not statistically significant; the researcher attributes the differences to chance. That may seem like a strange conclusion when the differences in the fo values are so substantial. The explanation goes back to the heart of what a goodness-of-fit test is designed to analyze. Pearson focused not on the differences (in this case) between eth- nic groups, but on the differences between what was observed and what could be expected to occur if the initial hypothesis is valid. The comparison is not how ethnic group C compares to ethnic group D, for example, but how the fo and fe values within each category differ. The dif- ference in ethnic group C between 2 (fo) and 4.5 (fe) is a different matter than the difference between 2 (ethnic group C) and 8 (ethnic group D). The result indicates that across the four groups, the difference between fo and fe does not vary enough for the result to be significant. This much difference could have occurred by chance.
The Hypotheses in a Goodness-of-Fit Test Consistent with the other tests of significant differences (z, t, F), the null hypothesis in the chi- square tests is the hypothesis of no difference, symbolized by H0: fo 5 fe. However, as the sym- bols indicate, the difference between what is observed (fo) and what is expected (fe) is what is at issue. When the fo and fe for a particular category show an approximate equivalence, we fail to reject the null hypothesis. The alternate hypothesis is that what is observed is significantly different from the expected, HA: fo ? fe. Literally, the frequency observed does not equal the frequency expected.
In the case of ethnic-group voting behavior, the statistical decision is to fail to reject H0. The differences between what was observed and what was expected across the four groups were not great enough to be statistically significant.
A Goodness-of-Fit Problem with Nonequivalent Frequencies Expected In the ethnicity and voting problem, the researcher tested the assumption that what could be expected (fe) did not differ from group-to-group among the four ethnic groups. However, researchers do not always assume equivalent fe values. When the hypothesis is that the fre- quencies will vary from category to category, the different fe values must be calculated for the categories in order to reflect the hypothesis.
tan82773_10_ch10_295-320.indd 302 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.2 The Chi-Square Tests
Perhaps a psychologist working with the military observes that service personnel exposed to combat situations for more than six months appear to experience post-traumatic stress disor- der (PTSD) about three times more frequently than those with less than six months of combat exposure. To test this hypothesis, the fe values will need to indicate the different expectations. Gathering data for a group of service personnel, the psychologist has the following:
Of 429 service personnel, 154 were exposed to combat situations for less than six months and the other 275 had six months or more of combat exposure.
Those 154 and 275 numbers indicate the fo values for the problem. As always with chi-square problems, the fe values must sum to the same 429 value, but the fe numbers must also reflect the 3-to-1 hypothesis. To determine the fe values, follow these steps:
1. Take the ratio, 3 to 1 in this example. 2. Add the elements of the ratio together: 3 1 1 5 4.3. 3. Divide the total number of subjects, n, by the sum of the ratio elements:
429 4 4 5 107.25
The fe value for those exposed to combat situations for less than six months will be 1 3 107.25. The fe value for those exposed to combat situations for six months or more will be 3 3 107.25 5 321.75.
The balance of the problem involves the same procedure used in Table 10.1 except that there are only two categories. The problem is completed in Table 10.3.
Table 10.3: A goodness-of-fit chi-square with unequal frequencies
Combat experience
Value Less than 6 months 6 months or more
fo 154.0 275.0
fe 107.25 321.75
fo 2 fe 46.75 246.75
fo 2 fe2 2185.56 2185.56
fo 2 fe2 / fe 20.38 6.79
∑ (fo 2 fe)2
fe 5 χ2 5 27.17
Note that the null hypothesis reflects the assumption that there will be no difference between what was expected and what was observed. In this particular problem, the hypothesis is that fo ? fe. What the psychologist expected was a PTSD rate that was three times higher among service personnel who had been exposed to combat situations for six months or more than among personnel who had less than six months of exposure. The value calculated is χ2 5 27.17, and the associated critical value from the table for p 5 0.05 and one degree
tan82773_10_ch10_295-320.indd 303 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.2 The Chi-Square Tests
of freedom is χ20.05(1) 5 3.84. With a calculated value of chi-square higher than the critical value from the table, the result is statistically significant. What does that mean in terms of what the psychologist expected to occur? It means that among these 429 service personnel, H0: fo 5 fe must be rejected. The rate of PTSD is not three times higher among those with six months or more of combat exposure. Just examining the fo values indicates that the PTSD rate is about double for those with six months or more of combat experience compared to those with less than six months of exposure. The psychologist’s expectation does not hold for these personnel.
The Chi-Square and Statistical Power Before a chi-square result is significant, the difference between what is expected and what actually occurs must be substantial. Nominal data cannot match the sophistication of ratio, interval, or even ordinal data, because the data used in a chi-square problem reflect only frequency. They do not contain the information that can indicate the subtle differences in measured qualities that data of the other scales reflect. The analytical price paid for relying exclusively on nominal data is power. Recall that in statistical terms, power refers to the prob- ability of detecting significance.
Users of distribution-free tests like chi-square gain great flexibility. They need not make any judgments about normality or linearity, but as the chapter earlier stated, such flexibility comes at a cost. The flexibility’s ever-present companion is an increased probability of a type II error. The failure to detect significance is higher with these distribution-free tests than with the procedures in the earlier chapters. The departures from the fo 5 fe assumption must be quite extreme before they can be chalked up to anything except sampling variability. That was the situation in the first problem on voter turnout, when it appeared that there were substantial differences in the voting behavior of people of different ethnic backgrounds, but they were nonsignificant nevertheless.
Remember that type I and II errors are related, however. When the likelihood of failing to detect a statistically significant difference is higher than usual (a type II error), the probability of finding significant difference in error (a type I error) is correspondingly reduced. Although the chi-square tests have a relatively high incidence of type II error, at least the probability of type I error is lower than with many of the parametric alternatives.
These characteristics notwithstanding, the loss of power from using a chi-square test is often a nonissue. If the data are nominal scale to begin with, researchers can make no decision about what kind of test to use; their only choice is to use one that accommodates nominal data. Power becomes an issue when data are ordinal scale or higher but some requirement such as normality is suspect. In that case the analyst must make a decision about the best course: rely on a traditional parametric test in spite of suspect normality, or adopt a nonpara- metric test with relaxed requirements but also consequent loss of power.
For example, suppose that the voting-survey researcher also asked respondents how many times in the last 15 years they had participated in elections. The researcher may have intended to use analysis of variance to deter- mine whether ethnic-group differences exist in the level of participation in elections. However, 18 respondents
Try It!: #3 The risk of which type of decision error increases with chi-square problems?
tan82773_10_ch10_295-320.indd 304 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
divided among four ethnic groups is a very small sample size. The two people in ethnic group C provide little basis for completing an ANOVA; the sample is simply too small. With groups so small, just one or two extremely low or extremely high scores will skew results, making normality an issue. In such a case, a shift to a nonparametric test like the goodness-of-fit test, where neither the normality of the data nor the sample size is central, is likely to be more appropriate.
10.3 The Chi-Square Test of Independence Both of the chi-square problems we have worked in this chapter have been goodness-of- fit (1 3 k) tests. Like all goodness-of-fit tests, the first problem involved just one variable, although it was divided into four categories to reflect the ethnicity of the voter. The second problem’s one variable—the incidence of post-traumatic stress disorder among service per- sonnel—was divided into two categories: those deployed to combat situations for less than six months and those deployed for six months or more. The goodness-of-fit test works well for any number of data categories related to a single, nominal-scale, variable.
Sometimes the question is more complex. Maybe the question involves the ethnicity of the respondent and whether the individual voted in the last election. Or perhaps the PTSD problem looks at the incidence among service personnel of different deployment periods and whether the service personnel were men or women. Both of those examples involve two variables. In any statistical analysis, researchers add variables to be able to explain the scoring variability more completely. Although z, t, and one-way ANOVA procedures are extremely important, they, like the goodness-of-fit test, are all restricted to a single independent variable. Relatively few outcomes, particularly related to human subjects, can be adequately explained by a single variable. People are too complicated.
Both the chi-square tests in this chapter compare what is observed to what is expected, but in the goodness-of-fit test, fo to fe differences test a hypothesis about frequencies in categories. The chi-square test of independence uses the fo to fe differences to test whether the two variables being examined, as the name suggests, operate independently of each other. This second chi- square test is also known as the r 3 k chi-square for reasons that will become clear below.
The Hypotheses in the Chi-Square Test of Independence The null and alternate hypotheses look the same as they do in the 1 3 k:
• H0: fo 5 fe • HA: fo ? fe
The hypotheses are reminders that the problem seeks to resolve how the frequencies observed compare to the frequencies expected. As before, H0 is rejected for calculated values of chi-square that are larger than the table value. However, in an r 3 k chi-square problem, the null hypothesis also indicates that the two variables are unrelated: the frequency with which one variable occurs does not affect the frequency of the other. If the null hypothesis is rejected (indicating that the two variables are related), the analysis has another step: determining the strength of the relationship between the variables, as the following example demonstrates.
tan82773_10_ch10_295-320.indd 305 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
A Chi-Square Test of Independence Problem Let us return to the ethnicity and voting behavior problem. The researcher now decides to expand the study to gain a more comprehensive view of how ethnicity and the tendency to vote might be related. With a list of registered voters in hand, the researcher sends out several ques- tionnaires asking, among other things, the individual’s ethnicity and whether the person voted in the last national election. With 36 responses, the researcher has gathered the following data:
Ethnic Group A: Of the 12 respondents, 8 voted
Ethnic Group B: Of the 8 respondents, 2 voted
Ethnic Group C: Of the 8 respondents, 3 voted
Ethnic Group D: Of the 8 respondents, 7 voted
The Contingency Table In this two-variable chi-square test, a table called a contingency table helps to keep the data organized. The subsets of one variable are reflected in the rows of the table (the r in the r 3 k), and the subsets of the other variable are listed in the table columns or categories (the k in the r 3 k). Table 10.4, an example of a contingency table, shows the breakdown of ethnicity and voting behavior data results.
Table 10.4: Contingency table
Ethnic group
Voted in last election
Total number of respondentsYes No
A 8 a 4 b 12
B 2 c 6 d 8
C 3 e 5 f 8
D 7 g 1 h 8
Totals 20 16 36
The subject’s ethnicity is indicated in the rows, which end with a row for column totals. The col- umns indicate how many voted and how many did not, as well as the total number in each ethnic group. Each of the 8 cells is identified with a letter, which the researcher will use to calculate the chi-square value. Cell a, for example, indicates that eight of the people in ethnic group A voted.
Calculating the Frequency-Expected Values, fe As it was with the 1 3 k chi-square test, the frequency-observed (fo) values reflect what actu- ally occurred. The frequencies expected (fe) are calculated differently than they were in the one- variable test, however. Because each value reflects the influence of two variables (each cell in the contingency table is at the intersection of a row and a column), a researcher cannot just divide the number of subjects by the number of cells and use the same fe value for each cell. The fact that
tan82773_10_ch10_295-320.indd 306 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
the two variables might have a different impact on some combinations than on others disallows such an approach. The fe value must reflect the impact that both variables have on the outcome in each combination. The fe value for cell a in the r 3 k chi-square test is completed this way:
The fe value for a particular cell is the row total for that cell times the column total for that cell, divided by the total number of subjects.
The fe value for cell a, for example, is the row total for cell a (12) times the total for the column in which cell a is found (20), divided by the total number of subjects (36): (12 3 20) 4 36 5 6.67.
The fe calculations for cells b through h follow:
b: (12 3 16) 4 36 5 5.33
c: (8 3 20) 4 36 5 4.44
d: (8 3 16) 4 36 5 3.56
e: (8 3 20) 4 36 5 4.44
f: (8 3 16) 4 36 5 3.56
g: (8 3 20) 4 36 5 4.44
h: (8 3 16) 4 36 5 3.56
Using the frequency-observed values in the cells of the contingency table and the calculated frequency-expected values, the researcher can create the same table used in the goodness-of- fit problems earlier:
For each of the eight cells,
1. subtract fe from fo, 2. square the difference, 3. divide the squared difference by fe, and 4. sum the results from each of the cells, which is the value of chi-square.
Table 10.5 completes the ethnicity and voting behavior problem.
Table 10.5: The chi-square test of independence: Ethnicity and voting behavior
Value a b c d e f g h
fo 8.00 4.0 2.0 6.0 3.0 5.0 7.0 1.0
fe 6.67 5.33 4.44 3.56 4.44 3.56 4.44 3.56
fo 2 fe 1.33 21.33 22.44 2.44 21.44 1.44 2.56 22.56
fo 2 fe2 1.77 1.77 5.95 5.95 2.07 2.07 6.55 6.55
fo 2 fe2 / fe 0.27 0.33 1.34 1.67 0.47 0.58 1.48 4.84
∑ (fo 2 fe)2
fe 5 χ2 5 7.98
tan82773_10_ch10_295-320.indd 307 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
Degrees of Freedom in the Chi-Square Test of Independence For a chi-square test of independence, the number of degrees of freedom is determined by the number of categories of one variable, minus one, times the number of categories in the other variable, minus one. For this problem, which has four rows and two columns in the contin- gency table, the number of degrees of freedom is (4 2 1) 3 (2 2 1) 5 3.
From the table for critical values of chi-square (Table 10.2), the value for 3 degrees of freedom and testing for alpha error at p 5 0.05 is χ20.05 (3) 5 7.82.
Interpreting the r 3 k Result By conducting the chi-square test of independence, the researcher is asking, “Is ethnicity related to whether the individual votes in a national election?” As with the first test, Pearson compared what actually occurs in a particular situation (fo) to what can be expected, but with
the test of independence, what is expected is based on the hypothesis that the variables involved are unrelated, uncorrelated. The null hypothesis for this test is based on that uncorrelated hypothesis, so the fe values are cal- culated to indicate what to expect when the variables are independent of each other. The substantial variations of fo from fe prompt larger values of chi-square. If the varia- tions between fo and fe are great enough that they meet
or exceed the critical value, the statistical decision is to reject the null hypothesis and conclude that the variables are not independent of each other; they are correlated.
The psychologist’s data on ethnicity and voting behavior produced a calculated value of chi- square which exceeds the critical value from Table 10.2 for p 5 0.05 and three degrees of freedom. It is statistically significant. The lack of independence indicates that voting behavior for some ethnic groups is different than it is for those of other ethnic groups.
Classifying the r 3 k Test Earlier chapters organized statistical tests according to whether they addressed the hypoth- esis of difference or the hypothesis of association. Tests like z, t, and ANOVA (F) are analyses of significant differences between samples and populations, or differences between samples. The Pearson and Spearman correlation procedures quantified the strength of the relation- ship between two variables; they addressed the hypothesis of association. The chi-square test of independence does not fit this either-or classification. The researcher initially ques- tioned whether there are significant differences in voting behavior among the different eth- nic groups, which makes the r 3 k sound a lot like an ANOVA. But the analysis is based on whether ethnicity and voting behavior are related, a question that makes the test more of a correlation analysis. The r 3 k test addresses both of those main hypotheses. It straddles the ground between the hypotheses of difference and association.
Phi Coefficient and Cramér’s V Because the researcher’s results indicate that ethnicity and voting behavior are not inde- pendent, a supplementary question follows: How related are the two variables? This is
Try It!: #4 How many categories in either variable can the r 3 k chi-square accommodate?
tan82773_10_ch10_295-320.indd 308 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
a correlation question; whenever the r 3 k chi-square value is statistically significant, the strength of the relationship must be determined.
Research uses several correlation procedures for nominal data. Depending upon circum- stances, the phi coefficient (symbolized by the Greek letter phi, φ), or a variation of the phi coefficient called Cramér’s V are the correlation procedures we’ll use. Phi coefficient is appropriate when at least one of the variables has only two levels. In the ethnicity and voting behavior problem, the individual voted or did not—these are the only two categories. If both variables have three or more levels, the researcher must choose Cramér’s V.
Both Cramér’s V and phi coefficient employ the previously determined value of chi-square in their calculations, which means that most of the work has been done before the strength-of- the-correlation question is resolved. In addition, the correlation values require no separate significance tests. Since these values are calculated only when the chi-square value is sta- tistically significant, any related φ or V coefficient will likewise be significant. For the same reason, the phi coefficient and Cramér’s V may also be considered effect sizes. Like all sig- nificance tests, the initial r 3 k result indicates only whether the chi-square value is or is not statistically significant. Phi and Cramér’s V provide information about the magnitude of the relationship.
The formula for the phi coefficient follows:
Formula 10.2
φ 5 Ñ a χ2 n b
where
χ2 5 the value already available from the r 3 k problem
n 5 the total number of observations (the sum of all the fo values)
The formula for Cramér’s V follows:
Formula 10.3
V 5 Ñ χ2
n(k 2 1)
where
χ2 5 the value already available from the r 3 k problem
n 5 the total number of observations
k 2 1 5 the number of rows or columns—whichever is less—minus 1
When there are only two levels of either or both of the variables, then V 5 φ because the denominator in V would be n 3 1.
For the ethnicity and voting behavior problem, χ2 5 7.98, and it is statistically significant. The phi coefficient is
tan82773_10_ch10_295-320.indd 309 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
φ 5 Ñ a χ2 n b
5 Ñ a 7.982
36 b
5 0.47
Note that the chi-square value is symbolized by χ2. Sometimes students note the exponent and proceed to square the value, 7.98 in this case. Keep in mind that the exponent refers to the way the value was initially calculated (fo 2 fe)2 and that to square it again as part of the phi or Cramér’s V coefficient would be an error.
The phi coefficient value is interpreted like any other correlation value, except for two things: 1. A researcher does not need to check for significance because that was already established for the chi-square value upon which it is based. 2. Because the coefficient is reached by calcu- lating a square root, phi can never be negative. Correlations closer to 1.0 are stronger; values closer to 0 are weaker. The correlation here between ethnicity and the tendency to vote might be described as a moderate correlation.
Apply It! Chi-Square Test for Locating Group Homes
Because working with the mentally ill in group homes rather than in secure facilities is more eco- nomical and in some cases more effective, a group of mental health professionals has decided to gauge whether public opinion favors outpatient treatment of mental health inmates convicted of nonviolent crimes. The group assumes that there will be differences of opinion in rural, semi-rural, suburban, and urban locations, since some of those locations are more likely to house group homes than others. Researchers sent to people in different locations 3,000 questionnaires, asking residents whether they favor releasing certain classifications of the mentally ill to the custody of group homes for treatment and rehabilitation into society. The health professionals received 1,434 completed questionnaires. The null hypothesis (H0: fo 5 fe) holds that whether people favor group homes rather than assignment to a secure facility will be unrelated to where they live. The alternate hypothesis (HA: fo ? fe) is that the two variables are related. Table 10.6 summarizes the survey data results.
Table 10.6: Questionnaire results concerning release of mentally ill to group homes
Type of neighborhood
Favor group homes
TotalsYes No
Urban 248 128 376
Suburban 212 143 355
Small town 198 95 293
Semi-rural 33 67 100
Rural 133 177 310
Totals 824 610 1434
tan82773_10_ch10_295-320.indd 310 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
Because this 5 3 2 chi-square involves so many repetitious calculations, this would be a good problem to complete using Excel. The related calculations are demonstrated below, but in the meantime, note that χ2 5 75.47. By comparison, the critical value is χ20.05(4) 5 9.49. With a statistically significant value of chi-square, the psychologists reject the null hypothesis and proceed to calculate phi:
φ 5 Ñ ( χ2 n )
Inserting the values for χ2 and for n, they discover that
φ 5 Ñ ( 75.47 1,434 )
φ 5 0.23
Researchers initially chose to break down responses by area of residence because they assumed that there might be differences of opinion about the value of establishing group homes depending upon the likelihood of one near respondents’ places of residence. The initial r 3 k results support this explanation. The two variables (where one lives and whether one supports) do not operate independently. However, the phi coefficient value makes it clear that the relationship is not a particularly strong one. A value of φ 5 0.23 suggest that where one lives and whether one supports group homes as an alternative to secure institutions are only modestly related.
Apply It! boxes written by Shawn Murphy
A 3 3 3 Problem An earthquake and an accompanying tsunami have wreaked havoc in a particular country. A psychologist is analyzing the appeal of three different outreach programs (A, B, and C) to people who describe their losses as primarily material, primarily emotional, or a com- bination. The three programs offer counseling regarding insurance and financial recovery (A), therapy focused on coping with change and the loss of loved ones (B), and therapy that includes elements of both financial recovery and emotional disruption (C). The data are as follows:
• Among 8 people describing their losses as primarily material, 6 opt for A, 2 for B, and 0 for C.
• Among 6 people describing their losses as primarily emotional, 0 opt for A, 2 for B, and 4 for C.
• Among 6 people describing their losses as a combination, 0 opt for A, 5 opt for B, and 1 for C.
The question is whether people who have different kinds of losses choose to be involved in different kinds of counseling programs. Table 10.7 depicts the contingency table for these data.
Table 10.8 shows the solution for this problem.
tan82773_10_ch10_295-320.indd 311 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
Completing the calculations produces a chi-square value of χ2 5 18.19. With four degrees of freedom (3 2 1) 3 (3 2 1), the critical value for testing for alpha error at p 5 0.05 (χ20.05(4)) is 9.49 5 18.190.05(4). The type of loss the individual suffered is related to the kind of counsel- ing the individual chooses. Because both variables have more than three levels, researchers use Cramér’s V as the correlation procedure to determine the strength of the relationship between type of loss and kind of counseling.
With n 5 20 and the number of either the rows or columns 5 3, k 2 1 5 2. Cramér’s V is calculated as follows:
V 5 Ñ χ2
n(k 2 1) 5 Ñ 18.19 20(2)
5 0.67
The r 3 k result indicated that there are significant differences between what was observed and what would have been expected had there been no relationship between the type of coun- seling program and the kind of loss victims suffered. Cramér’s V quantified the strength of that relationship. Besides being statistically significant (which the significant χ2 value estab- lishes), the strength of the relationship appears to be substantial.
Table 10.7: Contingency table for a 3 3 3 problem
Type of Loss
Program
TotalsA B C
Material 6a 2b 0c 8
Emotional 0d 2e 4f 6
Material and Emotional 0g 5h 1i 6
Totals 6 9 5 20
Table 10.8: Another r 3 k problem: Outreach programs and the type of loss suffered
Value a b c d e f g h i
fo 6.0 2.0 0 000 2.00 4.00 0.00 5.00 1.00
fe 2.4 3.6 2 1.80 2.70 1.50 1.80 2.70 1.50
fo 2 fe 3.6 21.6.0 22.0 21.8 00 20.70 0 2.50 21.8.00 2.30 20.5.00
fo 2 fe2 12.96 2.56 4 3.24 0.49 6.25 3.24 5.29 0.25
fo 2 fe2 / fe 5.4 0.71 2 1.80 0.18 4.17 1.8.0 1.96 0.17
∑ (fo 2 fe)2
fe 5 χ2 5 18.19
tan82773_10_ch10_295-320.indd 312 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.3 The Chi-Square Test of Independence
Apply It! Public Policy Research
Psychologists associated with a particular national psychological association are inter- ested in how people view the insanity defense in criminal trials. Their research is based on anecdotal data that suggest that the viability of an insanity defense has something to do with the region of the country in which the respon- dent lives. The psychologists develop a ques- tionnaire on mental health issues where one of the items is in Likert-type form and is posed as follows:
At certain times, circumstances make otherwise rational people unaccountable for their actions.
The response choices are the following:
• disagree • neither agree nor disagree • agree
The researchers solicit responses from five different areas of the country identified as these:
• the Northeast • the South • the Midwest • the Northwest • the West
This is a 5 (regions) 3 3 (type of response) chi-square test of independence. The question the procedure will answer is whether where the respondent lives is independent of the respon- dent’s response to this particular item. The null hypothesis associated with such a question is that the two are unrelated, indicated by no statistically significant differences between fo and fe. Results for the survey are shown in Table 10.9.
Table 10.9: Agreement with insanity defense according to region
Campaign Disagree Agree Strongly Agree Totals
Northwest 1 4 13 18
Northeast 2 5 9 16
Midwest 10 4 2 16
West 11 5 1 17
South 13 3 0 16
Totals 37 21 25 83
Mark Massel/iStock/Thinkstock
(continued)
tan82773_10_ch10_295-320.indd 313 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.4 Completing the r 3 k with Excel
10.4 Completing the r 3 k with Excel The Excel statistical package focuses on analyses for interval and ratio data. Excel offers no procedure for either of the chi-square tests, but they are not difficult to set up and do not require elaborate programming. To illustrate, here is how to complete the second r 3 k prob- lem on Excel. To produce Figure 10.2, follow these steps:
1. Leave cell A1 blank, and then label cells B1–F1 as follows:
B1: fo C1: fe D1: fo 2 fe E1: (fo 2 fe)2
F1: (fo 2 fe)2/fe
(continued)
Having completed the analysis, the psychologists find that
χ2 5 42.04.
The number of degrees of freedom is (5 2 1) 3 (3 2 1) 5 8, so
χ20.05(8) 5 15.51.
The calculated value of chi-square is larger than the critical value of chi-square from Table 10.2, indexed by the probability of a type I error and the degrees of freedom. For these data at least, where respondents live does relate to how they answer the item about accountability. To determine the strength of the relationship between the region of residence and the response type, the psychologists will need Cramér’s V, since both variables have more than two levels.
In this case, the response type (agree, neither agree . . .) represents the variable with the fewest number of levels (3), so k 2 1 5 2, and n 5 the total number of responses (83). Therefore,
V 5 Ñ χ2
n(k 2 1)
5 Ñ 42.04
(83 3 2)
5 0.50
This is another moderate correlation. Psychologists who are called as expert witnesses in trials involving an insanity defense will have a more difficult time convincing the jury in some areas of the country than in others.
Apply It! boxes written by Shawn Murphy
tan82773_10_ch10_295-320.indd 314 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.4 Completing the r 3 k with Excel
Figure 10.2: Completing the chi-square test of independence using Excel
Source: Microsoft Excel. Used with permission from Microsoft.
2. Beginning in cell A2 and continuing to cell A10, enter the labels, as shown in Figure 10.1.
3. In column B under “fo,” enter the fo values for cells a through i from the contingency table.
4. Leave cell A11 blank, and then beginning in cell A12, enter the label “row 1 =.” The label “row 2 =” will go in cell A13.
tan82773_10_ch10_295-320.indd 315 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Section 10.4 Completing the r 3 k with Excel
5. Starting in cell B12, enter the formula for determining the sum of row 1 (as if you were adding cells a, b, and c in the contingency table): =sum(B2:B4).
• For the row 2 total in cell B13, the command will be =sum(B5:B7). Click Enter. • For the row 3 total in cell B14 the command will be =sum(B8:B10). Click Enter. • For the column 1 total in cell B15 the command will be =sum(B2+B5+B8).
Click Enter. • For the column 2 total in cell B16 the command will be =sum(B3+B6+B9).
Click Enter. • For the column 3 total in cell B17 the command will be =sum(B4+B7+B10).
Click Enter.
6. In cell C2, enter the formula for determining the fe value for what is cell a in the con- tingency table, with the formula =(B12*B15)/20. For cells b–i in the contingency table, the commands are the following:
• In cell C3, enter the command =(B12*B16)/20. • In cell C4, enter the command =(B12*B17)/20. • In cell C5, enter the command =(B13*B15)/20. • In cell C6, enter the command =(B13*B16)/20. • In cell C7, enter the command =(B13*B17)/20. • In cell C8, enter the command =(B14*B15)/20. • In cell C9, enter the command =(B14*B16)/20. • In cell C10, enter the command =(B14*B17)/20.
7. In cell D2, enter the command =B2-C2.
• Repeat this command for cells D3–D10 by placing the cursor on the result in cell D2 and dragging the cursor down to cell D10 so that all the cells from D2 to D10 are highlighted.
• From the Home tab, select the small down arrow to the right of the larger down arrow at the top of the page on the extreme right just under the summation (∑) sign, and then select down.
8. In cell E2, enter the command =D2^2 which will enter the square of the cell D2 contents in cell E2.
• Repeat this command for cells E3–E10 by placing the cursor on the result in cell E2 and dragging it down to cell E10 so that all the cells from E2 to E10 are highlighted.
• From the Home tab, select the small down arrow to the right of the larger down arrow at the top of the page on the extreme right just under the summation (∑) sign, and then select down.
9. In cell F2, enter the command =E2/C2.
• Repeat this command for cells F3–F10 by placing the cursor on the result in cell F2 and dragging it down to cell F10 so that all the cells from F2 to F10 are highlighted.
tan82773_10_ch10_295-320.indd 316 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Summary and Resources
• From the Home tab, select the small down arrow to the right of the larger down arrow at the top of the page on the extreme right just below the summation (∑) sign, then select down.
10. Sum the values from cells F2 to F10 by entering the command =sum(F2:F10) in cell F11. The result is the chi-square value, which may modestly differ from a long- hand calculation because Excel carries several more decimals than the two in the longhand solution.
Writing Up Statistics Habersham (2014) used the chi-square test of independence to study the relationship between college persistence and student characteristics such as dual enrollment (concurrent college and high school enrollment), gender, and ethnicity. Of the three variables, only gender was significantly related to college persistence. The students’ persistence in post-secondary studies was dependent upon neither their prior experience with dual enrollment nor their ethnicity. Interestingly, the author does not report the strength of the relationship between gender and persistence.
Summary and Resources
Chapter Summary Nineteenth-century British prime minister and apparent skeptic Benjamin Disraeli observed that what we expect usually does not occur and what we do not anticipate typically happens. Karl Pearson, who happened to be one of Disraeli’s contemporaries, would have disagreed. In fact, Pearson’s chi-square tests are based on the assumption that under normal circum- stances, it is the expected thing that happens, and when the outcome differs significantly from what is expected, that is noteworthy.
The chi-square tests are the only procedures in this book that are designed for nominal- scale data, data assigned to categories rather than quantities (Objective 1). For that reason, they stand apart from the others. There are, however, some important similarities as well. Both the goodness-of-fit (1 3 k) test and the test of independence (r 3 k) are tests of signifi- cant differences like the z, t, and ANOVA tests. In addition, a significant finding with the r 3 k test leads to calculating either the phi coefficient or, when both variables have more than two categories, Cramér’s V (Objectives 2 and 3). The phi coefficient and Cramér’s V are both correlation procedures, which places them in the same category as the Pearson and Spear- man correlations. Although this chapter may seem very different from the first nine chap- ters, the differences are confined to the scale of the data involved and the kinds of values that are calculated as a result, not to the kinds of decisions that are made.
Data of any scale can be reduced to nominal-scale data and subjected to chi-square analysis. It is possible to reduce ratio-scale data into nominal-scale data. For example, this occurs if a psychologist who has collected data on the number of times clients manifest compulsive behaviors elects to ignore the number of behaviors (ratio data) and focus only on whether or not people from different groups manifest compulsive behavior (yes/no responses
tan82773_10_ch10_295-320.indd 317 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Summary and Resources
constitute nominal data—Objective 1) according to significantly different frequencies (Objective 2). Such research provides for a simpler analysis, but any opportunity to examine the degree of compulsivity is forfeit. The differences between the client who is minimally compulsive and the one who is extremely compulsive are lost. Reductions in data scale result in lost data, and when researchers have a choice, most avoid reducing data scale, in spite of the opportunity to use chi-square analysis with all its flexibility.
Chi-square tests’ appeal, on the other hand, is that they do not require that the data meet any of the normality requirements that parametric tests impose. Very small data sets, with their inherent risks to normality, are more acceptable in chi-square tests than they are with t test, for example. The chi-square tests should have a place in every behavioral scientist’s analytical repertoire.
To some degree, the chi-square tests in this chapter represent a nominal data equivalent to the one-way ANOVA and the factorial ANOVA tests used for interval and ratio data. Like the one-way ANOVA, the goodness-of-fit chi-square involves just one independent variable (political party affiliation, for example), but it can be divided into any number of categories. Like the factorial ANOVA, the chi-square test of independence involves two independent variables (party affiliation and gender, for example), although in the case of the chi-square test, two variables are the limit. (Objective 3). Although the coverage of statistical proce- dures in this book has not been exhaustive, it has been representative. The 10 chapters introduced some of the most important concepts in data analysis and some of the statistical procedures that provide the foundation for nearly all quantitative analysis. Someone with a grasp of the material in these chapters has a solid footing from which to conduct research, complete the data analysis that many reports require, and read the scholarly research.
A parting comment: data-analysis concepts do not often come up in day-to-day conversation, but they probably should provide at least the backdrop for those conversations. You now have the tools for summarizing data and presenting them to others and for answering a variety of important questions. Do the people in your community differ from a national population in voter participation? Are graduation rates at your university significantly different from those at others you considered attending? Is the relationship between marital status and salary merely a random relationship? People frequently make snap judgments about such things, but you need not. You now have the tools that will allow you, with some work in data collec- tion, to answer such questions definitively. So it is worth working to keep your data-analysis tools well-oiled. Think of the understanding you have gained of these topics as elements of a cognitive muscle: Do not let it atrophy. Occasional practice and frequent review will keep your understanding active. The author wishes each student the very best of luck. Happy analyzing!
chi-square test of independence, or r 3 k chi-square A test of whether two nominal variables operate independently of each other.
contingency table The arrangement of data from a chi-square test of indepen- dence where the categories of one nominal
variable are in rows and the categories of the other variable are in columns.
Cramér’s V Variation of the phi coefficient used to determine the strength of the rela- tionship between two variables with more than two categories.
Key Terms
tan82773_10_ch10_295-320.indd 318 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Summary and Resources
Review Questions Answers to the odd-numbered questions are provided in Appendix A.
1. A market researcher for an advertising agency wants to determine whether people listen to three local radio stations in similar proportions. The researcher conducts a telephone survey to find out which station people prefer. In two hours, 52 of the respondents indicated preference for one of the three. The data are as follows:
Station A: 22 Station B: 18 Station C: 12
Are the differences among their preferences statistically significant?
2. What does it mean when an r 3 k problem is statistically significant?
3. Why are significant r 3 k problems followed by either phi coefficient or Cramér’s V? What are the circumstances under which each procedure is used?
4. A counselor working with people who are developmentally disabled has read research relating accurate responses to the type of reinforcement they receive. For three weeks, the counselor provides verbal praise every time a client completes a simple task cor- rectly and then totals the number of simple problem-solving tasks that are completed successfully. For the next three weeks, the counselor provides a small piece of candy for each successfully completed task. The data for the group are as follows:
After 3 weeks of verbal praise—17 tasks completed successfully in one hour. After 3 weeks of tangible rewards—27 tasks completed successfully in one hour.
Are the differences statistically significant?
5. An armed-services psychologist believes that among those who have been in a com- bat area, ground forces experience post-traumatic stress more frequently than Navy or Air Force personnel. Data from a number of armed-services personnel who have been in combat areas reveal the following:
Of 40 Army personnel, 22 have experienced some post-traumatic stress. Of 30 Air Force personnel, 3 have experience some post-traumatic stress. Of 30 Navy personnel, 12 have experienced some post-traumatic stress.
goodness-of-fit chi-square test, or 1 3 k chi-square A test for significant differences in the frequency with which nominal data occur in distinct categories.
phi coefficient Used to determine the strength of the relationship when there are
two categories of each variable in the chi- square test of independence. When there are more than two categories of both variables, the measure is Cramér’s V.
tan82773_10_ch10_295-320.indd 319 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.
Summary and Resources
a. Are the branch of service and post-traumatic stress related? b. If so, what is the strength of the relationship?
6. A university counselor has a theory that students’ employment and their grades may be related. Among 20 employed students, 8 have grades above 3.0. Among 15 non- employed students, 12 have grades better than 3.0. Is the counselor correct? If so, what is the strength of the relationship?
7. A university administrator believes that undergraduate students of different majors attend the writing lab in different proportions. Test this with the following data:
Of 20 English majors, 2 attend the writing lab. Of 18 engineering majors, 10 attend the lab. Of 15 history majors, 6 attend the lab.
Do major and lab attendance operate independently?
8. Juvenile offenders in court-ordered treatment can choose between community- service activities and a series of group-counseling sessions. The therapist believes that they will choose therapy over community service by a ratio of two to one. Among 35 offenders, 20 opt for therapy. Is the counselor correct?
9. In an effort to assist students more effectively, a high school counselor wonders whether students who opt for college choose one institution with significantly greater frequency than the others. Data are as follows for 70 graduating seniors who are going to college:
37 apply to the local community college 23 apply to the state university 10 apply to the private, liberal-arts college
Are the differences in institutional preference statistically significant?
10. A car salesperson attempts to determine whether age and the type of car purchased are related.
Of 15 people in their 20s, 3 opt for sports cars, 8 for economy cars, and 4 for sedans. Of 18 people in their 30s, 7 opt for sports cars, 4 for economy cars, and 7 for sedans. Of 12 people in their 40s, 2 opt for sports cars, 4 for economy cars, and 6 for sedans.
Are customers’ ages and the type of car they select related?
Answers to Try It! Questions
1. The 1 3 k chi-square will accommodate just one nominal-scale variable, but it may have any number of categories.
2. The fact that chi-square values are squared as they are calculated does away with any possibility of a negative value.
3. Type II errors are typically more of a problem with chi-square than type I errors are.
4. The r 3 k can accommodate just two variables, but theoretically, those two variables can have any number of categories.
tan82773_10_ch10_295-320.indd 320 3/3/16 2:33 PM
© 2016 Bridgepoint Education, Inc. All rights reserved. Not for resale or redistribution.



