This paper presents two applications of chi-square analysis to categorical educational research. The first section proposes a research design investigating perceived barriers to STEM majors among male and female students, establishing the null and alternative hypotheses and explaining why chi-square methodology suits nominal, categorical data. The second section critiques Hill and Hannon's (2008) empirical study of middle school physical education preferences, evaluating their chi-square analysis across gender, grade level, skill level, and out-of-school participation variables. Together, these sections demonstrate how chi-square tests address frequency-based research questions in educational settings.
Many interesting questions related to students are categorical in nature. For instance, considerable interest exists in the different enrollment patterns of male and female students in Science, Technology, Engineering, and Mathematics (STEM) majors. While the literature provides robust quantitative data on the numbers of students enrolled in these fields, intriguing questions remain that may be better suited to qualitative or exploratory data collection methods.
One such research question is: What barriers are perceived by female students with regard to pursuing STEM majors, and do male students face similar barriers? Understanding these barriers requires capturing how individual students categorize or identify obstacles, which naturally leads to categorical data. Categorical representation of presumed barriers could form the basis for a rigorous statistical study using chi-square analysis to explore the frequency of students who identify particular barriers to selecting STEM majors.
This approach recognizes that not all research questions are best answered by continuous, normally distributed data. When variables are discrete and categorical, different statistical methods are required. Understanding when and why to apply chi-square analysis is essential for educational researchers addressing questions about student preferences, barriers, and outcomes.
Chi-square analysis, also called goodness of fit testing, relies on frequency data, which makes it applicable for nominal data. The variables of gender and barrier categories are discrete and categorical—they fit into distinct bins with no inherent order or numerical meaning. Such nominal variables cannot be meaningfully analyzed using parametric tests like t-tests or ANOVA, which assume continuous, normally distributed data.
The chi-square test of independence assumes that each subject contributes data to only one cell in the contingency table. This fundamental assumption means that the sum of frequencies in all cells must equal the total number of subjects in the study. When this condition is met and data are properly categorical, chi-square analysis provides a robust method for determining whether observed frequencies differ significantly from expected frequencies across groups.
A well-designed study investigating gender differences in perceived STEM barriers would begin with clear hypotheses. The research question can be stated formally as:
RQ1: What barriers do female and male students perceive with regard to selecting majors in science, technology, engineering, and mathematics (STEM) courses?
The corresponding hypotheses would be:
H0: There are no differences in the barriers perceived by male and female students with regard to majoring in STEM courses.
H1: There are differences in the barriers perceived by male and female students with regard to majoring in STEM courses.
Such a study would collect categorical data on gender and barrier identification, making chi-square analysis the appropriate statistical tool for testing these hypotheses.
Hill and Hannon (2008) conducted an empirical study exploring activity preferences among middle school students in physical education classes. Their research was motivated by the long-term objective of establishing lifelong healthy physical exercise habits in students. The study's purpose was to determine what activities middle school students prefer for their physical education classes and whether preferences differ based on gender, grade level, motor skill competencies, and out-of-school physical activity participation.
The sampling frame consisted of 1,500 students in grades 7, 8, and 9 at two middle schools in the Southwestern United States. The student ethnic populations at the two schools were similarly distributed. A total of 881 surveys were appropriately completed, representing 97% of the classes selected to participate. Parental consent was obtained, with returning consent forms offering extra credit to participating students.
The survey instrument included sections for demographic information, a list of physical education activity choices, and an open-ended section where students could write in preferred activities not on the standard list. Physical education teachers ranked students according to their motor skill competencies using a three-point scale (1 = above grade level, 2 = at grade level, 3 = below grade level).
Hill and Hannon (2008) employed SPSS software to conduct their statistical analyses. The dependent variables were the students' preferred physical education activities. The independent variables included the 33 suggested activities and student attributes: gender, grade level, motor skill competencies, and participation in physical activities outside of school.
Data analysis consisted of frequencies and percentages of activity choice responses by group and gender, followed by chi-square analyses to determine whether significant relationships existed between curricular choices and the independent variables. A separate chi-square analysis examined whether significant relationships existed between activity preferences at the two schools—one with co-ed classes and one with gender-separate classes. All statistical tests used a significance level of p < .01.
The results were extensive. When survey responses were compared between schools, chi-square analysis showed a significant main effect for nine activities. In the school with co-ed physical education classes, a significantly higher percentage of students selected bowling, swimming, archery, skating, table tennis, volleyball, canoeing, water polo, and walking. When responses were compared by gender, 21 percent of the activities revealed significant chi-square values. Nine activities showed significant chi-square values when compared by skill level, and 11 activities showed significant differences by grade level. The analysis also demonstrated significant differences in activity preferences between males and females at each grade level, and 12 activities showed significant chi-square values when comparing students' additional sports participation outside class.
"Significant results and limitations of the study"
"Educational and health implications of findings"
Always verify citation format against your institution’s current style guide requirements.