Summary of Discovering statistics using IBM SPSS statistics by Andy Field - 5th edition
- 3093 reads
Join with a free account for more service, or become a member for full access to exclusives and extra support of WorldSupporter >>
It is possible to predict categorical outcome variables, meaning, in which category an entity falls. When looking at categorical variables, frequencies are used. The chi-squared test can be used to see whether there is a relationship between two categorical variables. It is comparing the observed frequencies with the expected frequencies. The chi-squared test standardizes the deviation for each observation and these are added together.
The chi-squared test uses the following formula:
The expected score has the following formula:
The degrees of freedom of the chi-squared distribution are (r-1)(c-1). In order to use the chi-squared distribution with the chi-squared statistic, there is a need for the expected value in each cell to be greater than 5. If this is not the case, then Fisher’s exact test can be used.
The likelihood ratio statistic is an alternative to the chi-square statistic. It is comparing the probability of obtaining the observed data with the probability of obtaining the same data under the null hypothesis. The likelihood ratio statistic uses the following formula:
It uses the chi-squared distribution and is the preferred test if the sample size is small. The chi-square statistic tends to make a type-I error if the table is 2 x 2. This can be corrected for by using Yates’ correction and uses the following formula:
In short, the chi-square test tests whether there is a significant association between two categorical variables.
ASSUMPTIONS WHEN ANALYSING CATEGORICAL DATA
One assumption the chi-square test uses is the assumption of independence of cases. Each person, item or entity must contribute to only one cell of the contingency table. Another assumption is that in 2x2 tables, no expected value should be below 5. In larger tables, not more than 20% of the expected values should be below 5 and all expected values should be greater than 1. Not meeting this assumption leads to a reduction in test power.
The residual is the error between what the expected frequency and the observed frequency. The standardized residual can be calculated in the following way:
Individual standardized residuals have a direct relationship with the test statistic, as the chi-square statistic is composed of the sum of the standardized residuals. The standardized residuals behave like z-scores.
EFFECT SIZE
Cramer’s V can give an effect size. In 2x2 tables, the odds-ratio is often used as the effect size. The odds-ratio uses the following formula:
The actual odds ratio is the odds of event A divided by the odds of event B.
This bundle contains the chapters of the book "Discovering statistics using IBM SPSS statistics by Andy Field, fifth edition". It includes the following chapters:
- 1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18.
There are several ways to navigate the large amount of summaries, study notes en practice exams on JoHo WorldSupporter.
Do you want to share your summaries with JoHo WorldSupporter and its visitors?
Main summaries home pages:
Main study fields:
Business organization and economics, Communication & Marketing, Education & Pedagogic Sciences, International Relations and Politics, IT and Technology, Law & Administration, Medicine & Health Care, Nature & Environmental Sciences, Psychology and behavioral sciences, Science and academic Research, Society & Culture, Tourisme & Sports
Main study fields NL:
JoHo can really use your help! Check out the various student jobs here that match your studies, improve your competencies, strengthen your CV and contribute to a more tolerant world
2531 | 1 |
Add new contribution