Discovering statistics using IBM SPSS statistics by Andy Field, fifth edition – Summary chapter 3

There are three main misconceptions of statistical significance:

  1. A significant result means that the effect is important
    Statistical significance is not the same as practical significance.
  2. A non-significant result means that the null hypothesis is true
    Failing to reject the null hypothesis does not mean that the null hypothesis is true; a non-significant result only tells us that the effect was not large enough to be detected.
  3. A significant result means that the null hypothesis is false
    Rejecting the null hypothesis in favour of the alternative hypothesis does not prove that the null hypothesis is false. The decision is based on probability, so there is always a chance that the null hypothesis is in fact true.

The use of NHST encourages ‘all-or-nothing’ thinking: a result is either significant or not. If a confidence interval contains zero, the population effect could be zero.

An empirical probability is the proportion of events that have the outcome in which you’re interested in an indefinitely large collective of events. The p-value is the probability of getting a test statistic at least as large as the one observed, assuming the null hypothesis is true, across an infinite number of identical replications of the experiment. In other words, it is the relative frequency of test statistics at least as large as the observed one in that collective of identical experiments. Because this collective depends on how the researcher intended to collect the data (for example the planned sample size or the moment at which data collection would stop), the intentions of the researcher can influence the p-value.
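The frequency interpretation of the p-value can be illustrated with a small simulation. This is only a sketch under assumptions that are not in the summary: the null hypothesis is taken to be a normal population with mean 0 and standard deviation 1, the test statistic is the sample mean, the test is one-sided, and the function name and numbers are hypothetical.

```python
import random
import statistics


def simulated_p_value(observed_mean, n, n_replications=20_000, seed=1):
    """Proportion of replications under the null whose sample mean is
    at least as large as the observed sample mean (one-sided)."""
    rng = random.Random(seed)
    at_least_as_large = 0
    for _ in range(n_replications):
        # One "identical replication": draw n scores from the assumed null population.
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if statistics.fmean(sample) >= observed_mean:
            at_least_as_large += 1
    return at_least_as_large / n_replications


# Hypothetical study: 16 participants with an observed mean of 0.5.
p = simulated_p_value(observed_mean=0.5, n=16)
print(p)
```

Changing n (the intended sample size) changes the collective of replications and therefore the p-value for the same observed mean, which is the sense in which the researcher's intentions matter.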

Because journals rely on NHST, there is publication bias: significant results are more likely to get published. Researcher degrees of freedom are the ways in which a researcher can influence the p-value. They can be used to make a significant result more likely (e.g. by excluding some cases). Researcher degrees of freedom could include not using some observations and not publishing key findings.

P-hacking refers to the selective reporting of significant p-values, for example by trying multiple analyses and reporting only the significant ones. HARKing refers to forming a hypothesis after data collection and presenting it as if it had been made before data collection. P-hacking and HARKing make results difficult to replicate. Tests of excess success (e.g. looking at multiple studies of the same effect and calculating the probability of them all being successful) are used to see whether it is likely that p-hacking or something similar has occurred.
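The idea behind a test of excess success can be sketched as follows. This is a simplified illustration, not a full procedure: it assumes the studies are independent and that the power of each study is known, and the power values below are hypothetical.

```python
import math


def prob_all_significant(powers):
    """Probability that every study finds a significant result,
    assuming independent studies with the given power values."""
    return math.prod(powers)


# Hypothetical set of five studies, each with 60% power.
powers = [0.6, 0.6, 0.6, 0.6, 0.6]
p_all = prob_all_significant(powers)
print(p_all)
```

If this probability is low and yet all published studies are significant, the set of results is more successful than expected, which may point to p-hacking or selective publication.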

EMBERS
There is an abbreviation for how to tackle the problems of NHST: Effect sizes (E), Meta-analysis (M), Bayesian Estimation (BE), Registration (R) and Sense (S), together making EMBERS.

SENSE
There are six principles for using NHST sensibly:

  1. The exact p-value can indicate how incompatible the data are with the null hypothesis.
  2. P-values should not be interpreted as the probability that a hypothesis is true.
  3. Scientific conclusions and policy conclusions should not be based on whether a p-value passes a threshold (no all-or-nothing thinking).
  4. Don’t p-hack.
  5. Don’t confuse statistical significance with practical importance.
  6. A p-value by itself does not provide a good measure of evidence regarding a model or hypothesis.

The problems of NHST and p-hacking can be combatted by pre-registering research and using open science.

EFFECT SIZES
A statistically significant result does not tell us anything about the importance of an effect. The size of an effect can be quantified by calculating an effect size. This is an objective and standardized measure of the magnitude of an observed effect. There are several measures of effect size.

One such measure is Cohen’s d, the difference between two group means divided by a standard deviation:

$$d = \frac{\bar{X}_1 - \bar{X}_2}{s}$$

Because of this division, d is expressed in standard deviation units. The rules of thumb for Cohen’s d are the following: d = 0.2 (small), d = 0.5 (medium), d = 0.8 (large). If the standard deviations of the two groups are not equal, it is possible to use the pooled standard deviation:

$$s_p = \sqrt{\frac{(N_1 - 1)s_1^2 + (N_2 - 1)s_2^2}{N_1 + N_2 - 2}}$$

N denotes the sample size of each group and s denotes the standard deviation. Another measure of effect size is Pearson’s r. It measures the strength of a relationship between two continuous variables and uses the following rules of thumb:
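As a minimal sketch, Cohen's d with a pooled standard deviation could be computed like this. The function names and the two groups of scores are hypothetical, and the sample variance (dividing by N - 1) is assumed for each group.

```python
import math
import statistics


def pooled_sd(group1, group2):
    """Pooled standard deviation of two groups, weighting each
    group's variance by its degrees of freedom (N - 1)."""
    n1, n2 = len(group1), len(group2)
    var1 = statistics.variance(group1)
    var2 = statistics.variance(group2)
    return math.sqrt(((n1 - 1) * var1 + (n2 - 1) * var2) / (n1 + n2 - 2))


def cohens_d(group1, group2):
    """Difference between the group means in pooled standard deviation units."""
    mean_difference = statistics.fmean(group1) - statistics.fmean(group2)
    return mean_difference / pooled_sd(group1, group2)


# Hypothetical scores for a treatment group and a control group.
treatment = [6, 7, 8, 9, 10]
control = [4, 5, 6, 7, 8]
d = cohens_d(treatment, control)
print(round(d, 2))
```

With these made-up scores the means differ by 2 points and both groups have a variance of 2.5, so d is well above the 0.8 rule of thumb for a large effect.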

r       Effect size      Variance explained
0.10    Small effect     1% of total variance
0.30    Medium effect    9% of total variance
0.50    Large effect     25% of total variance
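The "variance explained" values in these rules of thumb are simply r squared. A small sketch, in which treating the rules of thumb as hard cut-offs and the label "negligible" are assumptions made for illustration:

```python
def variance_explained(r):
    """Proportion of total variance explained by a correlation of size r."""
    return r ** 2


def label_effect(r):
    """Rule-of-thumb label for r; the sign is ignored because only
    the strength of the relationship matters here."""
    size = abs(r)
    if size >= 0.5:
        return "large"
    if size >= 0.3:
        return "medium"
    if size >= 0.1:
        return "small"
    return "negligible"  # assumed label for values below 0.10


for r in (0.10, 0.30, 0.50):
    print(r, label_effect(r), f"{variance_explained(r):.0%}")
```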

Pearson’s r can vary from -1 to 1. Cohen’s d is favoured if the group sizes are very discrepant, because r can be quite biased compared to d in that situation. Another effect size is the odds ratio, which is useful for counts (contingency tables). The odds of an event occurring are the probability of the event occurring divided by the probability of that event not occurring:

$$\text{odds} = \frac{P(\text{event})}{P(\text{no event})}$$

The odds ratio is the odds of an event in one group divided by the odds of that event in another group:

$$\text{odds ratio} = \frac{\text{odds}_1}{\text{odds}_2}$$
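For a 2 x 2 contingency table the odds and the odds ratio can be computed directly from the counts. The sketch below uses a hypothetical table (recovered versus not recovered, in a treatment and a control group); none of these numbers come from the book.

```python
def odds(event_count, non_event_count):
    """Odds of the event: equivalent to P(event) / P(no event),
    because the group total cancels out."""
    return event_count / non_event_count


def odds_ratio(events_1, non_events_1, events_2, non_events_2):
    """Odds of the event in group 1 divided by the odds in group 2."""
    return odds(events_1, non_events_1) / odds(events_2, non_events_2)


# Hypothetical counts: 30 of 40 recover with treatment, 20 of 40 without.
ratio = odds_ratio(events_1=30, non_events_1=10, events_2=20, non_events_2=20)
print(ratio)
```

An odds ratio of 1 would mean the odds are the same in both groups; values further from 1 indicate a larger effect.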

META-ANALYSIS
A basic meta-analysis takes the average of the effect sizes of the studies:

$$\bar{d} = \frac{\sum_{i=1}^{k} d_i}{k}$$

That is, the sum of all the effect sizes divided by the number of studies k. An actual meta-analysis uses a weighted average instead of the simple average.
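The difference between the simple and the weighted average can be sketched as follows. The effect sizes are hypothetical, and weighting by sample size is an assumption chosen for simplicity; the summary does not say which weights are used.

```python
def simple_mean_effect(effects):
    """Unweighted average: sum of the effect sizes divided by the number of studies."""
    return sum(effects) / len(effects)


def weighted_mean_effect(effects, weights):
    """Weighted average: studies with a larger weight count more heavily."""
    return sum(w * e for w, e in zip(weights, effects)) / sum(weights)


# Hypothetical effect sizes (d) from three studies and their sample sizes.
effects = [0.2, 0.5, 0.8]
sample_sizes = [100, 50, 50]

print(simple_mean_effect(effects))
print(weighted_mean_effect(effects, sample_sizes))
```

In this made-up example the largest study found the smallest effect, so the weighted average is lower than the simple average.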

BAYESIAN APPROACHES
Bayesian statistics is about using the data you collect to update your beliefs about a model parameter or a hypothesis. Beliefs are updated based on new information. Bayes’ theorem is used to calculate the conditional probabilities. Conditional probability deals with finding the probability of an event when you know that the outcome was in some particular part of the sample space. It is most commonly used to find a probability about a category for one variable (e.g. a person being a drug user).

For events A and B, the conditional probability of event A, given that event B has occurred, is:

$$P(A \mid B) = \frac{P(A \cap B)}{P(B)}$$

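For example, for a test with a positive or negative result for depression (D = depression, Dc = no depression):

Depression    Positive            Negative            Total probability
Yes           P(Pos|D) = 0.99     P(Neg|D) = 0.01     1
No            P(Pos|Dc) = 0.02    P(Neg|Dc) = 0.98    1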

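A conditional probability can be calculated using a tree diagram or Bayes’ theorem:

$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}$$

$$\text{posterior probability} = \frac{\text{likelihood} \times \text{prior probability}}{\text{marginal likelihood}}$$

The posterior probability is our belief in a hypothesis after having considered the data. The prior probability is our belief in a hypothesis before considering the data (the base rate). The likelihood is the probability of the observed data given the hypothesis. The marginal likelihood is the probability of the observed data.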
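Applied to the depression example, Bayes' theorem gives the probability of depression given a positive test. The two conditional probabilities (0.99 and 0.02) are taken from the table above; the base rate of 0.10 is a hypothetical value assumed purely for illustration, because the summary does not give one.

```python
def posterior_probability(prior, p_data_given_h, p_data_given_not_h):
    """Bayes' theorem for a hypothesis H and its complement:
    posterior = likelihood * prior / marginal likelihood."""
    marginal_likelihood = (p_data_given_h * prior
                           + p_data_given_not_h * (1 - prior))
    return p_data_given_h * prior / marginal_likelihood


p_pos_given_d = 0.99   # P(Pos|D), from the table
p_pos_given_dc = 0.02  # P(Pos|Dc), from the table
base_rate = 0.10       # P(D): ASSUMED, not given in the summary

p_d_given_pos = posterior_probability(base_rate, p_pos_given_d, p_pos_given_dc)
print(round(p_d_given_pos, 3))
```

The lower the assumed base rate, the lower the posterior probability becomes, even though the test itself is unchanged.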

When estimating a parameter, the prior probability is a distribution of possibilities. An informative prior distribution shows which values of the parameter you believe to be more probable before considering the data. With an uninformative prior distribution (a flat line) you are prepared to believe all possible values with equal probability. A credible interval gives the limits between which 95% of the values in the posterior distribution fall.
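A 95% credible interval can be sketched with a simple grid approximation. The setting is an assumption made for illustration: the parameter is a proportion, the prior is flat (uninformative), the hypothetical data are 7 successes in 10 trials, and the interval is taken as the middle 95% of the posterior distribution.

```python
def credible_interval(successes, trials, mass=0.95, grid_size=2001):
    """Equal-tailed credible interval for a proportion under a flat prior,
    approximated on an evenly spaced grid of candidate values."""
    grid = [i / (grid_size - 1) for i in range(grid_size)]
    # With a flat prior the posterior is proportional to the likelihood.
    unnormalised = [p ** successes * (1 - p) ** (trials - successes) for p in grid]
    total = sum(unnormalised)
    posterior = [u / total for u in unnormalised]

    tail = (1 - mass) / 2
    cumulative = 0.0
    lower = upper = None
    for value, weight in zip(grid, posterior):
        cumulative += weight
        if lower is None and cumulative >= tail:
            lower = value
        if upper is None and cumulative >= 1 - tail:
            upper = value
            break
    return lower, upper


# Hypothetical data: 7 successes in 10 trials.
lower, upper = credible_interval(successes=7, trials=10)
print(lower, upper)
```

Replacing the flat prior with an informative one would amount to multiplying each likelihood value by the prior weight of that grid point before normalising.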

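Bayes’ theorem can be used to compare two hypotheses using posterior odds:

$$\frac{P(H_1 \mid \text{data})}{P(H_0 \mid \text{data})} = \frac{P(\text{data} \mid H_1)}{P(\text{data} \mid H_0)} \times \frac{P(H_1)}{P(H_0)}$$

In words: posterior odds = Bayes factor × prior odds.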

A Bayes factor is the ratio of the probability of the data given the alternative hypothesis to that for the null hypothesis. A Bayes factor greater than 1 suggests that the observed data are more likely given the alternative hypothesis than given the null.
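How a Bayes factor updates the odds of two hypotheses can be sketched in a few lines. All probabilities below are hypothetical values chosen for illustration.

```python
def bayes_factor(p_data_given_h1, p_data_given_h0):
    """Probability of the data under the alternative relative to the null."""
    return p_data_given_h1 / p_data_given_h0


def posterior_odds(bf, prior_odds):
    """Posterior odds = Bayes factor * prior odds."""
    return bf * prior_odds


# Hypothetical likelihoods of the observed data under each hypothesis.
bf = bayes_factor(p_data_given_h1=0.12, p_data_given_h0=0.03)

# Prior odds of 1 mean both hypotheses were believed equally before the data.
odds_after_data = posterior_odds(bf, prior_odds=1.0)

# Odds can be converted back into a probability for the alternative hypothesis.
p_h1_given_data = odds_after_data / (1 + odds_after_data)
print(bf, odds_after_data, p_h1_given_data)
```

Here the data are about four times more likely under the alternative hypothesis than under the null, so equal prior odds become posterior odds of about 4 to 1 in favour of the alternative.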

An advantage of Bayesian statistics is that it is not affected by the problems of NHST; a disadvantage is that it requires a prior belief, which is subjective.

 

 

 

Follow the author: JesperN