Correlation - summary of chapter 8 of Statistics by A. Field (5th edition)

Statistics
Chapter 8
Correlation

Modeling relationships
Partial and semi-partial correlation
Comparing correlations
Calculating the effect size
How to report correlation coefficents

Modeling relationships

The data we observe can be predicted from the model we choose to fit the data plus some error in prediction.

Outcome_i= (model) + error_i
Thus
outcome_i= (b₁X_i)+error_i

z(outcome)_i = b₁z(X_i)+error_i

z-scores are standardized scores.

A detour into the murky world of covariance

The simplest way to look at whether two variables are associated is to look whether they covary.
If two variables are related, then changes in one variable should be met with similar changes in the other variable.

Covariance (x,y) = Σⁿ_i=1 ((x_i-ẍ)(y_i-ÿ))/N-1

The equation for covariance is the same as the equation for variance, except that instead of squaring the deviances, we multiply them by the corresponding deviance of the second variable.

A positive covariance indicates that as on variable deviates from the mean, the other variable deviates in the same direction.
A negative covariance indicates that as one variable deviates from the mean, the other deviates from the mean in the opposite direction.

The covariance depends upon the scales of measurement used: it is not a standardized measure.

Standardization of the correlation coefficient

To overcome the problem of dependence on the measurement scale, we need to convert the covariance into standard set of units → standardization.
Standard deviation: a measure of the average deviation from the mean.
If we divide any distance from the mean by the standard deviation, it gives us that distance in standard deviation units.
We can express the covariance in a standard units of measurement if we divide it by the standard deviation. But, there are two variables and hence two standard deviations.

Correlation coefficient: the standardized covariance

r = cov_xy/(s_xs_y)

s_x is the standard deviation for the first variable
s_y is the standard deviation for the second variable.

By standardizing the covariance we end up with a value that has to lie between -1 and +1.
A coefficient of +1 indicates that the two variables are perfectly positively correlated.
A coefficient of -1 indicates a perfect negative relationship.
A coefficient of 0 indicates no linear relationship at all.

The significance of the correlation coefficient

We can test the hypothesis that the correlation is different from zero.
There are two ways of testing this hypothesis.

We can adjust r so that its sampling distribution is normal:

z_r = ½ log_e((1+r)/(1-r))

The resulting z_rhas a standard error given by:

Se_zr = 1/(square root(N-3))

We can adjust r into a z-score

z = z_r/Se_zr

The t-statistic for r is:

t_r = (r * square root(N-2))/ (square root(1-r²))

The correlation coefficient is a commonly used measure of the size of an effect.

values of +/- 0.1 represent a small effect
values of +/- 0.5 represent a large effect
values of +/- 0.3 represent medium effect

Confidence intervals for r

Confidence intervals tell us about the likely value in the population.
Lower boundary of the confidence interval = z_r – (1,96 X SE_Zr)
Upper boundary of the confidence interval = z_r + (1,96 X SE_Zr)

Bivariate correlation

The data must be linear and normally distributed.
The outcome variable needs to be measured at the interval ratio level, as does the predictor variable.

It would be advisable to use a bootstrap to get robust confidence intervals.

Spearman’s correlation coefficient

A non-parametric statistic that is useful to minimize the effects of extreme scores or the effects of violations of the assumptions discussed in chapter 6.
Spearman’s test works by first ranking the data, and then applying Pearson’s equation to those ranks.

Kendall’s tau (non-parametric)

A non-parametric correlation.
Should be used when you have a small data set with a large number of tied ranks, if you rank the scores and many scores have the same rank.
A better estimate of the correlation in the population.

Biserial and point-biserial correlations

Often it is necessary to investigate relationships between two variables when one of the variables is dichotomous (categorical with only two categories).
The biserial and point-biserial correlation should be used in these situations.

The difference between the use of biserial and point-biserial correlations depends on whether the dichotomous variable is discrete of continuous.
A discrete,or true, dichotomy: one for which there is no underlying continuum between the categories.
A continuous dichotomy: a dichotomy for which a continuum exists.

The point-biserial correlation (r_pb) is used when one variable is a discrete dichotomy
Biserial correlation (r_b) is used when one variable is a continuous dichotomy.

Summary

Spearman’s correlation coefficient, r_s, is a non-parametric statistic and requires only ordinal data for both samples.
The point-biserial correlation coefficient, r_pb, quantifies the relationship between a continuous variable and a variable that is a discrete dichotomy (there is no continuum underlying the two categories. Like death or alive)
the biserial correlation coefficient, r_b, quantifies the relationship between a continuous variable and a variable that is a continuous dichotomy (there is a continuum underlying the two categories. Like passing an exam).
Kendall’s correlation coefficient, τ, is like Spearman’s but probably better for small samples.

Partial and semi-partial correlation

Semi-partial (or part) correlation

There is a type of correlation that can be done that allows you to look at the relationship between two variables, accounting for the effect of a third variable.

You can transform the correlation coefficient into proportion of variance by squaring them.
If we multiply the resulting proportions by 100 we turn them into percentages.

The semi-partial correlation expresses the unique relationship between two variables as a function of their total variance.

Imagine we want to look at the relationship between two variables X and Y, adjusting for the effect of Z.
The semi-partial correlation squared is the uniquely shared variance between X and Y, expressed as a proportion of the total variance in Y.

Partial correlation

Express the variance in terms of variance in Y left over when other variables have been considered.

Summary

a partial correlation quantifies the relationship between two variables while accounting for the effects of a third variable on both variables in the original correlation
A semi-partial correlation quantifies the relationship between two variables while accounting for the effects of a third variables on only one of the variables in the original correlation.

Comparing correlations

Comparing independent rs

To compare correlations we can convert them to z_r.
We can calculate the z-score of the difference between these correlations using:;

z_dfiiference= (z_r1-z_r2)/square root((1/(N₁-3))+(1/(N₂-3)))

We can look up this z-score in the appendix.

Comparing dependent rs

you can use a t-statistic to test whether a difference between two dependent correlations are significant.

T_difference= (r_xy-r_zy) square root(((n-3)(1+r_xz)) / (2(1-r²_xy-r²_xz-r²_zy+2r_xyr_xzr_zy)))

This value can be checked agains the appropriate critical value of t with N-3 degrees of freedom.

Calculating the effect size

Correlation coefficients are effect sizes.

How to report correlation coefficents

You report how big they are, their confidence intervals and significance value.

Access:

Public

Join WorldSupporter!

Join with a free account for more service, or become a member for full access to exclusives and extra support of WorldSupporter >>

This content is related to:

Summary of Discovering statistics using IBM SPSS statistics by Field - 5th edition

Check more of topic:

Statistics and Data analysis Methods

Universiteit Amsterdam: UVA

This content is used in:

Summary of Discovering statistics using IBM SPSS statistics by Field - 5th edition

Going abroad?

Insure your way around the world

International expat insurances

Travel & Worldsupporter insurances (NL)

Study with summaries

Contributions: posts

Help other WorldSupporters with additions, improvements and tips

Spotlight: topics

Check the related and most recent topics and summaries:

Activities abroad, study fields and working areas:

Statistics and Data analysis Methods

Samenvattingen voor psychologie en gedrag

WorldSupporter and development goals:

Development Goal 04: Quality Education

Institutions, jobs and organizations:

Universiteit Amsterdam: UVA

This content is also used in .....

Summary of Discovering statistics using IBM SPSS statistics by Field - 5th edition

This is a summary of the book "Discovering statistics using IBM SPSS statistics" by A. Field. In this summary, everything students at the second year of psychology at the Uva will need is present. The content needed in the thirst three blocks are already online, and the rest

...

analysis-2958826_960_720.jpg

Why is my evil lecturer forcing me to learn statisics? - summary of chapter 1 of statistics by A. Field (5th edition)

The spine of statistics - summary of chapter 2 of Statistics by A. Field (5th edition)

The beast of bias - summary of chapter 6 of Statistics by A. Field (5th edition)

Non-parametric models - summary of chapter 7 of Statistics by A. Field (5h edition)

Correlation - summary of chapter 8 of Statistics by A. Field (5th edition)

The linear model - summary of Chapter 9 by A. Field 5th edition

Comparing two means - summary of chapter 10 of Statistics by A. Field (5th edition)

Moderation, mediation, and multi-category predictors - summary of chapter 11 of Statistics by A. Field (5th edition),

Comparing several independent means - summary of chapter 12 of Statistics by A. Field (5th edition)

Analysis of covariance - summary of chapter 13 of Statistics by A. Field (5th edition)

Factorial designs - summary of chapter 14 of statistics by A. Field (5th edition)

Repeated measures designs - summary of chapter 15 of Statistics by A. Field (5th edition)

Mixed designs - summary of chapter 16 of Statistics by A. Field (5th edition)

Multivariate analysis of variance (MANOVA) - summary of chapter 17 of Statistics by A. Field (5th edition)

Exploratory factor analysis - summary of chapter 18 of Statistics by A. Field (5th edition)

Categorical outcomes: chi-square and loglinear analysis - summary of chapter 19 of Statistics by A. Field

WSRt using SPSS, manual for tests in the third block of the second year of psychology at the uva

Everything you need for the course WSRt of the second year of Psychology at the Uva

Categorical outcomes: logistic regression - summary of (part of) chapter 20 of Statistics by A. Field

Lees verder over Summary of Discovering statistics using IBM SPSS statistics by Field - 5th edition
12469 keer gelezen

Check how to use summaries on WorldSupporter.org

Submenu: Summaries & Activities

Follow the author: SanneA

Work for WorldSupporter

JoHo can really use your help! Check out the various student jobs here that match your studies, improve your competencies, strengthen your CV and contribute to a more tolerant world

Working for JoHo as a student in Leyden

Parttime werken voor JoHo

Statistics

Search a summary, study help or student organization

Select any filter and click on Search to see results

Correlation - summary of chapter 8 of Statistics by A. Field (5th edition)

Modeling relationships

The significance of the correlation coefficient

Confidence intervals for r

Bivariate correlation

Spearman’s correlation coefficient

Kendall’s tau (non-parametric)

Biserial and point-biserial correlations

Summary

Partial and semi-partial correlation

Semi-partial (or part) correlation

Partial correlation

Summary

Comparing correlations

Comparing independent rs

Comparing dependent rs

Calculating the effect size

How to report correlation coefficents

Summary of Discovering statistics using IBM SPSS statistics by Field - 5th edition

Statistics and Data analysis Methods

Universiteit Amsterdam: UVA

Summary of Discovering statistics using IBM SPSS statistics by Field - 5th edition

Contributions: posts

Add new contribution

Spotlight: topics

Statistics and Data analysis Methods

Samenvattingen voor psychologie en gedrag

Development Goal 04: Quality Education

Universiteit Amsterdam: UVA

Summary of Discovering statistics using IBM SPSS statistics by Field - 5th edition

analysis-2958826_960_720.jpg

Online access to all summaries, study notes en practice exams

How and why use WorldSupporter.org for your summaries and study assistance?

Using and finding summaries, notes and practice exams on JoHo WorldSupporter

Quicklinks to fields of study for summaries and study assistance