Answers to assignments - Inferential Statistics, Leiden University
- 1615 keer gelezen
The answers can be found in the corresponding upload (right column)
Theory
What symbols are used for the population mean and the sample mean?
What is the difference between the H0 and the Ha?
What is the definition of a p-value?
What is the definition of the rejection criterion α?
Application
Select the correct response(s). (More than one may be correct.) The p-value for testing H0 : µ = 100 against Ha : µ is not 100 is p = .001. This indicates that:
1. There is strong evidence that µ = 100.
2. There is strong evidence that µ is not 100.
3. There is strong evidence that µ > 100.
There is strong evidence that µ < 100.
If µ were equal to 100, it would be unusual to obtain data such as those observed.
It is important that you understand the difference between a sample and a population, and that you understand the difference between the symbols for them. A sample is the one test result set that a research or investigation delivers, or one group of people from one specific experiment, while a population is an overview of anybody who fits to the requirements. Below is a table of different symbols for the two different situations.
Parameter | Population | Sample |
Mean | µ | x̄ |
Probability | P | P |
Standard Deviation | σ | S |
Theory
What is meant by events A and B being independent?
What is meant by events A and B being disjoint?
Can disjoint events be independent from each other?
What are the formulas for the:
General additive rule
Additive rule for disjoint events
Complement rule
General multiplicative rule
Multiplicative rule for independent events
What is the definition a random variable?
What is the expected value of a random variable?
What is the expected value of X?
What is the variance of a random variable?
What is the expected value of the sum of two random variables?
What is the variance of the sum of two random variables?
There are three types of relationships between probability and variables. First there is Marginal, which means there is only a single variable involved. Then there is joint, when there are two or more variables, dependently or independently involved and last, there is conditional, which means that for instance event A needs to happen before it is even possible for even B to occur.
The thing about H0 and Ha is that they are about the truth of the parameters of the population.
Theory
How do numerical data differ from categorical data?
When are two categorical variables independent?
What are the three different research designs for chi-square tests?
How does one determine the degrees of freedom for a χ2-test of independence?
How does one determine the degrees of freedom for a goodness of fit χ2-test?
In a χ2-test of independence, what is the formula for the expected cell frequency fe?
In a χ2-test of independence, what is the formula for the χ2 test statistic?
When deciding when to use which chi-squared test, there is a certain scheme you can follow that can help you. First, look at how many populations you have. If this number is more that just one, you know right away that you need to use the test for homogeneity. It the answer is one population, look at how many variables you have. If this is also just one, take the goodness of fit test. If you have more than one variable, use the test of independence.
The thing about chi-squared tests is that they are about the difference between two or more populations.
The difference between dependent and independent samples is that dependent are mostly before and after, husband and wife, so related statistics. Independent
Theory
What is the general H0 for a one-sample z- or t- test?
What is the standard deviation of the distribution of sample means if σ is unknown?
What is the standard deviation of the distribution of sample means if σ is known?
What is the formula for the z test statistic?
What is the formula for the t test statistic?
When is the p value multiplied by 2: For one-sided or two-sided hypothesis tests?
Keep in mind that the observed results are always rounded numbers because, well, you can’t really observe half people, now can you? The expected, or usual results can, however, have decimals, because they can be an average.
Also, if the definition of the degrees of freedom sounds hard, look at it like this; df = number of rows – 1/ number of columns – 1
The thing with T and Z test is that they are about the difference between the sample and the population
Theory
What is the general H0 and Ha for a two-sample t test for dependent samples?
What is the general H0 and Ha for a two-sample t test for independent samples?
What is the number of degrees of freedom of a t test when:
2 paired variables with each n values?
2 independent samples with n1 and n2 values and σ21 , σ22?
2 independent samples with n1 and n2 values and σ21 = σ22?
What is the formula for the test statistic for a two-sample t test for independent samples with unequal variances?
Why can we leave µ1 − µ2 from the formula most of the time when using the test statistic for a two-sample t test of independent samples ?
What is the formula for the test statistic for a two-sample t test of dependent samples?
The thing with matched pairs/paired sample/dependant samples is that they are about the difference between each other. It is a one sample test, tested twice. Like, for instance, the difference between the man and the woman of a couple, or a person’s right and left arm.
Also, when you have a large t-test statistic, this means there is a smaller p-value. This, in turn, means that H0 is more likely to be rejected, because p can fall into the rejection area more quickly.
Standard Error is the standard deviation of a sampling distribution
Theory
What is the definition of a 95% confidence interval of µ?
What is the formula for it?
What is the formula of the 95% confidence interval of µ1 and µ2?
What is the formula of the effect size η2?
What is the formula of the effect size Cohen’s d?
Application
Which of the following interpretations of a 95% confidence interval (CI) is/are correct?
A 95% CI contains 95% of the data in the population.
The probability that the population parameter is in this particular 95% CI is 95%.
95% of all CIs that are calculated in this manner will capture the population parameter.
I am 95% sure that the mean of a sample will fall within the 95% CI for the mean.
To consider variances equal, the ration of the different standard deviations have to be at least equal or smaller that 2 (variance with a ratio of µ). This is calculated by Sbigger/Ssmaller. If the ratio is indeed two or less, use the following formula: SEx̄1 - x̄2 = √( s2p/n1 + s2p/n2)
In which case S2p= (n1 – 1) × s21 + (n2 – 1) × s22 / n1 + n2 – 2.
If the ratio is larger than 2: SEx̄1 - x̄2 = √( s21/n1 + s22/n2)
Theory
What is a nonparametric test?
In which two situations do you conduct a nonparametric test (instead of a parametric one)?
Which Wilcoxon test is suitable for paired data? Which one for independent samples?
What do you do when two or more observations have equal scores?
What are the formulas for the expected value of the sum and for the standard deviation of the Wilcoxon signed-rank test?
What are the formulas for the expected value of the sum and for the standard deviation of the Wilcoxon rank-sum test?
What are the formula’s for the z and t calculations.
Use the information from Howell 18.4
Analyse the data by hand with a Wilcoxon rank-sum test. Assume that the test statistic has a normal distribution (Howell 18.3).
Rank the counts.
Add the rank numbers for one of the groups.
Calculate the expected value of the sum (under H0).
Calculate the corresponding standard deviation.
Calculate the value of the test statistic z.
What is the probability p on the result (or even more extreme) found?
What is your statistical and substantive conclusion at α = .05
When n is large enough (20 - 30 or more) it doesn’t matter what your original data looked like, the sampling distribution will be normal.
When can you say that there is a systematic difference between two small samples? When P is smaller than the critical value.
Parametric tests | Non-parametric |
Z- test | Wilcoxon’s tests (ordinal data and small n) |
T-test | Chi-squared (nominal data) |
When dealing with a rank-sum test, you’ll need to use a z-test. When dealing with a signed rank test, you’ll need to use t. When one of the data in a signed rank test is 0, it disappears from the equation and you n becomes smaller.
They won’t ask what type of chi-squared test you are dealing with. Read the guidelines for the SPSS test in the workbook, they’re important!
On a chi-squared, you never double P of half α. In a 2 by x table, φ and Cramer’s V are always the same, because k-1 turns into 1.
A one-way chi-squared test is just one row of data, such as skittles. A two-way test is dealing with two or more variables
Expected frequency is row total x column total / by all.
Theory
What is the formula for the effect measure φ for a χ2-test for 2-by-2 contingency tables?
What is the formula for the effect measure Cramer’s V for a χ2-test?
Join with a free account for more service, or become a member for full access to exclusives and extra support of WorldSupporter >>
There are several ways to navigate the large amount of summaries, study notes en practice exams on JoHo WorldSupporter.
Do you want to share your summaries with JoHo WorldSupporter and its visitors?
Main summaries home pages:
Main study fields:
Business organization and economics, Communication & Marketing, Education & Pedagogic Sciences, International Relations and Politics, IT and Technology, Law & Administration, Medicine & Health Care, Nature & Environmental Sciences, Psychology and behavioral sciences, Science and academic Research, Society & Culture, Tourisme & Sports
Main study fields NL:
JoHo can really use your help! Check out the various student jobs here that match your studies, improve your competencies, strengthen your CV and contribute to a more tolerant world
2211 | 2 |
Add new contribution