Statistics  Quiz
48 Questions
17 Views

Statistics Quiz

Created by
@emilyroseblack

Questions and Answers

What is the percentage of scores that fall between -1 and 1 standard deviations in a normal distribution?

About 68%

What is a characteristic of the standard normal distribution?

Having a mean of 0 and a standard deviation of 1

What is the purpose of transforming raw scores into z-scores?

To compare scores on different variables

What does a Pearson's r value of -0.61 indicate?

<p>A negative relationship</p> Signup and view all the answers

What does the 'strength' of a relationship refer to in a scattergram?

<p>How closely bunched around the imaginary line the dots are</p> Signup and view all the answers

How is the formula for Pearson's r defined?

<p>The mean of the products of paired z-scores</p> Signup and view all the answers

What does the numerator in the Pearson's r formula refer to?

<p>The covariance between the two variables</p> Signup and view all the answers

What is the purpose of using z-scores?

<p>All of the above</p> Signup and view all the answers

What is the value range of Pearson's r?

<p>From -1 to 1</p> Signup and view all the answers

What happens when the relationships between two variables depart from linearity?

<p>Pearson's r will underestimate the correlation</p> Signup and view all the answers

What is the definition of a skewed distribution?

<p>A distribution where the most frequently occurring scores are clustered at one end of the scale</p> Signup and view all the answers

What is kurtosis related to?

<p>The peakedness of a distribution</p> Signup and view all the answers

What can be done when assumptions for the test of inference regarding Pearson's r are not satisfied?

<p>All of the above</p> Signup and view all the answers

What is the null hypothesis in Pearson's r significance test?

<p>The correlation in the population is 0</p> Signup and view all the answers

What is the method used to find the regression line called?

<p>Method of least squares</p> Signup and view all the answers

What does the F-value represent in the ANOVA table?

<p>The ratio between the portion of total variance accounted for by the regression line, and the variance not accounted for by the regression line</p> Signup and view all the answers

Which of the following correctly defines R-Square?

<p>All of the above</p> Signup and view all the answers

What does Beta represent in the coefficient table of the SPSS output for regression?

<p>The slope of the regression line</p> Signup and view all the answers

What does the standardised regression coefficient Beta represent?

<p>The amount of increase or decrease in the outcome variable per standard deviation unit of the predictor</p> Signup and view all the answers

When is the statement 'In a scatterplot describing the relationship between two standardised variables, Beta is the slope of the regression line' true?

<p>Always, regardless of the nature of the two standardised variables</p> Signup and view all the answers

What is the relationship between Beta and R in simple regression?

<p>Beta is always equal to R</p> Signup and view all the answers

What determines the value of Beta?

<p>The clustering of data points around the regression line</p> Signup and view all the answers

In a non-experimental study, can you infer causality from a simple regression analysis?

<p>No, never</p> Signup and view all the answers

How many predictors are required to run multiple regression?

<p>At least 2</p> Signup and view all the answers

Where do you find the goodness-of-fit of the overall model in the SPSS output?

<p>ANOVA table</p> Signup and view all the answers

What type of effects does the Coefficients table in the SPSS output refer to?

<p>Unique effects of each predictor</p> Signup and view all the answers

How do you enter predictors in hierarchical multiple regression?

<p>In steps</p> Signup and view all the answers

What is a categorical predictor?

<p>A predictor with categorical labels</p> Signup and view all the answers

What does the path c represent in a mediation model?

<p>Direct effects</p> Signup and view all the answers

What is the conceptualisation of the standardised coefficient for the indirect effect in mediation analysis?

<p>The number of standard deviations by which the outcome changes for each standard deviation increase in the predictor, indirectly via the mediator</p> Signup and view all the answers

Can the variable 'gender' mediate the effect of intelligence on a mathematical test?

<p>No, because to say that intelligence may have an effect on gender is nonsensical</p> Signup and view all the answers

Which variable is the mediating variable in the hypothesis that getting better grades improves mental health because better grades increase self-esteem?

<p>Self-esteem</p> Signup and view all the answers

How many multiple regressions do you need to run in order to obtain the standardised regression coefficients that are necessary to report the results of a mediation analysis pictorially?

<p>1</p> Signup and view all the answers

What is the essence of moderation analysis?

<p>Testing the hypothesis that the effect of a predictor variable on an outcome variable changes as a function of the level of a third variable</p> Signup and view all the answers

What is the moderating variable in the hypothesis that the effect of a greater number of negative events on mental health will depend on age-group?

<p>Age-group</p> Signup and view all the answers

What is the 'interaction term' in moderation analysis?

<p>The product of the predictor and the moderating variable</p> Signup and view all the answers

What does centring do to a variable?

<p>Transforms the variable into deviations around a fixed point, the grand mean</p> Signup and view all the answers

What does the coefficient for the effect of M on Y represent when centring both X and M?

<p>The effect of M on Y for a case with an average score on X</p> Signup and view all the answers

When is a predictor considered categorical?

<p>When the predictor is measured on a categorical (or nominal) scale.</p> Signup and view all the answers

What is the purpose of dummy coding?

<p>To compare the categories of a categorical variable to a baseline category.</p> Signup and view all the answers

What is the value assigned to the baseline category in a dummy variable?

<p>0</p> Signup and view all the answers

If a categorical predictor has four categories, how many dummy variables should be created?

<p>3</p> Signup and view all the answers

What is an outlier in regression analysis?

<p>A case that differs substantially from the main trend.</p> Signup and view all the answers

When can we say that there is multicollinearity?

<p>When two predictors are very strongly correlated.</p> Signup and view all the answers

What is the purpose of checking the Durbin-Watson index?

<p>To check for independence of errors.</p> Signup and view all the answers

What is the mediator in the hypothesis that glucose consumption improves cognitive performance because glucose consumption enhances energy levels?

<p>Energy levels</p> Signup and view all the answers

What happens to the effect of a predictor on an outcome when a mediator is added to the equation and the effect is completely mediated?

<p>The effect becomes zero.</p> Signup and view all the answers

What does it mean if the unstandardised coefficient for the indirect effect of a predictor on an outcome has BootLLCI=0.21 and BootULCI=0.46?

<p>We can be 95% confident that the unstandardised coefficient will be included between 0.21 and 0.46.</p> Signup and view all the answers

Study Notes

Normal Distribution and Standardization

  • A normal distribution is a bell-shaped distribution of scores.
  • In a normal distribution, about 68% of cases fall between the values -1 and 1 standard deviations.
  • About 1% of cases fall above 3 standard deviations.
  • A standard deviation value of 345 or 100 + 725 is equivalent to 1 if transformed into z-scores.
  • A standard normal distribution has a mean of 0 and a standard deviation of 1.

Z-Scores

  • A z-score is a score expressed in standard deviation units.
  • Transforming raw scores into z-scores implies a linear transformation.
  • Z-scores can be used to:
    • Know the percentile of a score on a variable.
    • Compare a score on a variable with a score on another variable.
    • Know which score on a variable is at a specific percentile.

Correlation

  • A correlation coefficient (Pearson's r) indicates the strength and direction of the relationship between two variables.
  • A correlation coefficient can take values from -1 to 1.
  • A correlation coefficient of -0.61 indicates a negative relationship.
  • A correlation coefficient of 0.00 indicates no relationship.
  • A correlation coefficient of 0.06 indicates a small positive relationship.
  • The strength of a relationship refers to how closely bunched around the imaginary line the dots are in the scattergram.

Pearson's r Formula

  • The formula for Pearson's r is defined as the mean of the products of paired z-scores.
  • The numerator in the Pearson's r formula refers to the sum of the products of the paired z-scores.
  • If one of the variables is constant, the correlation is undefined.

Scattergram

  • The two axes of a scattergram are called the ordinate (vertical axis) and abscissa (horizontal axis), or Y-axis and X-axis.
  • If the relationships between two variables depart from linearity, Pearson's r will underestimate the correlation.

Correlation in Restricted Range

  • When the correlation is calculated using a restricted range, the correlation will either be reduced or inflated.

Outliers

  • The value of Pearson's r will be affected by outliers, either inflated or reduced.

Perfect Positive Correlation

  • A perfect positive correlation (r = 1) can only occur when X and Y distributions have exactly the same shape.

Skewed Distribution

  • A skewed distribution is a distribution where the most frequently occurring scores are clustered at one end of the scale.

Kurtosis

  • Kurtosis refers to the peakedness of a distribution.

Assumptions for Pearson's r

  • Interval data is required for Pearson's r.
  • If assumptions are not satisfied, tests specifically designed for non-interval data can be used, or alpha level can be lowered.

Regression Analysis

  • Regression analysis is used to predict the value of a variable (Y) based on the value of another variable (X).
  • The regression line is a straight line drawn through a scatterplot of two variables, that comes as close to the data points as possible.
  • The method used to find the regression line is called the method of least squares.
  • The intercept in regression analysis is the point at which the regression line cuts across the Y-axis.
  • The slope, or regression coefficient, represents the change in Y for a one-unit change in X.
  • The true linear regression equation is y = a + b * x.

Simple Regression

  • In simple regression, the effect of a single predictor (X) on an outcome (Y) is assessed.
  • The value of the regression coefficient (b) depends on the steepness of the slope.
  • The value of the standardized regression coefficient (β) depends on how closely clustered around the line the data points are.

Multiple Regression

  • In multiple regression, the effect of multiple predictors (X1, X2, …) on an outcome (Y) is assessed.
  • The number of predictors required is at least 2.
  • The ANOVA table is used to assess the goodness-of-fit of the overall model.
  • The coefficients table provides information about the unique effects of each predictor.
  • The R-Square value represents the proportion of the total variance in Y explained by the predictors.
  • The F-value represents the ratio between the portion of total variance accounted for by the regression line and the variance not accounted for by the regression line.

Hierarchical Multiple Regression

  • In hierarchical multiple regression, the effects of multiple predictors on an outcome are assessed in a step-by-step manner.
  • The R-Square Change value represents the additional variance explained by a specific model.
  • The Sig. F Change value is used to determine if the R-Square Change value is statistically significant.
  • The model with the highest number of predictors is referred to as Model 3.

Categorical Predictors

  • A categorical predictor is a predictor measured on a categorical (or nominal) scale.
  • Dummy coding is used to represent categorical predictors.
  • The b value of a categorical predictor with two categories is equal to the difference between the mean scores obtained on the outcome by the two categorical groups.

Mediation Analysis

  • Mediation analysis assesses the effect of a predictor (X) on an outcome (Y) through a mediator (M).
  • The indirect effect of X on Y through M is represented by paths a and b.
  • The direct effect of X on Y is represented by path c.
  • The standardized coefficient for the indirect effect can be obtained using PROCESS.
  • The mediation model can be used to test the hypothesis that a mediator (M) completely mediates the effect of a predictor (X) on an outcome (Y).

Moderation Analysis

  • Moderation analysis assesses the effect of a predictor (X) on an outcome (Y) as a function of a moderator (M).
  • The interaction term is the product of the predictor and the moderator.
  • The coefficient for the effect of M on Y represents the effect of M on Y for a case with an average score on X.
  • The coefficient for the effect of X on Y represents the effect of X on Y for a case with an average score on M.
  • A simple slope analysis shows the effect of the predictor on the outcome at different levels of the moderator.

Regression Diagnostics

  • Regression diagnostics are used to assess the accuracy of the regression model.
  • Outliers are cases that differ substantially from the main trend.
  • Influential cases are cases that affect the precision of the estimation of the regression coefficient.
  • Multicollinearity occurs when two or more predictors are highly correlated.
  • Normality of residuals, independence of errors, and homoscedasticity are assumptions of regression analysis.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Description

Test your knowledge of normal distributions, standard deviations, and z-scores in statistics. Questions cover characteristics of normal distributions and transformations.

More Quizzes Like This

Use Quizgecko on...
Browser
Browser