Podcast
Questions and Answers
A researcher measures customer satisfaction using a 7-point scale. This scale allows for ranking satisfaction levels but doesn't have a true zero point. What type of scale is being used?
A researcher measures customer satisfaction using a 7-point scale. This scale allows for ranking satisfaction levels but doesn't have a true zero point. What type of scale is being used?
- Nominal scale
- Ratio scale
- Interval scale (correct)
- Ordinal scale
Which measure of central tendency is most appropriate for summarizing the most common nationality in a diverse group of international students?
Which measure of central tendency is most appropriate for summarizing the most common nationality in a diverse group of international students?
- Range
- Median
- Mode (correct)
- Mean
In a dataset of income levels, a few extremely high values are present. Which measure of central tendency would be least affected by these outliers?
In a dataset of income levels, a few extremely high values are present. Which measure of central tendency would be least affected by these outliers?
- Median (correct)
- Range
- Mean
- Mode
Which of the following is NOT a measure of variability?
Which of the following is NOT a measure of variability?
A dataset has a skewness value of -3. What can you infer about the shape of the distribution?
A dataset has a skewness value of -3. What can you infer about the shape of the distribution?
If the kurtosis of a dataset is 8, what does this indicate about the distribution's shape?
If the kurtosis of a dataset is 8, what does this indicate about the distribution's shape?
In hypothesis testing, if the p-value is 0.02, and the alpha level is set at 0.05, what decision should be made regarding the null hypothesis?
In hypothesis testing, if the p-value is 0.02, and the alpha level is set at 0.05, what decision should be made regarding the null hypothesis?
What is the potential consequence of setting a very high alpha (α) level (e.g., 0.10 or higher) in hypothesis testing?
What is the potential consequence of setting a very high alpha (α) level (e.g., 0.10 or higher) in hypothesis testing?
A study compares customer satisfaction scores before and after a new service implementation. Which statistical test is most appropriate to analyze the data?
A study compares customer satisfaction scores before and after a new service implementation. Which statistical test is most appropriate to analyze the data?
Which test is used to determine if the variances of two groups are equal before conducting an independent samples t-test?
Which test is used to determine if the variances of two groups are equal before conducting an independent samples t-test?
For what type of variable is a Chi-Square test used?
For what type of variable is a Chi-Square test used?
In experimental design, which type of experiment typically has higher internal validity but lower external validity?
In experimental design, which type of experiment typically has higher internal validity but lower external validity?
In the context of experimental design, which of the following refers to a threat to internal validity due to events occurring during the experiment that are not part of the treatment?
In the context of experimental design, which of the following refers to a threat to internal validity due to events occurring during the experiment that are not part of the treatment?
You want to compare the average test scores of students in three different teaching methods to determine if there is a significant difference. Which statistical test should you use?
You want to compare the average test scores of students in three different teaching methods to determine if there is a significant difference. Which statistical test should you use?
What does the F-statistic in ANOVA represent?
What does the F-statistic in ANOVA represent?
What does a Pearson's correlation coefficient (r) of -1 indicate?
What does a Pearson's correlation coefficient (r) of -1 indicate?
In simple linear regression, the equation is given as $Y = a + bX$. What does 'b' represent?
In simple linear regression, the equation is given as $Y = a + bX$. What does 'b' represent?
What does R² (Coefficient of Determination) in regression analysis represent?
What does R² (Coefficient of Determination) in regression analysis represent?
In multiple regression, what does a high Variance Inflation Factor (VIF) indicate?
In multiple regression, what does a high Variance Inflation Factor (VIF) indicate?
If a researcher increased the sample size in a study, what effect would that have on the standard error of the mean (SE)?
If a researcher increased the sample size in a study, what effect would that have on the standard error of the mean (SE)?
Flashcards
Nominal Scale
Nominal Scale
Categories with different names and no implied order.
Ordinal Scale
Ordinal Scale
Categories with a meaningful order.
Interval Scale
Interval Scale
Ordered categories with equal intervals but no absolute zero point.
Ratio Scale
Ratio Scale
Signup and view all the flashcards
Frequency Distribution
Frequency Distribution
Signup and view all the flashcards
Mean
Mean
Signup and view all the flashcards
Median
Median
Signup and view all the flashcards
Mode
Mode
Signup and view all the flashcards
Range
Range
Signup and view all the flashcards
Interquartile Range (IQR)
Interquartile Range (IQR)
Signup and view all the flashcards
Standard Deviation (SD)
Standard Deviation (SD)
Signup and view all the flashcards
Coefficient of Variation (CV)
Coefficient of Variation (CV)
Signup and view all the flashcards
Skewness
Skewness
Signup and view all the flashcards
P-value
P-value
Signup and view all the flashcards
Alpha (α) Level
Alpha (α) Level
Signup and view all the flashcards
Type I Error
Type I Error
Signup and view all the flashcards
Type II Error
Type II Error
Signup and view all the flashcards
Independent Samples T-Test
Independent Samples T-Test
Signup and view all the flashcards
Pearson's Correlation
Pearson's Correlation
Signup and view all the flashcards
R² (Coefficient of Determination)
R² (Coefficient of Determination)
Signup and view all the flashcards
Study Notes
Scales of Measurement
- Nominal scales categorize data with distinct names, implying no inherent order (e.g., gender, nationality).
- Ordinal scales categorize data with a meaningful order or ranking (e.g., education level).
- Interval scales feature ordered categories with equal intervals but lack a true zero point (e.g., satisfaction ratings on a 1-7 scale); non-comparative rating scales are interval if they have 5+ points.
- Ratio scales represent interval scales possessing a true zero point (e.g., age, weight).
Frequency Distribution
- Frequency distributions summarize the occurrence rate for each value of a variable.
- Frequency distributions do not indicate skewness.
Measures of Central Tendency
- The mean is the arithmetic average.
- The mean is suitable for interval and ratio data types.
- The mean is susceptible to skewed data.
- The median is the middle value in a dataset.
- The median is used for ordinal, interval, and ratio data.
- The median divides a dataset into two equal halves.
- The mode represents the most frequent value in a dataset.
- The mode is applicable to nominal and ordinal variables.
- The mode indicates the distribution's highest peak.
- In a normal distribution, the mean, median, and mode are equal.
Measures of Variability
- The range is the difference between the largest and smallest values in a dataset.
- The range is sensitive to outliers.
- The interquartile range (IQR) is the difference between the 75th and 25th percentiles.
- A small SD indicates values clustered around the mean.
- The coefficient of variation (CV) is the ratio of the standard deviation to the mean.
- The CV is useful for comparing different variables, tracking the same variable over time or across groups, and analyzing variables measured in different units.
Measures of Shape: Skewness & Kurtosis
- Skewness measures the symmetry of a distribution.
- A skewness value less than -2 or greater than 2 indicates high skewness.
- A long right tail indicates positive skewness.
- A long left tail indicates negative skewness.
- Kurtosis measures the peakedness or flatness of a distribution.
- Kurtosis values less than -7 or greater than 7 signify a significant deviation from normal.
- Positive kurtosis indicates a sharply peaked distribution.
- Negative kurtosis indicates a flatter distribution.
Probability
- The total probability of mutually exclusive outcomes equals 1.
- A probability less than 5% is considered an unlikely event.
- The 5% level is a critical threshold in statistical analysis.
- Probability helps assess the likelihood of observing a result if the null hypothesis (H₀) is true.
- Sampling error affects the ability to generalize findings from a sample to a population.
- The standard error of the mean (SE) estimates sampling error based on the sample's standard deviation and size.
Confidence Intervals
- A 95% confidence interval suggests the population mean lies within the range with 95% probability.
- The 95% confidence interval is not a guarantee.
Hypothesis Testing Basics
- Inferential statistics draw conclusions about a population from a sample.
- The null hypothesis (H₀) posits no effect or difference.
- The alternative hypothesis (H₁) suggests an expected effect.
- The p-value indicates the probability of observing a sample result if H₀ is true.
- A p-value less than 0.05 leads to the rejection of H₀.
- A p-value of 0.05 is a common cut-off point for rejecting H₀.
- A p-value higher than 0.05 indicates failure to reject H₀.
- A p-value of 0.001 provides strong evidence against H₀.
- The alpha (α) level is the maximum acceptable probability of a Type I error, commonly set at 0.05.
Type I and Type II Errors
- A Type I error (false positive) involves rejecting H₀ when it is true.
- The risk of a Type I error equals the alpha level (e.g., 5%).
- A Type II error (false negative) involves failing to reject H₀ when it is false.
- Increasing the sample size reduces the risk of a Type II error.
T-Tests
- An independent samples t-test compares means from two different groups (e.g., male vs. female income), assuming equal variances checked by Levene’s Test.
- A paired samples t-test compares two variables within the same group (e.g., satisfaction before vs. after service).
- A one-sample t-test compares a sample mean to a known value (e.g., hotel satisfaction vs. industry average).
- A T-distribution is flatter and wider than a normal distribution, particularly with small samples.
- Levene’s test checks whether two or more groups have equal variances, which is important in tests like independent sample t-tests and ANOVA.
Chi-Square Tests
- Chi-square tests assess the association between categorical variables.
- Chi-square tests cannot be used for continuous variables (e.g., age vs. shopping).
- Expected frequencies represent what we would expect if no association exists.
- Statistical significance is indicated by a p-value less than 0.05.
Experimental Design & Validity
- The goal is to determine causal relationships between variables.
- Causality requires concomitant variation, time-order of variables, and elimination of other causes.
- The independent variable (IV) is manipulated.
- The dependent variable (DV) is measured.
- Threats to internal validity (extraneous variables) include history, maturation, instrumentation, statistical regression, selection bias, mortality, and testing effect.
- Threats to external validity include artificial situations, inappropriate samples, and inappropriate timing.
- Pre-experimental designs such as one-group pretest-posttest lack a control group.
- True experimental designs include a control group and randomization; types include pretest-posttest with control and posttest-only with control.
- Lab experiments offer controlled conditions and high internal validity but may have low external validity.
- Field experiments occur in real-world settings, providing high external validity but potentially lower internal validity.
One-Way ANOVA
- One-way ANOVA compares means across two or more groups.
- The DV is continuous (interval/ratio).
- The IV is categorical (nominal/ordinal).
- The null hypothesis (H₀) states that all group means are equal.
- The F-statistic is the ratio of between-group variance to within-group variance.
- An F-statistic ≈ 1 suggests H₀ is likely true.
- Within-group variance is the variation within each category.
- Between-group variance is the variation between group means.
- A p-value less than 0.05 leads to the rejection of H₀, indicating a significant effect.
- A p-value greater than or equal to 0.05 indicates failure to reject H₀, suggesting no significant effect.
Correlation & Regression
- Pearson’s correlation measures the strength and direction of the linear relationship between two continuous variables.
- r = 1 indicates a perfect positive correlation.
- r = 0 indicates no correlation.
- r = -1 indicates a perfect negative correlation.
- A scatter plot visualizes relationships and helps detect linearity and outliers.
- Simple regression predicts Y (DV) from X (IV) using the equation Y = a + bX, where a is the intercept and b is the slope.
- A positive b indicates a positive relationship.
- A negative b indicates a negative relationship.
- b = 0 indicates no relationship.
- R² (coefficient of determination) represents the proportion of variance in the dependent variable explained by the model.
- Higher R² indicates better predictive power.
- Multiple regression predicts Y from multiple Xs (IVs), including continuous and nominal-binary predictors.
- Multicollinearity in multiple regression can be checked using the variance inflation factor (VIF).
- High VIF indicates a strong correlation between IVs, which can be problematic.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.