Samples, Populations, and Inferential Statistics

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Explain the difference between a sample and a population in the context of research, and why researchers often rely on samples rather than studying entire populations.

A sample is a subset of a population. Researchers often rely on samples because studying the entire population is often infeasible due to size or accessibility constraints.

Describe what inferential statistics are and explain their importance in psychological research. What does it allow researchers to do?

Inferential statistics involves drawing conclusions about a population based on data from a sample. It allows researchers to generalize findings beyond the specific sample studied.

A researcher finds a correlation of r = -0.55 between stress levels and hours of sleep. Interpret the strength and direction of this correlation. What does this correlation suggest?

This indicates a moderate to strong negative correlation. As stress levels increase, hours of sleep tend to decrease.

Explain why a correlation of r = -0.4 is considered stronger than a correlation of r = +0.3, even though 0.4 is less than 0.3 mathematically.

<p>Correlation strength is based on the absolute value of r, not its sign. -0.4 is stronger because it indicates a closer relationship regardless of direction.</p> Signup and view all the answers

How does the concept of generalizability relate to the use of samples in inferential statistics?

<p>Samples are used in inferential statistics to draw conclusions that are generalizable to the larger population they represent.</p> Signup and view all the answers

A researcher is studying the effectiveness of a new teaching method on student test scores. They implement the method in one class and compare the results to the average scores of all students in the school district. Identify the population and the sample in this scenario.

<p>The population is all students in the school district and the sample is the class using the new teaching method.</p> Signup and view all the answers

Explain why a sample mean that is very different from the population mean is considered relatively rare when using random sampling.

<p>Random sampling gives each member of the population an equal chance of being selected, meaning the sample is likely to be representative. Therefore, the sample mean will likely be close to population mean.</p> Signup and view all the answers

In the context of statistical significance, what does it mean for a sample to be significant at the 5% level?

<p>A sample that is significant at the 5% level means that the sample mean is among the 5% of sample means that are most different from the population mean.</p> Signup and view all the answers

Describe a situation where obtaining a truly random sample might be challenging or impossible.

<p>Studying a rare disease where it's difficult to identify and access all individuals affected by the disease. OR Studying illegal behavior, such as drug use, where individuals are unlikely to self-report and participate in a study.</p> Signup and view all the answers

How does the concept of random sampling relate to the goal of making inferences about a population based on a sample?

<p>Random sampling helps ensure that the sample is representative of the population, increasing the validity of inferences made about the population based on sample data.</p> Signup and view all the answers

Explain the fundamental assumption about the relationship between variables when formulating a null hypothesis.

<p>The null hypothesis assumes there is no relationship between the variables being studied.</p> Signup and view all the answers

Describe what a p-value represents in the context of hypothesis testing. What does it tell you?

<p>A p-value represents the probability of obtaining the observed results (or more extreme results) if the null hypothesis is true.</p> Signup and view all the answers

If a study yields a p-value of 0.02, what conclusion can be drawn regarding the null hypothesis, assuming a significance level of 0.05?

<p>The null hypothesis can be rejected because the p-value (0.02) is less than the significance level (0.05).</p> Signup and view all the answers

Explain the difference between rejecting and failing to reject the null hypothesis. What does each outcome imply about the study's results?

<p>Rejecting the null hypothesis suggests evidence supports a real effect or relationship. Failing to reject the null hypothesis means there is insufficient evidence to conclude there is an effect.</p> Signup and view all the answers

How does the choice of a significance level (alpha) impact the likelihood of making a Type I error (false positive)?

<p>The significance level (alpha) directly sets the probability of making a Type I error; a lower alpha reduces the risk of a false positive.</p> Signup and view all the answers

A researcher conducts a study and obtains a p-value of 0.10. Using a significance level of 0.05, explain whether the results are statistically significant and what this means for the null hypothesis.

<p>The results are not statistically significant because the p-value (0.10) is greater than the significance level (0.05). We fail to reject the null hypothesis.</p> Signup and view all the answers

Explain how a smaller p-value provides stronger evidence against the null hypothesis.

<p>A smaller p-value indicates that the observed data would be very unlikely if the null hypothesis were true, thus providing strong evidence against it.</p> Signup and view all the answers

In the context of statistical significance, what does it mean for a result to be 'statistically significant' but not 'practically significant'?

<p>A result can be statistically significant (unlikely due to chance) but not practically significant if the effect size is small or not meaningful in a real-world context.</p> Signup and view all the answers

How does the sample size influence the p-value and the likelihood of achieving statistical significance?

<p>Larger sample sizes tend to produce smaller p-values, increasing the likelihood of achieving statistical significance, even for small effects.</p> Signup and view all the answers

Describe a situation where failing to reject the null hypothesis might still be a valuable outcome in research. What can it indicate?

<p>Failing to reject the null hypothesis can be valuable when it contradicts a widely held belief or prior research, suggesting the need to re-evaluate existing theories or practices.</p> Signup and view all the answers

Flashcards

Population

The entire group you want to draw conclusions about.

Sample

A subset of the population used to make inferences about the population.

Random Sample

Each member of the population has an equal chance of being selected.

Statistical Significance

Sample means are very different from the population from which it was drawn.

Signup and view all the flashcards

Significance at the 5% level

The sample score lies within the 5% of samples which are most different from the population.

Signup and view all the flashcards

Pearson's r

A measure of the strength and direction of a linear relationship between two variables. Ranges from -1 to +1.

Signup and view all the flashcards

Positive Correlation

As one variable increases, the other also increases, or as one decreases, the other also decreases.

Signup and view all the flashcards

Negative Correlation

As one variable increases, the other decreases, and vice versa.

Signup and view all the flashcards

Inferential Statistics

Using sample data to make general statements or draw conclusions about the larger population.

Signup and view all the flashcards

Null Hypothesis (H0)

A statement of no effect or no relationship, used as a starting point for testing.

Signup and view all the flashcards

P-value

The probability of obtaining results as extreme as, or more extreme than, the observed results, assuming the null hypothesis is true.

Signup and view all the flashcards

p < 0.05

Reject the null hypothesis.

Signup and view all the flashcards

p > 0.05

Fail to reject the null hypothesis.

Signup and view all the flashcards

Alternative Hypothesis

The hypothesis that contradicts the null hypothesis. It states there is a relationship.

Signup and view all the flashcards

When do you reject the Null?

If the p-value is less than 0.05.

Signup and view all the flashcards

What does the Null state?

There is no relationship between the two variables being measured.

Signup and view all the flashcards

What does the Alternative State?

The independent variable did affect the dependent variable, and the results are significant in terms of supporting the theory being investigated (i.e.not due to chance).

Signup and view all the flashcards

How to check if a p-value is Statistically Significant?

The smaller the p-value, the stronger the evidence that you should reject the null hypothesis.

Signup and view all the flashcards

Study Notes

  • The lecture covers samples and populations, statistical significance, one-tailed and two-tailed tests, and Type I and II errors.

Recap on Correlation

  • The correlation of a sample is represented by Pearson's r
  • The range of possible values for a correlation is between -1 to +1
  • A positive correlation (+) indicates that as scores on one variable increase/decrease, so too do scores on another variable
  • A negative correlation (-) indicates that as scores on one variable increase, scores on another variable decrease, and vice versa
  • A correlation coefficient can either be weak, moderate, or strong
  • Weak correlation: r is between 0 and ± 0.29
  • Moderate correlation: r is between ± 0.3 and ± 0.59
  • Strong correlation: r is between ± 0.6 and ± 1.00
  • Example: r = -0.4 is stronger than r = +0.3

Inferential Statistics

  • Inferential statistics are used to make general statements or draw conclusions that apply beyond the sample
  • This involves drawing inferences about all scores in the population from just a sample of those scores

Samples & Populations

  • A sample is a small number of scores selected from the entirety of scores
  • A population is the entire set of scores
  • A sample is a subset taken from the full set or population of scores
  • Both population and sample refer to scores on a variable
  • The population of scores can sometimes be measured
  • More often, the population of scores is infinite and cannot feasibly be measured
  • The mean of the sample can be used as an estimation of the mean of the populaion

Random Samples

  • In statistical inference, it is generally assumed that samples are drawn at random from the population
  • Obtaining a random sample of scores entails selecting scores in such a way that each score in the population has an equal chance of being selected
  • Manual random number tables, electronic random number generators, and pulling names out of a hat are ways to obtain a random sample
  • Most random samples have a mean that is very close to the population mean
  • A sample mean obtained from random sampling that is very different from the population mean is relatively rare

Statistical Significance

  • Psychologists are interested in which sample means are very unlikely to occur through random sampling
  • The extreme 5% of these samples is of interest, and so these samples are called significant
  • Significance means that the means of the sample are very different from those of the population from which it was drawn
  • Significance at the 5% level means that the sample score lies within the 5% of samples which are most different from the population
  • This 5% is obtained from looking at the extreme lower 2.5%, and the extreme upper 2.5%
  • Scores in the middle 95% of scores are likely, and scores in the extreme 5% (extreme upper and extreme lower 5%) are unlikely

Null Hypothesis

  • The null hypothesis (H0) always makes a statement of no (null) difference/relationship between the values of a population (e.g. means) or between two variables
  • H0 = there is no relationship between the two variables being measured
  • The null hypothesis is used to define a population in which there is no relationship between two variables (i.e., the middle 95%)
  • Whether or not it is possible that the sample comes from this population is determined by the null hypothesis
  • If it is unlikely that the sample comes from the middle 95%, the possibility that the null hypothesis is true is rejected

P-Values

  • A p-value is used to find out the probability of a result occurring assuming that the null hypothesis is true
  • The null hypothesis states that there is no relationship between the two variables being studied (one variable does not affect the other) and that the results are due to chance and are not statistically significant
  • An alternative hypothesis posits that the independent variable did affect the dependent variable, and the results are significant in terms of supporting the theory being investigated and not due to chance
  • A p-value, or probability value, is a number describing how likely it is that data would have occurred by random chance (i.e. that the null hypothesis is true)
  • The level of statistical significance is often expressed as a p-value between 0 and 1
  • The smaller the p-value, the stronger the evidence that the null hypothesis should be rejected
  • A p-value less than 0.05 (p < .05) is statistically significant and indicates strong evidence against the null hypothesis
  • There is less than a 5% probability the null is correct, thus the null hypothesis is rejected
  • A p-value higher than 0.05 (p > 0.05) is not statistically significant and indicates strong evidence for the null hypothesis
  • The null hypothesis is not accepted, but the rejection of it can be failed

Notes on Statistically Significant Results

  • A statistically significant result cannot prove that a research hypothesis is correct (as this implies 100% certainty)
  • Results may state that they "provide support for" or "give evidence for" the research hypothesis, as there is still a slight probability that the results occurred by chance and the null hypothesis was correct
  • A statistically significant relationship is one that is unlikely to have occurred in the sample if there's no relationship in the population
  • The issue of whether a result is unlikely to happen by chance is an important one in establishing cause-and-effect relationships
  • A correlation can be weak but still statistically significant
  • The association is small, but not zero

Reporting P-Values

  • Report exact p values (e.g., p = .031) to three decimal places
  • Report p values less than .001 as p < .001
  • Use italics for p
  • The opposite of statistically significant is "not statistically significant", not "insignificant"

Type 1 and Type II Errors

  • Type I error: Deciding that the null hypothesis is false when it is actually true (i.e., a false positive)
  • Type II error: Deciding that the null hypothesis is true when it is actually false (i.e., a false negative)
  • In psychology, we use a very narrow threshold for statistical significance (p < .05) to reduce the likelihood that Type I or Type II errors are made

One-Tailed and Two-Tailed Significance Testing

  • A one-tailed test specifies the direction of the hypothesis
  • Example: 'Those who attend 80% of PS219 lectures throughout term will perform better than those who do not.'
  • Two-tailed tests allow the hypothesis to go in either direction
  • Example: 'There will be a difference in end of term grades between those who attend 80% of PS219 lectures throughout term than those who do not.'
  • Two-tailed tests are more widely used

Problems with P-Values

  • Problem #1: P Hacking: the inappropriate manipulation of data to produce a statistically significant result
    • Making up data points
    • Removing data points
    • Altering existing data points
    • Running lots of statistical tests until one is found that produces a statistically significant result
  • Problem #2: P-values don't show absolute certainty
    • A statistically significant result cannot prove that a research hypothesis is correct (as this implies 100% certainty)
    • P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone
  • Problem #3: p < .05 does not imply a strong effect
    • A p-value is a number describing how likely it is that data would have occurred by random chance (i.e. that the null hypothesis is true)
    • It does not measure how big the association or the difference is
    • To investigate the strength of a difference or association, effect sizes must be looked at

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser