Midterm

Questions and Answers

Which historical figure is most closely linked to the p = .05 significance threshold?

  • Pierre-Simon Laplace
  • John Arbuthnot
  • Karl Pearson
  • Ronald Fisher (correct)

When using a sample to model a population, which conditions should ideally be met? (Select all that apply)

  • The data are normally distributed
  • A bootstrap re-sampling procedure was used
  • The sample is extremely large
  • The sample is representative of the population (correct)
  • The data were sampled independently (correct)

For normally distributed data, what is the probability of a random score falling within one standard deviation of the mean?

  • Approximately 95%
  • Approximately 50%
  • Approximately 68% (correct)
  • Approximately 99.7%
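
The 68% figure can be checked directly from the normal cumulative distribution. This is a minimal sketch using only the standard library; the function name `prob_within` is illustrative and does not come from the exam.

```python
import math

def prob_within(k: float) -> float:
    """Probability that a normally distributed score lies within k SDs of the mean."""
    # For a normal distribution, P(|Z| <= k) = erf(k / sqrt(2)).
    return math.erf(k / math.sqrt(2))

p1 = prob_within(1)  # about 0.683
p2 = prob_within(2)  # about 0.954
p3 = prob_within(3)  # about 0.997
print(round(p1, 3), round(p2, 3), round(p3, 3))
```

The same function reproduces the other two answer options (about 95% within two SDs, about 99.7% within three).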

If a study's results accurately reflect a population, which is most likely?

  • The true population mean lies within the confidence interval boundaries. (D, correct)

What does the provided Python code block perform? (Select all that apply)

  • Generates Monte Carlo data (A, correct)
  • Calls a function (E, correct)
  • Resamples existing data (F, correct)

Which example illustrates a non-directional hypothesis?

  • Participants in a study of yoga versus mindfulness meditation will report similar levels of anxiety. (C, correct)

In a Monte Carlo simulation, how would changing from a smaller to a larger standard deviation impact the resulting data?

  • The resulting data would show a wider spread. (B, correct)
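
A small simulation can make the effect of the standard deviation visible. This is a sketch under made-up assumptions: the population mean of 100, the two SD values, and the helper name `simulate_scores` are all hypothetical.

```python
import random
import statistics

def simulate_scores(mean, sd, n, seed):
    """Draw n scores from a normal population (Monte Carlo data)."""
    rng = random.Random(seed)  # seeded so the run is repeatable
    return [rng.gauss(mean, sd) for _ in range(n)]

# Same mean, different population SDs (hypothetical values).
narrow = simulate_scores(mean=100, sd=5, n=10_000, seed=1)
wide = simulate_scores(mean=100, sd=15, n=10_000, seed=1)

# The spread of the simulated data tracks the SD used to generate it.
sd_narrow = statistics.stdev(narrow)
sd_wide = statistics.stdev(wide)
print(round(sd_narrow, 1), round(sd_wide, 1))
```

Both data sets are centred on the same mean; only the spread differs.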

In the Murky Research scenario, what statistic does the provided plot primarily display?

  • Median (C, correct)

Which formula calculates the unbiased sample variance?

  • $s^2 = \frac{\sum_{i=1}^{n}(x_i-\bar{x})^2}{n-1}$ (A, correct)
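
The formula can be written out in a few lines and compared with the standard library. This is a sketch with made-up scores; `sample_variance` is an illustrative name, not something from the exam.

```python
import statistics

def sample_variance(xs):
    """Unbiased sample variance: sum of squared deviations divided by n - 1."""
    n = len(xs)
    mean = sum(xs) / n
    return sum((x - mean) ** 2 for x in xs) / (n - 1)

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data

s2 = sample_variance(scores)                 # divides by n - 1
s2_library = statistics.variance(scores)     # also divides by n - 1
s2_biased = statistics.pvariance(scores)     # divides by n, so it is smaller
print(s2, s2_library, s2_biased)
```

Dividing by n instead of n - 1 gives the smaller value, which tends to underestimate the population variance when computed from a sample.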

Based on the sample distribution provided, which point estimator is likely most efficient?

  • Median (C, correct)

If p(event 1 | event 2) = p(event 1), what does this indicate?

  • The two events are independent (A, correct)
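
The definition can be checked by enumerating a small sample space. This sketch assumes two fair dice as the example (not part of the exam); the helper names `prob` and `cond_prob` are illustrative.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 equally likely outcomes of rolling two fair dice.
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event, given as a predicate over outcomes."""
    return Fraction(sum(1 for o in outcomes if event(o)), len(outcomes))

def cond_prob(event_1, event_2):
    """p(event 1 | event 2) = p(event 1 and event 2) / p(event 2)."""
    return prob(lambda o: event_1(o) and event_2(o)) / prob(event_2)

def first_is_six(o):
    return o[0] == 6

def second_is_even(o):
    return o[1] % 2 == 0

def sum_is_twelve(o):
    return sum(o) == 12

# Independent: knowing the second die is even leaves p(first is six) unchanged.
indep_conditional = cond_prob(first_is_six, second_is_even)
indep_marginal = prob(first_is_six)

# Dependent: knowing the first die is six changes p(sum is twelve).
dep_conditional = cond_prob(sum_is_twelve, first_is_six)
dep_marginal = prob(sum_is_twelve)
```

In the first pair the conditional and marginal probabilities match (both 1/6), which is exactly the condition in the question; in the second pair they differ (1/6 versus 1/36).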

In a bootstrap simulation, how would reducing the number of resampled participants affect the resulting data?

  • Increase the variability in the estimated parameters. (B, correct)
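
The effect of the resample size can be seen by bootstrapping the same data twice. This is a sketch under assumptions: the "observed" sample is made up, and `bootstrap_means` is a hypothetical helper, not the function from the exam.

```python
import random
import statistics

def bootstrap_means(data, resample_size, num_iterations, seed):
    """Resample with replacement and return the mean of each resample."""
    rng = random.Random(seed)
    return [
        statistics.fmean(rng.choices(data, k=resample_size))
        for _ in range(num_iterations)
    ]

# Made-up "observed" sample of 200 participants.
data_rng = random.Random(0)
data = [data_rng.gauss(50, 10) for _ in range(200)]

# Spread of the bootstrapped means with 200 versus 20 resampled participants.
spread_full = statistics.stdev(bootstrap_means(data, 200, 2000, seed=1))
spread_small = statistics.stdev(bootstrap_means(data, 20, 2000, seed=1))
print(round(spread_full, 2), round(spread_small, 2))
```

With fewer resampled participants, each resample mean rests on less information, so the bootstrapped estimates scatter more widely.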

When conducting significance tests, what do we do?

  • Presume that the null hypothesis is true. (C, correct)

What is another name for standard deviation?

  • The average deviation of scores in the sample, in original units. (B, correct)

Event X and Event Y are disjoint events. What is the probability of both occurring together?

  • 0 (D, correct)

Which of the following is a random process? (Select all that apply)

  • That it will snow tomorrow. (A, correct)
  • The number of items you will get correct on this exam. (C, correct)

In the math program study, what is a critical element in the logic of the researcher's randomization test?

  • The originally obtained statistic is compared to the distribution of obtained values. (C, correct)
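
The logic of comparing the obtained statistic with a distribution of re-randomized values can be sketched as follows. The change scores are made up, the function name `randomization_test` is hypothetical, and the one-sided difference in means is an assumed choice of statistic; the exam's own code may differ.

```python
import random
import statistics

def randomization_test(group_a, group_b, num_iterations=1000, seed=0):
    """One-sided randomization test on the difference in group means."""
    rng = random.Random(seed)
    observed = statistics.fmean(group_a) - statistics.fmean(group_b)
    pooled = list(group_a) + list(group_b)
    n_a = len(group_a)
    null_diffs = []
    for _ in range(num_iterations):
        # If the null hypothesis is true, group labels are arbitrary,
        # so participants can be reassigned to groups at random.
        rng.shuffle(pooled)
        null_diffs.append(
            statistics.fmean(pooled[:n_a]) - statistics.fmean(pooled[n_a:])
        )
    # Proportion of re-randomized differences at least as large as the observed one.
    p_value = sum(d >= observed for d in null_diffs) / num_iterations
    return observed, null_diffs, p_value

# Made-up change scores for 100 intervention and 100 control children.
data_rng = random.Random(42)
intervention = [data_rng.gauss(6, 3) for _ in range(100)]
control = [data_rng.gauss(4, 3) for _ in range(100)]

observed, null_diffs, p_value = randomization_test(intervention, control)
print(round(observed, 2), p_value)
```

The re-randomized differences cluster around zero because shuffling removes any real group effect; the obtained statistic is then judged by how far into the tail of that distribution it falls.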

In the math experiment, the researcher hypothesizes that the intervention group will improve more than the control group. Is this directional or non-directional?

  • Directional (A, correct)

A researcher conducts a randomization test, re-randomizing the data and computing the statistic 1000 times, and then plots the distribution of those statistics. Which histogram, ‘A’ or ‘B’, should the researcher use to evaluate the results?

  • Histogram B (A, correct)

A set of random samples is generated from a Monte Carlo simulation. Based on the work you’ve done, which of the following snippets of code most probably produced the data for this histogram?

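  • Option C (correct):

```python
import random
import numpy as np

# num_iterations, sample_size, mean_1, mean_2, population_sd and the four
# arrays are assumed to be defined before this loop runs.
for i in range(num_iterations):
    # Draw one simulated sample per group from a normal population.
    for j in range(sample_size):
        sample_1[j] = random.gauss(mean_1, population_sd)
        sample_2[j] = random.gauss(mean_2, population_sd)
    # Store the mean of each simulated sample for this iteration.
    mean_1_sample[i] = np.mean(sample_1)
    mean_2_sample[i] = np.mean(sample_2)
```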

Flashcards

Who is associated with p = .05 significance?

The threshold (p = .05) is linked to Ronald Fisher's statistical significance testing approach.

Conditions for Sample as Population Model

The sample should be representative of the population and sampled independently.

Data within 1 SD of Mean

Approximately 68% of the data fall within one standard deviation of the mean in a perfectly normally distributed data set.

Study results representative

The true population mean likely falls within the boundaries of the confidence interval.

What the Python Code does

Produces a histogram, resamples existing data, and generates Monte Carlo data.

Non-Directional Hypothesis

A hypothesis that does not specify the direction of an effect (i.e., it predicts neither an increase nor a decrease).

Monte Carlo & Standard Deviation

Increased standard deviation leads to more variability in results.

Which Statistic?

Median is the statistic represented.

Independent Events

Neither event has any bearing on the probability of the other occurring.

Random process example

Random processes have uncertain outcomes, e.g., whether it will snow tomorrow or the result of flipping a fair coin.

Standard Error of the Mean

The standard error of the mean is the standard deviation of the sampling distribution of the mean; it estimates how much sample means vary from sample to sample.
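
The standard error can be estimated from a single sample with the formula s / sqrt(n), and checked by simulating many sample means. This is a sketch; the population mean, SD, and sample size are made-up values.

```python
import math
import random
import statistics

rng = random.Random(3)
population_mean, population_sd, n = 100, 15, 25  # made-up values

# Formula-based estimate from one sample: s / sqrt(n).
sample = [rng.gauss(population_mean, population_sd) for _ in range(n)]
sem_formula = statistics.stdev(sample) / math.sqrt(n)

# Simulation check: the SD of many sample means approaches sigma / sqrt(n).
sample_means = [
    statistics.fmean([rng.gauss(population_mean, population_sd) for _ in range(n)])
    for _ in range(5000)
]
sem_simulated = statistics.stdev(sample_means)
print(round(sem_formula, 2), round(sem_simulated, 2))
```

With these values sigma / sqrt(n) is 15 / 5 = 3, so the simulated value should land close to 3, while the single-sample estimate will vary more from run to run.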

Disjoint Events Probability

Probability of both disjoint events simultaneously occurring is zero.

Standard Deviation Descriptor

Shows the typical deviation of scores from the mean, in the original units.

Cars in study: proportion of luxury brands

The numerator is 426+879. The denominator is 3784.

Cars in study: p(stopping | luxury vehicle)

The numerator is 426. The denominator is 1305.

Cars in study: p(non-luxury vehicle and not stopping)

The numerator is 832. The denominator is 3784.
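
The denominators 1305 and 3784 in the flashcards above are consistent with a 2×2 table of counts. The cell labels in this sketch are an assumption inferred from those numbers (1305 = 426 + 879 luxury cars; 832 non-luxury cars that did not stop); the exam's original table is not reproduced here.

```python
from fractions import Fraction

# Assumed 2x2 table, inferred from the flashcard fractions (not from the exam itself).
counts = {
    ("luxury", "stopped"): 426,
    ("luxury", "did_not_stop"): 879,
    ("non_luxury", "stopped"): 1647,
    ("non_luxury", "did_not_stop"): 832,
}
total = sum(counts.values())  # all cars observed
luxury_total = sum(v for (brand, _), v in counts.items() if brand == "luxury")

# Marginal probability: p(luxury) uses all cars as the denominator.
p_luxury = Fraction(luxury_total, total)
# Conditional probability: p(stopped | luxury) uses only luxury cars as the denominator.
p_stop_given_luxury = Fraction(counts[("luxury", "stopped")], luxury_total)
# Joint probability: p(non-luxury and did not stop) uses all cars as the denominator.
p_nonlux_and_no_stop = Fraction(counts[("non_luxury", "did_not_stop")], total)
print(p_luxury, p_stop_given_luxury, p_nonlux_and_no_stop)
```

The key difference between the three questions is the denominator: a conditional probability restricts it to the cars that satisfy the condition, whereas marginal and joint probabilities use every car observed. `Fraction` prints each result in lowest terms.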

Sample Space?

The set of all possible outcomes of a random process.

Study Notes

  • Exam covers multiple choice, select all that apply, matching, fill in the blank, short answer, graph interpretation, and code interpretation items.
  • Responses must be written within the provided boxes, and handwriting must be clear and legible.
  • No items require long answers.
  • If an item cannot be solved, mark it for review and return to it later.
  • Writing the student ID number on the top of each page and ensuring written answers are legible will earn 3 extra credit points.

Question 1

  • Ronald Fisher is most closely associated with the p = .05 threshold for significance.

Question 2

  • Identify the conditions for a sample to be a good model of its population (select all that apply):
    • Data are normally distributed.
    • Sample is extremely large.
    • Data were sampled independently.
    • Sample is representative of the population.

Question 3

  • Determine the probability of randomly drawing a score that falls within 1 standard deviation of the mean when the data are perfectly normally distributed.

Question 4

  • If the results of a study are genuinely representative of a population, the true population mean will most likely lie within the boundaries of the confidence interval.

Question 5

  • Python code is provided.
  • Identify which of the following the code performs:
    • Produces a histogram
    • Resamples existing data
    • Generates Monte Carlo data
    • Defines a bootstrap function
    • Calls a function
    • Creates an array of individual scores

Question 6

  • The study example that illustrates a non-directional hypothesis:
    • Participants in a study of yoga versus mindfulness meditation will report similar levels of anxiety.

Question 7

  • Determine what happens to resulting data in a Monte Carlo simulation if the standard deviation is changed from a smaller value to a larger value.

Questions 8 and 9

  • Dr. Murky is evaluating the effectiveness of two new aftershave advertisements in 150 pharmacies and supermarkets in London, Ontario.
  • 50% of locations are randomly selected to display Ad 1, with the rest carrying Ad 2 for one week.
  • The plot shows the total number of products purchased with each advertisement present.
  • Report the averages for Advertisement 1 and Advertisement 2 to the nearest whole number based on the data.
  • Identify which statistic the average is showing:
    • Arithmetic Mean
    • Median
    • Mode
    • Geometric Mean

Question 10

  • Determine whether the two advertisements differ in the number of sales they promoted, and explain why or why not.

Question 11

  • Identify the formula that Dr. Murky should use to calculate the variance of each sample.

Question 12

  • Rank order the elements of the study from most (rank = 1) to least (rank = 5) important using the boxes beside each item to provide rankings:
    • The two advertisements and any differences between them
    • The calculated statistics based on the sample data
    • How the ads were selected for placement at each location
    • The number of days the trial ran
    • The participant sample (people in the London, Ontario area who buy men's aftershave)

Question 13

  • Based on a provided sample distribution, identify which point estimator is likely to be most efficient for the data presented.

Question 14

  • Provide a single sentence to explain the answer given in question 13.

Question 15

  • If p(event 1 | event 2) = p(event 1), determine which is true:
    • The two events are related to one another
    • The two events are independent
    • The two events are complementary
    • The probability of event 1 depends on the probability of event 2

Question 16

  • Determine what would happen to the resulting data in a bootstrap simulation assuming the number of resampled participants is reduced.

Question 17

  • When conducting significance tests one would:
    • Test the research hypothesis to determine if it is true.
    • Always use a statistical threshold of p = .05.
    • Presume that the null hypothesis is true.
    • Presume a cause-and-effect relationship between the independent and dependent variables.

Question 18

  • Describe the standard error of the mean.

Question 19

  • List the assumptions a researcher must make to calculate the 95% confidence interval using a formula, rather than a bootstrap approach.
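
For contrast with the bootstrap approach, a formula-based interval can be sketched as mean ± z × s / sqrt(n). This sketch assumes independent observations and an approximately normal sampling distribution of the mean; the data and the function name `formula_ci` are made up for illustration.

```python
import math
import statistics

def formula_ci(sample, confidence=0.95):
    """Normal-approximation confidence interval for the mean."""
    n = len(sample)
    mean = statistics.fmean(sample)
    sem = statistics.stdev(sample) / math.sqrt(n)
    # Critical z value, about 1.96 for 95% confidence.
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    return mean - z * sem, mean + z * sem

scores = [12, 15, 11, 14, 13, 16, 12, 15, 14, 13]  # made-up data
lower, upper = formula_ci(scores)
print(round(lower, 2), round(upper, 2))
```

For small samples a t critical value is typically used in place of z, which widens the interval somewhat.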

Question 20

  • Identify the biased statistics:
    • Mean
    • Median
    • Mode
    • Variance
    • Standard Deviation

Question 21

  • Event X and Event Y are disjoint events.
  • Determine the probability of both events occurring together.

Question 22

  • Choose a good descriptor for the standard deviation of a sample around its mean:
    • The interval within which the mean is 68% likely to lie.
    • The average squared deviation of scores in the sample, in squared units.
    • The average deviation of scores in the sample, in original units.
    • The population parameter sigma.

Question 23

  • Study: California study observed whether drivers stopped for pedestrians at crosswalks/intersections or sped through, categorized by luxury (BMW, Audi, Porsche) vs. non-luxury brands (Nissan, Kia, Toyota).
  • Observations were recorded between 11 AM and 1 PM over a 14-day period.
  • Numerator & denominator are required for the following questions, not the completed calculations.
  • What proportion of the cars observed in the study were luxury brands?

Question 24

  • What is the probability of stopping for approaching pedestrians, given that the car is a luxury vehicle?

Question 25

  • What is the probability of driving a non-luxury vehicle and not stopping for approaching pedestrians?

Question 26

  • Describe what a Type II error is.
  • What is the probability of making one in the research?

Question 27

  • Select whether any of the following describe a random process:
    • The month of March 2023 will follow the month of February 2023.
    • The number of items you will get correct on this exam.
    • That it will snow tomorrow.
    • Flipping heads on a coin with two heads sides.

Question 28

  • Researcher examines whether a new math program helps kids learn.
  • 200 3rd-grade children are sampled; 100 are in the intervention and the remainder are enrolled in the standard grade 3 curriculum.
  • After the school year, children take a post-test.
  • A histogram displays improvement as change scores (post-test - pre-test), with higher scores reflecting more improvement.
  • Data are not normally distributed, so a randomization test is used to examine the study outcome.
  • Determine the critical element in the logic of the randomization test and why it works:
    • If the obtained p-value is smaller than 0.05, the null hypothesis can be rejected.
    • The design is experimental because the participants have been randomly assigned to groups.
    • The originally obtained statistic is compared to the distribution of obtained values.
    • If the null hypothesis is true, participants can be reassigned to groups at random.

Question 29

  • In the experiment's design, the researcher is investigating the difference between pre-test and post-test scores.
  • Is the hypothesis directional or non-directional?

Question 30

  • The researcher conducts the randomization test, re-randomizing the data and computing the statistic 1000 times.
  • Determine which histogram should be used.

Question 31

  • Explain the reasoning behind the choice.

Question 32

  • The researcher's test statistic is 2.83.
  • Describe the statistical decision based on the selected histogram.

Question 33

  • The data in the adjacent graph are from a Monte Carlo simulation.
  • Identify which of the following snippets of Python code most probably produced the data for the histogram.

Question 34

  • Based on what one learned about confidence intervals, interpret the results shown in the provided figure.

Question 35

  • Select the best distribution for the adjacent data plot:
    • Uniform
    • Binomial
    • Gaussian
    • Irregular
    • The original distribution cannot be determined from this plot

Question 36

  • Explain the reasoning for the choice from the previous item.

Question 37

  • Select the reason not to use a 100% confidence interval:
    • There is a chance that we would miss the true population mean due to sampling error.
    • The interval would be so wide that it would become uninformative.
    • It is not traditional to use a 100% confidence interval.
    • The confidence interval formula does not allow for this.

Question 38

  • Match the distribution with its most likely descriptor (not all descriptors will be used):
    • Platykurtic
    • Normal
    • Binomial
    • Positively Skewed
    • Negatively Skewed

Question 39

  • Car thieves are operating near the UWO campus!
  • Last week, three cars were reported stolen and the police are investigating these crimes.
  • Each crime has two possible outcomes: the car is recovered ('R') or the car remains missing ('L').
  • Determine which option shows the complete sample space for the results of this investigation:
    • {R, R, R, L, L, L}
    • {RRR, RRL, RLL, LLL}
    • {RRR, RRL, RLR, RLL, LRR, LRL, LLR, LLL}
    • None of these options accurately describe the sample space.

Question 40

  • What is the probability that exactly one car will be recovered?
  • Show the correct numerator and denominator without completing the calculation.
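
The sample space for three cars can be enumerated mechanically. This is a sketch; treating the resulting count ratio as a probability additionally assumes that all eight outcomes are equally likely, which the question does not state explicitly.

```python
from itertools import product

# Each of the three cars is either recovered ("R") or still missing ("L").
sample_space = ["".join(outcome) for outcome in product("RL", repeat=3)]

# Outcomes in which exactly one car is recovered.
exactly_one = [o for o in sample_space if o.count("R") == 1]

# Numerator and denominator, assuming equally likely outcomes.
numerator, denominator = len(exactly_one), len(sample_space)
print(sample_space)
print(exactly_one, numerator, denominator)
```

Order matters in the listing: RRL, RLR, and LRR are distinct outcomes because they refer to different cars, which is why the sample space has eight entries rather than four.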

Description

Exam guidelines covering multiple-choice, matching, and short-answer questions. Focus on clear handwriting and time management. Key topics include Ronald Fisher's significance threshold and conditions for a good statistical model.
