Podcast
Questions and Answers
When should a bar chart be used to represent data?
When should a bar chart be used to represent data?
- When the data involves time-series analysis.
- When visually graphing categorical variables. (correct)
- When graphically representing numerical or continuous data.
- When the data is inherently complex and requires detailed analysis.
What is the primary goal of statistical inference?
What is the primary goal of statistical inference?
- To accurately measure and record data from an entire population.
- To avoid the need for sampling by using existing registry data.
- To use a small sample to make an educated guess about the entire population. (correct)
- To analyze errors within the sample itself.
Which of the following is an example of systematic error?
Which of the following is an example of systematic error?
- Using a wrongly calibrated weighing machine. (correct)
- Variations that occur due to individual differences.
- Shifting each measurement by a random amount.
- Errors that arise from timing depending on reaction time.
How can diversity impact random error in measurement?
How can diversity impact random error in measurement?
How does increasing sample size typically affect random error?
How does increasing sample size typically affect random error?
What is a key characteristic of systematic errors?
What is a key characteristic of systematic errors?
Which of the following is a method used to address systematic errors?
Which of the following is a method used to address systematic errors?
A researcher plans to sample 500 people to determine the average age in a country, but a co-researcher suggests using the national birth and death registry instead. Why might this suggestion eliminate the need for statistical inference?
A researcher plans to sample 500 people to determine the average age in a country, but a co-researcher suggests using the national birth and death registry instead. Why might this suggestion eliminate the need for statistical inference?
What is a hypothesis?
What is a hypothesis?
A researcher is measuring the height of a giraffe using a measuring tape, but is getting measurements of 4.8m, 5.1m, and 5.3m when the true height is 5m. What type of error is this an example of?
A researcher is measuring the height of a giraffe using a measuring tape, but is getting measurements of 4.8m, 5.1m, and 5.3m when the true height is 5m. What type of error is this an example of?
What does random error introduce into study results?
What does random error introduce into study results?
How can random error typically be reduced?
How can random error typically be reduced?
A researcher is determining whether people in Town A have a different mean BMI compared to people in Town B. Which of the following represents the null hypothesis (HO)?
A researcher is determining whether people in Town A have a different mean BMI compared to people in Town B. Which of the following represents the null hypothesis (HO)?
In hypothesis testing, what does it mean to 'reject the null hypothesis'?
In hypothesis testing, what does it mean to 'reject the null hypothesis'?
In the context of hypothesis testing, what is the role of 'evidence' obtained from a study?
In the context of hypothesis testing, what is the role of 'evidence' obtained from a study?
A study finds that the mean height of a sample of men from Town A is 2 cm taller than an equivalent sample from Town B. What would you need to do before definitively claiming the same is true of the entire population of Town A and B?
A study finds that the mean height of a sample of men from Town A is 2 cm taller than an equivalent sample from Town B. What would you need to do before definitively claiming the same is true of the entire population of Town A and B?
When is it necessary to check for normality, while comparing 2 interventions?
When is it necessary to check for normality, while comparing 2 interventions?
Which type of data are parametric tests applicable to?
Which type of data are parametric tests applicable to?
If data is 'normal' then can use:
If data is 'normal' then can use:
A study aims to evaluate the impact of a hand hygiene training program on reducing hospital-acquired infections (HAIs). What is the most appropriate statistical test to evaluate the decline, six months before vs six months after the program was implemented?
A study aims to evaluate the impact of a hand hygiene training program on reducing hospital-acquired infections (HAIs). What is the most appropriate statistical test to evaluate the decline, six months before vs six months after the program was implemented?
What does a 95% confidence interval provide?
What does a 95% confidence interval provide?
How do confidence intervals help quantify random error?
How do confidence intervals help quantify random error?
Which factor does NOT influence the width of a confidence interval?
Which factor does NOT influence the width of a confidence interval?
What would need to happen in order to be more sure of our results?
What would need to happen in order to be more sure of our results?
A study estimates the mean systolic blood pressure in a population to be 120 mmHg and a 95% confidence interval of 115 to 125 mmHg. What best describes the meaning?
A study estimates the mean systolic blood pressure in a population to be 120 mmHg and a 95% confidence interval of 115 to 125 mmHg. What best describes the meaning?
A study comparing systolic blood pressure (BP) between two groups, had a 95% confidence interval for the mean difference of 1-5mmHg. Which of the following is true?
A study comparing systolic blood pressure (BP) between two groups, had a 95% confidence interval for the mean difference of 1-5mmHg. Which of the following is true?
How would you correctly evaluate whether the effect of an intervention is large enough to matter in real-world applications?
How would you correctly evaluate whether the effect of an intervention is large enough to matter in real-world applications?
If a study is statistically significant with a p-value of less than 0.05, which of the following is true?
If a study is statistically significant with a p-value of less than 0.05, which of the following is true?
Suppose the p-value were 0.01, what can we interpret from this?
Suppose the p-value were 0.01, what can we interpret from this?
Which of the following is a limitation of the p-value?
Which of the following is a limitation of the p-value?
Which of the following assumptions is made when interpreting a p-value?
Which of the following assumptions is made when interpreting a p-value?
What happens in 95% of cases with the confidence interval?
What happens in 95% of cases with the confidence interval?
What are some of the limitations of Statistical Significance?
What are some of the limitations of Statistical Significance?
How are systematic errors addressed?
How are systematic errors addressed?
How are Random errors addressed?
How are Random errors addressed?
After preforming statistical tests successfully, what do you need, to quantify?
After preforming statistical tests successfully, what do you need, to quantify?
What does Diversity do when it comes to random error?
What does Diversity do when it comes to random error?
According to the prompt, How are Random errors only reduced?
According to the prompt, How are Random errors only reduced?
From the 4S trial and the information provided, how would one interpret the effect of Simvastatin on mortality from cardio events?
From the 4S trial and the information provided, how would one interpret the effect of Simvastatin on mortality from cardio events?
What type of data is best represented by a bar chart?
What type of data is best represented by a bar chart?
What inherent characteristic contributes to random error in measurement?
What inherent characteristic contributes to random error in measurement?
In a study measuring average income in a specific town, increasing the diversity of the sample population is most likely to:
In a study measuring average income in a specific town, increasing the diversity of the sample population is most likely to:
In hypothesis testing, what is the purpose of the null hypothesis?
In hypothesis testing, what is the purpose of the null hypothesis?
A study aims to compare the effectiveness of Drug A versus Drug B on reducing blood pressure. Which of the following represents a valid null hypothesis?
A study aims to compare the effectiveness of Drug A versus Drug B on reducing blood pressure. Which of the following represents a valid null hypothesis?
A researcher conducts a study and obtains a p-value of 0.03. What decision should they make regarding the null hypothesis, assuming a significance level of 0.05?
A researcher conducts a study and obtains a p-value of 0.03. What decision should they make regarding the null hypothesis, assuming a significance level of 0.05?
In a study comparing a new treatment to an existing one, what does 'sufficient evidence' imply in the context of hypothesis testing?
In a study comparing a new treatment to an existing one, what does 'sufficient evidence' imply in the context of hypothesis testing?
Two researchers are investigating the average height of students at two different universities. After conducting their studies, they find a 3cm difference in height. To determine if this difference applies to the entire student population, what must they do?
Two researchers are investigating the average height of students at two different universities. After conducting their studies, they find a 3cm difference in height. To determine if this difference applies to the entire student population, what must they do?
When comparing two interventions, which type of data requires checking for normality?
When comparing two interventions, which type of data requires checking for normality?
What does it mean for data to be considered 'normal'?
What does it mean for data to be considered 'normal'?
What is the most suitable statistical test to evaluate the impact that a hand hygiene training program has on reducing hospital-acquired infections (HAIs)?
What is the most suitable statistical test to evaluate the impact that a hand hygiene training program has on reducing hospital-acquired infections (HAIs)?
What information does a 95% confidence interval provide in research?
What information does a 95% confidence interval provide in research?
How do confidence intervals help in quantifying random error within a study?
How do confidence intervals help in quantifying random error within a study?
Which of the following is NOT a factor influencing the width of a confidence interval?
Which of the following is NOT a factor influencing the width of a confidence interval?
In a study estimating mean systolic blood pressure, what would increase confidence in the results?
In a study estimating mean systolic blood pressure, what would increase confidence in the results?
A study estimates the mean systolic blood pressure to be 120 mmHg with a 95% confidence interval of 115 to 125 mmHg. What is the best interpretation of this result?
A study estimates the mean systolic blood pressure to be 120 mmHg with a 95% confidence interval of 115 to 125 mmHg. What is the best interpretation of this result?
A 95% confidence interval for the mean difference in systolic blood pressure (BP) between two groups is reported as 1-5mmHg. What conclusion, according to the 95% rule, can be drawn from this information?
A 95% confidence interval for the mean difference in systolic blood pressure (BP) between two groups is reported as 1-5mmHg. What conclusion, according to the 95% rule, can be drawn from this information?
How should one correctly evaluate whether the effect of an intervention is relevant in clinical settings?
How should one correctly evaluate whether the effect of an intervention is relevant in clinical settings?
From the options below, what does statistical significance tell us?
From the options below, what does statistical significance tell us?
With a p-value of 0.01, what can we interpret?
With a p-value of 0.01, what can we interpret?
From the following options, what is a limitation of the p-value?
From the following options, what is a limitation of the p-value?
When interpreting a p-value, what underlying assumption do you need to make?
When interpreting a p-value, what underlying assumption do you need to make?
What occurs in 95% of cases with the confidence interval?
What occurs in 95% of cases with the confidence interval?
What are the limitations of statistical significance?
What are the limitations of statistical significance?
What happens in the case of systematic error?
What happens in the case of systematic error?
From the options below, what can be used, for statistical methods to estimate, as methods to reduce errors?
From the options below, what can be used, for statistical methods to estimate, as methods to reduce errors?
How does Diversity impact random error?
How does Diversity impact random error?
How are Random errors reduced?
How are Random errors reduced?
With the 4S trial, how would one interpret the effects of Simvastatin?
With the 4S trial, how would one interpret the effects of Simvastatin?
Based on the baseline characteristics of the patients in the Electronic Communications and Home Blood Pressure Monitoring Trial table, which type of chart is the most appropriate to display the distribution of education levels?
Based on the baseline characteristics of the patients in the Electronic Communications and Home Blood Pressure Monitoring Trial table, which type of chart is the most appropriate to display the distribution of education levels?
A researcher wants to determine if there is a significant difference in mean BMI between Town A and Town B. Which statistical test is most appropriate if the data are normally distributed and population variances are equal?
A researcher wants to determine if there is a significant difference in mean BMI between Town A and Town B. Which statistical test is most appropriate if the data are normally distributed and population variances are equal?
A researcher measures the height of the same group of students twice using two different methods. What statistical test would be used to find a statistically signifiant difference?
A researcher measures the height of the same group of students twice using two different methods. What statistical test would be used to find a statistically signifiant difference?
A study is conducted to determine if there is a difference in resting heart rate in the study subjects at 6 a.m. and 6 p.m. What statistical test would be used?
A study is conducted to determine if there is a difference in resting heart rate in the study subjects at 6 a.m. and 6 p.m. What statistical test would be used?
A dental surgeon recorded collected data on sugary snack consumption and dental carries, but coded each as yes or no from separate participants. WHich test should they preform?
A dental surgeon recorded collected data on sugary snack consumption and dental carries, but coded each as yes or no from separate participants. WHich test should they preform?
To evaluate the impact of a hand hygiene program on the number of HAI infections among a nursing staff, what is the appropriate test?
To evaluate the impact of a hand hygiene program on the number of HAI infections among a nursing staff, what is the appropriate test?
To evaluate the effects the change in BP among three groups, which is most applicable?
To evaluate the effects the change in BP among three groups, which is most applicable?
Flashcards
Statistical Inference
Statistical Inference
Using a small sample to make an educated guess about the entire population.
Systematic Error
Systematic Error
Occurs in the same direction and magnitude every time a measurement is taken.
Random Error
Random Error
Shifts each measurement from its true value by a random amount and in a random direction.
P-value
P-value
Signup and view all the flashcards
Null Hypothesis (H0)
Null Hypothesis (H0)
Signup and view all the flashcards
Alternative Hypothesis (H1)
Alternative Hypothesis (H1)
Signup and view all the flashcards
Confidence Interval
Confidence Interval
Signup and view all the flashcards
Parametric Tests
Parametric Tests
Signup and view all the flashcards
Non-Parametric Tests
Non-Parametric Tests
Signup and view all the flashcards
Clinical significance
Clinical significance
Signup and view all the flashcards
Bar Chart
Bar Chart
Signup and view all the flashcards
Study Notes
Tutorial 5 - RCT (2)
- Focuses on statistical inference, hypothesis testing, and statistical tests.
- Reviews the graphical representation of data.
Data Types
- Two main data types: numerical (continuous) and categorical (ordinal, nominal, proportions).
- For categorical data, bar charts are appropriate for visually graphing categorical variables.
- Bar charts represent proportions/frequencies of different categories, for easily comparing how groups (e.g., race or sex) are distributed within interventions.
- Using the wrong type of figure can obscure data and can lead to misunderstanding.
Graphical Representation Questions
- Question 1a: For the "Race" row in the table, a bar chart is the appropriate graphical representation because race is a categorical variable.
- Question 1b: For the "Antihypertensive medication classes" row in the table, a bar chart is the correct choice since medication classes are categorical.
- Question 1c: For the "Education" characteristic, a bar chart is suitable because education levels are categorical.
- Question 1d: For the "Age" characteristic, a box plot is more appropriate as age is a numerical variable.
Statistical Inference Basics
- Statistical inference uses a small sample to make an educated guess about the entire population.
- It involves the process of sampling from a population to make inferences about that population.
Measurement Errors
- Two types of errors in measurement: systematic and random.
- Systematic error: occurs in the same direction and magnitude every time.
- Systematic error can be an error in the measurement tool/process.
- Random error: shifts each measurement from its true value by random amount and direction.
- Random error is an inherent part of all measurement processes.
- Increased diversity can amplify random error.
- Increasing sample size and repeating measurements can reduce random error.
Systematic vs. Random Errors
- Systematic Errors: Characterized by inaccuracy, skewing measurements away from true value and leading to inability to draw valid conclusions.
- Random Errors: Characterized by imprecision, creating variability around the true value, but averaging can cancel the variability to get closer.
- Systematic errors are addressed through study designs (crosssectional, cohort, casecontrol, intervention studies).
- Random errors are addressed via statistical inference (hypothesis testing, confidence intervals).
Understanding Statistical Inference
- Statistical inference may not be needed if you can calculate a population parameter from the birthdate of every citizen, which can be derived if there's access to ALL of the population data.
- Statistical inference is used in research studies because access to the entire population is often infeasible, so inferences have to be made from samples.
Consolidating Key Concepts
- Measurements of a giraffe's height (true height is 5m) yield measurements of 4.8m, 5.1m, and 5.3m. This is an example of random error.
- Random error can be reduced through repeated measurements.
- Researchers can reduce systematic error via training in correct measurement processes.
- Increasing sample size does not fix systematic error. Systematic error cannot be fixed by changing the sample size
Random Error Summary
- Represents natural fluctuations in data that occur without specific cause.
- Introduces variability in study results, obscuring effects, affecting statistical power, and potentially leading to misleading conclusions.
- Can only be reduced by increasing sample size.
- Can be estimated using statistical methods such as hypothesis testing and confidence intervals.
Hypothesis Testing - an Overview
- Hypothesis testing: Step 1, Identify the Question, Step 2, Select the Statistical Test, Step 3, Compare the Evidence, Step 4, Make your Conclusions
Setting up Hypothesis Testing
- Question: Does Intervention A (IA) lead to statistically significant weight loss compared to Intervention B (IB).
- Null hypothesis (H0): No difference in weight loss between IA and IB.
- Alternative hypothesis (H1): There is a difference in weight loss between IA and IB.
Defining Hypotheses
- HO ("null" hypothesis): Suggests no difference between your observations.
- H1: Suggests a real difference between observations.
- Example Question "Do people in Town A have a different mean BMI compared to people in Town B?” requires an independent t-test.
- There is no difference in the mean BMI between Town A and Town B (H0).
- H1: There is a difference in the mean BMI between Town A and Town B.
Hypothesis Testing in Practice
- Question: "Class A and Class B have a mix of girls and boys. The mean IQ scores of students in both classes were measured. Do mean IQ scores differ between Class A and Class B?". The correct H0 is, "There is no difference in the mean IQ scores between Class A and Class B."
- Question: "Do children from families with a higher income have a different incidence of dental caries than children from families with a lower income?" The correct H1 is, "There is a difference in the incidence of caries between children from families of different income."
"Proving" Your Hypothesis
- Statistical proof assumes the null hypothesis is true and seeks evidence against it. That's the baseline for hypothesis testing.
- Insufficient evidence to reject the null hypothesis: Unable to reject the null hypothesis.
- Sufficient evidence to reject the null hypothesis: Able to reject the null hypothesis.
Hypothesis Testing Details
- The evidence needed to reject or not reject HO comes from the study. E.g., the mean height of men in Town A is 2 cm taller than in Town B.
- Assess how sure of the results are. A key way is to repeat the study again and again: will one find the 2cm difference or is the first study a once-off.
- Convention: Accept up to a 5% probability that the observed result is by chance (aka significance level).
- Quantify the difference in the study and check whether the data meets the 5% significance level, via statistical tests.
Beginning Statistical Tests
- One needs to identify the outcome variable (or Y variable).
- Determine the number of groups in the independent variable (or X variable).
- If the outcome variable is numerical, one should check for normality.
Statistical Tests Flowchart
- Numerical Data ("Continuous," Comparing means):
- 1 group: One-sample t-test, Wilcoxon Signed-Rank test
- 2 groups: Paired t-test, Wilcoxon Signed Rank test (Paired); Two-sample t-test, Mann-Whitney U test (Independent)
- >2 groups: One-way ANOVA, Kruskal-Wallis test (Independent)
- Categorical Data ("Comparing Proportions"):
- 1 group: Binomial test
- 2 groups: McNemar's test (Paired); Chi-square test, Fisher's Exact test (Independent)
- >2 groups: Chi-square test (Independent)
Parametric vs Non-Parametric Tests
- Parametric tests: Applies only to continuous data and refers to data distribution.
- If the data distribution is "normal" then parametric tests can be used.
- If the data distribution is not normal, then non-parametric tests should be used.
- All tests for ordinal and nominal (categorical) data are non-parametric.
- The Fisher's exact test caters to small sample sizes where could expect small cells
Statistical Tests - Practice Questions
- To test the null hypothesis: “There is no difference between the mean age of having their first cigarette between boys and girls", a Two sample T-test is used.
- For the null hypothesis: “There is no difference between the prevalence of diabetes between Town A and Town B", one should use a Chi-Square test.
- For the null hypothesis: “There is no difference in the resting heart rate in the study subjects at 6am and 6pm”, one should use a Paired T-test.
- For the statement: “There is a difference in the incidence of cavities when they are 6 years of age...”, an ANOVA is most appropriate.
Week 6: 95% Confidence Interval
- 95% Confidence Interval: Range of values to estimate a population parameter, and used to check p-values, as well as statistical vs clinical significance
Learning Objectives
- Understand the concept of a 95% confidence interval (CI).
- Learn how confidence intervals (CI) help quantify random error.
- Explore factors that influence the width of a confidence interval (CI).
- Learn how to interpret confidence intervals (CIs) in research.
Confidence Intervals
- Range of values to estimate a population parameter.
- Provides an interval estimate around a sample statistic, and indicates how accurate the estimates are likely to be.
Understanding 95% CI
- The 'true value' will fall within each of the intervals; an assumption to be measured to see just how often this is truthful.
- In 100 samples, 95 of those had intervals that contained the true value.
- Random variants and measuring how often tests work affects the true value being derived.
Quantifying Random Error
- The width of the confidence interval signifies and determines uncertainty and the random error.
Factors Influencing CI Width
- Influenced by sample size, variability, and confidence level.
- Larger sample size reduces the width of a CI and the uncertainty.
- Small sample size increases the width of a CI.
- Low data variability reduces the width of the confidence interval.
- Large data variability increases the width of the interval.
Confidence Level Explained
- Lower confidence levels can be measured for various data points. For example, 90%, 95%, and 99%. However, 95% is most often used.
Applying 95% CIs: Examples
- A study has the mean systolic blood pressure to be 120 mmHg with a 95% confidence interval between 115 - 125 mmHg. Which means that 95% confident that the true population mean blood pressure lies between 115 and 125 mmHg.
Summarizing 95% Confidence Intervals
- 95% Means that if the study is repeated many times, it will produce a series of intervals.
- Those intervals will estimate the 'true population parameter' through all the numbers being generated.
- These Intervals will contain Width as influenced of sample size, Variability, and a confidence level.
- 95% of those generated will be a good representation.
Understanding P-Values
- P-value helps us find the 'meaning' of the data set.
- Pillars that represent: understanding, interpretation, understanding 95% CI's and p-values, the differences Between 95%CI's and p-values, and the significance among clinical studies.
P-Value Definitions
- The assumption that any differences is due to random error
- measure of extreme observed data, assuming the null is true
- P-value: Helps to determine if data is surprising if random is applied
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.