Podcast
Questions and Answers
In the evidence hierarchy, which type of study design generally provides stronger evidence?
In the evidence hierarchy, which type of study design generally provides stronger evidence?
- Cohort studies (correct)
- Case series / reports
- Background information / expert opinion
- Case-controlled studies
Which type of study design is characterized by collecting detailed information about a small group of patients with similar diagnoses or treatments?
Which type of study design is characterized by collecting detailed information about a small group of patients with similar diagnoses or treatments?
- Case reports/case series (correct)
- Randomized controlled trials
- Cohort studies
- Case-controlled studies
Which type of study assesses exposure and outcome at the same time, making it difficult to determine temporal relationships?
Which type of study assesses exposure and outcome at the same time, making it difficult to determine temporal relationships?
- Case-controlled study
- Randomized controlled trial
- Cross-sectional study (correct)
- Cohort study
In a case-control study, what is compared?
In a case-control study, what is compared?
Which study design involves measuring exposure in the present and following participants over time to observe outcomes?
Which study design involves measuring exposure in the present and following participants over time to observe outcomes?
Which study design is considered the ‘golden standard’ for inferring causality?
Which study design is considered the ‘golden standard’ for inferring causality?
In measuring disease frequency, defining 'what' refers to:
In measuring disease frequency, defining 'what' refers to:
When measuring the frequency of a disease, defining 'when' is important because:
When measuring the frequency of a disease, defining 'when' is important because:
Prevalence measures:
Prevalence measures:
Incidence measures:
Incidence measures:
What does a low incidence and high prevalence of a disease suggest?
What does a low incidence and high prevalence of a disease suggest?
Which of the following best describes 'nominal' data?
Which of the following best describes 'nominal' data?
What differentiates ordinal data from nominal data?
What differentiates ordinal data from nominal data?
Which of the following is an example of continuous numerical data?
Which of the following is an example of continuous numerical data?
What is the key characteristic of discrete numerical data?
What is the key characteristic of discrete numerical data?
For categorical data, which graphical display is most appropriate for showing the frequency of each category?
For categorical data, which graphical display is most appropriate for showing the frequency of each category?
Which graphical display is best suited to visualize the distribution of numerical data and identify its shape, center, and spread?
Which graphical display is best suited to visualize the distribution of numerical data and identify its shape, center, and spread?
Which graphical technique is suited to display the median, quartiles, and outliers in a dataset?
Which graphical technique is suited to display the median, quartiles, and outliers in a dataset?
What does a symmetrical data distribution imply?
What does a symmetrical data distribution imply?
Which is the appropriate way to numerically summarize categorical data?
Which is the appropriate way to numerically summarize categorical data?
A dataset on customer ages at an ice cream shop is skewed to the left. What does this indicate?
A dataset on customer ages at an ice cream shop is skewed to the left. What does this indicate?
In numerical summaries, what does ‘variability’ measure?
In numerical summaries, what does ‘variability’ measure?
Which measure of variability represents the middle 50% spread of the data?
Which measure of variability represents the middle 50% spread of the data?
Which statement is correct for two ice cream scoopers, where Scooper A is consistent, and Scooper B is less consistent with the ice cream amount?
Which statement is correct for two ice cream scoopers, where Scooper A is consistent, and Scooper B is less consistent with the ice cream amount?
What is a key difference between measures of central tendency like mean, median, and mode in symmetrical vs skewed distributions?
What is a key difference between measures of central tendency like mean, median, and mode in symmetrical vs skewed distributions?
What does systematic sampling require?
What does systematic sampling require?
What is k (sampling interval) equal to in systematic sampling?
What is k (sampling interval) equal to in systematic sampling?
What is a sampling frame?
What is a sampling frame?
What is a potential limitation of systematic sampling?
What is a potential limitation of systematic sampling?
If one needs to take strata into account during analysis, which sampling method is in use?
If one needs to take strata into account during analysis, which sampling method is in use?
Which of the following is an advantage of cluster sampling?
Which of the following is an advantage of cluster sampling?
What does the exclusion criteria in a study do?
What does the exclusion criteria in a study do?
Which sampling method involves selecting participants without a specific plan, relying on convenience or judgment?
Which sampling method involves selecting participants without a specific plan, relying on convenience or judgment?
In ____ sampling the researcher selects and enrolls whoever is readily available, until meeting the sample size quota and that is called __ sampling method.
In ____ sampling the researcher selects and enrolls whoever is readily available, until meeting the sample size quota and that is called __ sampling method.
A hospital enrolls patients for a study who selecting every third patient who came to the cardiology department. Which sampling strategy was used?
A hospital enrolls patients for a study who selecting every third patient who came to the cardiology department. Which sampling strategy was used?
Which study design is most suitable when detailed background information on a rare disease is needed?
Which study design is most suitable when detailed background information on a rare disease is needed?
What is a key limitation of cross-sectional studies when examining the relationship between exposure and outcome?
What is a key limitation of cross-sectional studies when examining the relationship between exposure and outcome?
In a cohort study assessing the effect of a new drug on disease progression, what is measured at the beginning of the study?
In a cohort study assessing the effect of a new drug on disease progression, what is measured at the beginning of the study?
In a study measuring health and disease, what is the purpose of defining 'who'?
In a study measuring health and disease, what is the purpose of defining 'who'?
What is the importance of specifying 'what' when measuring disease frequency?
What is the importance of specifying 'what' when measuring disease frequency?
Why is it important to specify the time period when measuring disease frequency?
Why is it important to specify the time period when measuring disease frequency?
Considering a population where a new, highly contagious disease is spreading rapidly, which measure will likely show a rapid increase first?
Considering a population where a new, highly contagious disease is spreading rapidly, which measure will likely show a rapid increase first?
If a new treatment dramatically extends the life of individuals with a previously fatal disease, while the incidence remains constant, what is the expected impact on prevalence?
If a new treatment dramatically extends the life of individuals with a previously fatal disease, while the incidence remains constant, what is the expected impact on prevalence?
What is the key characteristic of 'ordinal' data that distinguishes it from other types of categorical data?
What is the key characteristic of 'ordinal' data that distinguishes it from other types of categorical data?
If 'customer satisfaction' is measured on a scale of 'very dissatisfied', 'dissatisfied', 'neutral', 'satisfied', and 'very satisfied', what type of data is this?
If 'customer satisfaction' is measured on a scale of 'very dissatisfied', 'dissatisfied', 'neutral', 'satisfied', and 'very satisfied', what type of data is this?
A researcher wants to visualize the distribution of customer ages at an ice cream shop. Which graphical display is most appropriate?
A researcher wants to visualize the distribution of customer ages at an ice cream shop. Which graphical display is most appropriate?
You want to visualize the relationship between weather conditions (sunny, cloudy, rainy) and amount spent at an ice cream shop. Which graphical display is most appropriate?
You want to visualize the relationship between weather conditions (sunny, cloudy, rainy) and amount spent at an ice cream shop. Which graphical display is most appropriate?
In a right-skewed distribution of customer ages at an ice cream shop, how do the mean, median, and mode typically relate?
In a right-skewed distribution of customer ages at an ice cream shop, how do the mean, median, and mode typically relate?
In a left-skewed distribution, what does this generally indicate about where the data is clustered?
In a left-skewed distribution, what does this generally indicate about where the data is clustered?
Which numerical summary is most appropriate for describing the central tendency of ice cream flavor preferences?
Which numerical summary is most appropriate for describing the central tendency of ice cream flavor preferences?
What does a smaller standard deviation indicate about the amount of ice cream scooped by an ice cream scooper?
What does a smaller standard deviation indicate about the amount of ice cream scooped by an ice cream scooper?
What is the primary benefit of using stratified sampling over simple random sampling?
What is the primary benefit of using stratified sampling over simple random sampling?
Why is it essential to account for strata in the analysis when using non-proportional stratified sampling?
Why is it essential to account for strata in the analysis when using non-proportional stratified sampling?
When would cluster sampling be more appropriate than simple random sampling?
When would cluster sampling be more appropriate than simple random sampling?
What is a potential disadvantage of cluster sampling?
What is a potential disadvantage of cluster sampling?
What could be a disadvantage of systematic sampling in a real-world scenario?
What could be a disadvantage of systematic sampling in a real-world scenario?
Which of the following describes ‘sampling unit’?
Which of the following describes ‘sampling unit’?
A researcher is conducting a study on a specific population. What does the 'target population' refer to?
A researcher is conducting a study on a specific population. What does the 'target population' refer to?
What is the purpose of having inclusion criteria when designing a research study?
What is the purpose of having inclusion criteria when designing a research study?
When designing a study, what is the purpose of exclusion criteria?
When designing a study, what is the purpose of exclusion criteria?
What is a key difference between probability and non-probability sampling?
What is a key difference between probability and non-probability sampling?
Which of the following is a defining characteristic of convenience sampling?
Which of the following is a defining characteristic of convenience sampling?
What is a key feature of purposive sampling?
What is a key feature of purposive sampling?
In which type of sampling does the first participant refer to the second, to find and include new participants?
In which type of sampling does the first participant refer to the second, to find and include new participants?
What is a potential limitation that may occur across non-probability samples?
What is a potential limitation that may occur across non-probability samples?
In measuring health and disease, what encompasses the 'where' aspect?
In measuring health and disease, what encompasses the 'where' aspect?
What constitutes 'defining cases' when measuring disease frequency?
What constitutes 'defining cases' when measuring disease frequency?
When defining 'who' in measuring disease frequency, what is the primary focus?
When defining 'who' in measuring disease frequency, what is the primary focus?
How are systematic reviews positioned in the evidence hierarchy?
How are systematic reviews positioned in the evidence hierarchy?
What distinguishes critically-appraised topics (evidence syntheses) from systematic reviews?
What distinguishes critically-appraised topics (evidence syntheses) from systematic reviews?
A cohort study on the effect of exercise on heart disease followed a group of joggers and a group of non-joggers for 10 years. What study design is most similar?
A cohort study on the effect of exercise on heart disease followed a group of joggers and a group of non-joggers for 10 years. What study design is most similar?
A researcher aims to study the prevalence of obesity and its association with physical activity levels in a community at one point when people are at an ice cream shop. Which study design is appropriate?
A researcher aims to study the prevalence of obesity and its association with physical activity levels in a community at one point when people are at an ice cream shop. Which study design is appropriate?
How do case reports/case series contribute to medical knowledge?
How do case reports/case series contribute to medical knowledge?
What is a primary limitation of case reports and case series?
What is a primary limitation of case reports and case series?
How does defining ‘what' (defining cases) impact the measurement of disease frequency?
How does defining ‘what' (defining cases) impact the measurement of disease frequency?
What type of data is 'number of ice cream scoops'?
What type of data is 'number of ice cream scoops'?
Which graphical display is suited to display a symmetrical data distribution?
Which graphical display is suited to display a symmetrical data distribution?
In a study, the ages of participants have a symmetrical distribution. What measure of central tendency is most likely appropriate?
In a study, the ages of participants have a symmetrical distribution. What measure of central tendency is most likely appropriate?
What is a key element of simple random sampling?
What is a key element of simple random sampling?
What makes stratified sampling different than simple random sampling?
What makes stratified sampling different than simple random sampling?
A bakery is divided into a small, medium, and large size based on staff, then a proportionate amount of staff is selected to sample which measure of chocolate do they prefer. Which sampling method was used?
A bakery is divided into a small, medium, and large size based on staff, then a proportionate amount of staff is selected to sample which measure of chocolate do they prefer. Which sampling method was used?
What scenario is most suited for non-probability samples?
What scenario is most suited for non-probability samples?
What distinguishes probability from non-probability sampling?
What distinguishes probability from non-probability sampling?
You want to examine patients with a rare disease and find that no one has information, so the previous enrollee refers the next participant. Which sampling strategy is in use?
You want to examine patients with a rare disease and find that no one has information, so the previous enrollee refers the next participant. Which sampling strategy is in use?
What is the purpose of the inclusion criteria in a study protocol?
What is the purpose of the inclusion criteria in a study protocol?
Flashcards
Case Report/Case Series
Case Report/Case Series
Observational studies where detailed information is collected about an individual or a small group of patients with similar diagnoses.
Cross Sectional Studies
Cross Sectional Studies
A study design where exposure and outcome are measured at the same time.
Prevalence
Prevalence
The frequency or number in a defined population who have a specified disease or condition.
Incidence
Incidence
Signup and view all the flashcards
Case Control Studies
Case Control Studies
Signup and view all the flashcards
Cohort Studies
Cohort Studies
Signup and view all the flashcards
Randomized Clinical Trial
Randomized Clinical Trial
Signup and view all the flashcards
Categorical Data
Categorical Data
Signup and view all the flashcards
Numerical Data
Numerical Data
Signup and view all the flashcards
Nominal Data
Nominal Data
Signup and view all the flashcards
Ordinal Data
Ordinal Data
Signup and view all the flashcards
Continuous Data
Continuous Data
Signup and view all the flashcards
Discrete Data
Discrete Data
Signup and view all the flashcards
Non-Probability Sampling
Non-Probability Sampling
Signup and view all the flashcards
Simple Random Sampling
Simple Random Sampling
Signup and view all the flashcards
Systematic Sampling
Systematic Sampling
Signup and view all the flashcards
Stratified Sampling
Stratified Sampling
Signup and view all the flashcards
Cluster Sampling
Cluster Sampling
Signup and view all the flashcards
Study Notes
Introduction to Study Designs
- A study design is a specific method for collecting, analyzing, and interpreting data in research.
Learning Objectives
- The goal is to explain the hierarchy of evidence in study designs.
- The goal is to understand the differences between various study designs.
Hierarchy of Evidence in Study Designs
- Systematic reviews are at the top of the hierarchy because they give a complete and organized overview of a research question.
- Critically-appraised topics are evidence syntheses.
- Critically-appraised individual articles refer to article synopses.
- Randomized controlled trials (RCTs) are a type of study design.
- Cohort studies and Case-Controlled studies are types of study designs.
- Case series and reports are types of study designs.
- Background information and expert opinion are at the bottom of the hierarchy.
Case Report/Case Series
- Observational studies collect detailed information about an individual (case report) or a small group of patients (case series) who have similar diagnoses or treatments.
- These studies can be like detective stories, and they reveal what happened, not why.
- They do not compare patients, so cannot confirm the effect of an intervention due to unseen factors
- This means, there is risk of bias
Cross Sectional Studies
- Exposure and outcome are measured at the same time.
- It cannot be determined if the exposure caused the outcome, or the outcome caused the exposure
- As an example, obesity and diabetes can be assessed simultaneously.
Measuring Health and Disease
- When measuring health and disease, it is important we understand who gets the disease.
- What are risk factors.
- Where do people get the disease.
- When do people get the disease.
- Goal to intervene.
Measuring Disease Frequency
- Three things must be defined when measuring disease frequency
- WHAT: Defining cases means defining who is counted as a case and how that is determined
- WHO: The size of the population at risk determines the numbers being studied
- WHEN: The time point or period during which data are collected
Defining Cases
- Case definitions should consider the level of severity whether all cases, hospitalized cases, mild cases, and deaths are included.
- Whether all episodes or just the first time an illness is diagnosed are included.
- Outcomes are not always discrete for example Hypertension such as SBP ≥140 mmHg or DBP ≥90 mmHg.
- Seropositivity.
- Intensity of infection.
- A cut-off point may need to be defined to separate cases from non-cases.
Defining the "Population at Risk"
- The population from which the cases originate must be defined.
- It can be defined by several factors like geographically with a region, country, or refugee camp.
- Also can be any age, sex, or race / ethnicity specified.
- The 'population at risk' should only include people who would be counted as a case if they were to have the condition.
- Someone should only be counted as a case if they are from the population and become a case during the risk period.
Defining the Time Period
- Disease frequency measures relate to a particular point in time and relate to what happens over a particular time-period.
- It is important to specify the time-period and / or include it in the calculation for measures of disease frequency.
Two Measurements of Disease Frequency
- Prevalence (prevalent cases) frequency (or number) in a well-defined population who are specified with that disease or other condition within a defined period.
- Which includes all previous cases such as diabetes, hypertension, etc.
- Incidence (incident cases) frequency (or number) new events of the outcome from each study population based on what period is examined.
Summary for Measuring Health and Disease
- Measure the what the who and the when
- Incidence captures new cases within a specific time
- Prevalence captures all existing cases within a specified period
Case Control Studies
- Participants with outcome in a matched group, that do not have the same outcome.
- For instance, participants with, VS, participants without diabetes.
- There is some uncertainty on measurement, as measurement timing is often not defined
Cohort studies
- Exposures are measured in the present, with participants being monitored over time.
- Obesity is measured before diabetes presents.
Randomized Clinical Trial
- During the process, participants are randomly allocated to either an exposure or no exposure, these differences are then monitored.
- Gold standard to predict causality
Study Design Summary
Study design | When are the exposure and outcome measured? | Are participants matched on the outcome? | Are participants randomized to the exposure? |
---|---|---|---|
Cross sectional | Simultaneously | No | No |
Case-control study | Outcome measured after the exposure | Yes | No |
Cohort study | Outcome measured after the exposure | No | No |
RCT | Outcome measured after the exposure | No | Yes |
Data Types and Numerical Summaries
- A data type can be either categorical or numerical.
- Numerical data has continuous or discrete aspects
- Categorical data has nominal/ordinal aspects
Data Type: Categorical
- Nominal Categories with no inherent order.
- Customer's gender is male or female.
- Ice cream flavours of chocolate, vanilla, or strawberry.
- Ordinal: Ranked categories such as customer satisfaction, and intervals between aren't consistent Very dissatisfied or Dissatisfied or Neutral or Satisfied, Very satisfied.
Bar Charts
- For categorical data, this involves the ice cream flavours and there counts.
Numerical Summary: Categorical data
- This refers to the frequency which is counts or observations
- As well as the percentages, based on the proportion relative to the total number of observations.
Numerical Data Types
- Continuous numerical data can take any value within a range, and represent measurements.
- Customer's age or amount spent on ice cream.
- Discrete is countable with distinct values with distinct values
- Number of ice cream scoops can be counted.
Graphical Display: Numerical Data
- Displays for nominal scale are histograms and boxplots.
Graphical Display: Boxplot Components
- Box plots show several components, and the minimum values.
- The first quantile.
- Whiskers.
- Median.
- 3rd quantile
- Interquartile range.
- Outliers.
- Maximums.
Graphical Display: Data Distribution
- Data distributions can be either symmetric or skewed.
- A symmetric data set is balanced on each side.
- Imbalanced sets lead to data skew.
Skewed Data Example
- Right-skewed data can occur when age is clustered towards the younger ones, where there are fewer older customers.
- Left-skewed data can occur when age is clustered towards the more matured, where there are fewer younger customers.
- The interpretation of distributions, often are examples of customer experience.
- Normal distribution is a balanced mix of customers across age groups.
- Right skewed data show a larger proportion of younger customers.
- Left skewed data, show larger proportion of older customers.
Numerical Summary
- A "numerical summary" indicates something about central tendencies.
- What's the customer's average age? Is the main question.
- Or the most common ice cream flavour.
- Or the number of ice cream scoops bought?
- The mode is the most frequently occurring value.
- The median is the middle value.
- And the average is what is the mean
Data Distribution
- In numerical data there are different means, modes and medians depending on the distributions
- In a normal distributions the median, is about equal to the mode
- Otherwise it is either negative or positive
Numeric Variability
- Besides central tendencies, there is data on numerical variability
- Order same number of scoops, or different
- Range is defined to show variability
- This includes an inter quartile range
- or Standard deviation also show variability
- There is small area showing the range, and an inter quartile range that is in the middle
Standard Deviation
- Average distance from the means
- Consider scenario with scooper "A" and "B", that scoop ice cream, but are inconsistent
- "Scooper A" scoops, have a lower standard deviation (the range is smaller) compared to "B"
Numerical Summary
- In numerical data there is a variability portion and also a central tendency, which can be looked at through various means
Sampling Methods
- Different data collection and description, include describing the benefits, plus their limitations
Sampling Method
- Study designs, include different ways to define that and also sampling methods. Where there are
- Errors, that can be random or systematic.
- Population versus sample
Selection of Participants
- Simple random sample
- Cluster
- Probability methods
- Systematic
- Stratified
- Non probability methods
- Sampling methods
- Convenience
- Purposive
- Quota
- Snowball
Key Definitions
- Sampling Unit: A person, household, or school, etc.
- Target (or Reference ) Population: The population of interest.
- Sampling Design (or Scheme): The procedure for selecting the sampling units from the study population.
- Sampling frame: List of sampling units from which the sample is selected.
Non-Probability Samples
- Convenience are individuals that are chosen without any specific pattern, such as a clinic goer
- With Purposive samples, investigators use balanced criteria for select individuals
- With "snowball" the initial individual refers to the researcher to another
- Quatos select predetermined number of individuals
- They are easy and cheap, but not representative
Simple Random Sampling
- Equal probability: Everyone has the same chance of being selected
- Probability of selecting each unit is known
- Removes possibility of bias
- Drawn by listing population, and randomly selected
Simple Random: Benefits and Limitations
Sampling
Equal probability: Everyone has the same chance of being selected
- Benefits:*
- Representative samples are a good representation on your populations Limitations are sampling frame needs a list of people in your population
- It can easily and logically become difficult with a larger population.
Systematic Sampling
- Sampling units need to be in a sequence
- Determine k
- Population / sample size
- Choose Random and begin.
- Select unit at regular intervals
- EXAMPLE: A sample of 600 is wanted, out of 10000
- The intervals are, rounded to 17
- A random person is chosen
- 15 was chosen, and the 15th person is to be sampled
- The subsequent ppl, are to be sampled 32, 49
Systematic Sampling: Benefits and Limitations
- Works well when units are already in Sequence.
- Logistically sometimes easier than simply random sampling.
- Susceptible to errors based on season. and time variations.
- Are ppl being studied in an are more likely to visit, during lunch or later.
Stratified Sampling
- Separate the population based on set characteristics
- Such as sex, age, ethnicity, geographic region.
- The number of people selected per strata can be proportional or nonproportional
- If no proportional, then an adjustment statement must be input into the analysis.
Stratified Sampling Benefits and Limitations
- Benefits*
- Practical for large graphic areas
- You're interested in specific subgroups
- Can be more varied on sampling based population.
- Limitations*
- Be cautious of the strata statement
- The strata must be inputted in the formula analysis
Cluster Sampling
- Most populations exist in clusters.
- Often there is no information on all possible Units, but there is some knowledge on the unit(s)
- Cities, buildings exist.
Cluster Benefits and Limitations
- Useful when populations are widely dispersed*
- Naturally occurring units are more homogenous than those of the total populations*
- Smaller effective size sample will be smaller*
Inclusion and Exclusion Criteria
- Inclusion is used to identify the main group within a population to study
- Exclusion helps eliminate interference that creates issues
Summary Sampling
Sampling Method | Characteristics | Benefits | Limitations |
---|---|---|---|
Non-Probability Sampling | Selects samples based on non random criteria | Easy, quick cheap | Biased. Non representative. Findings cant be generalized |
Simple Random Sampling | Members with equal change. | Impartial. Straight forward. generalizable results. | Difficult potential and missing segment |
Startified Sampling | Divides the population | Represents accurate results which lower the simple and more simple | Requires great population knowledge, complex. More higher costs. |
Cluster Sampling | Divides population in clusters. Selects entire cluster randomy | Logical for large area, reduces costs and logistical issues | More simpler error the precision cluster may very interally |
Systematic Sampling | Select member of fix interval chosen by starting points | Suitable for large efficient and simple | Periodicity bias cannot be order truly be required. |
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.