Introduction to Statistics

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

Which of the following processes is NOT typically included in the application of statistics?

  • Intuition (correct)
  • Analysis
  • Collection
  • Interpretation

A researcher aims to study the impact of a new drug on blood pressure levels in patients with hypertension. Which field of statistics is MOST applicable to this study?

  • Vital Statistics
  • Biostatistics (correct)
  • Inferential Statistics
  • Descriptive Statistics

A public health department wants to track the changes in infant mortality rates over the past decade to assess the effectiveness of prenatal care programs. Which branch of statistics is the MOST appropriate for this task?

  • Descriptive Statistics
  • Inferential Statistics
  • Vital Statistics (correct)
  • Predictive Analytics

When planning national health services, why is statistical analysis considered essential?

<p>To efficiently allocate resources based on health needs (C)</p> Signup and view all the answers

A researcher collects data on patients' ages, genders, and blood pressure readings to explore the risk factors associated with cardiovascular diseases. Which of the following BEST describes 'age' in this context?

<p>A Quantitative Variable (A)</p> Signup and view all the answers

Which statement correctly differentiates between constant and variable data types?

<p>Constants do not vary from one entity to another, while variables do. (B)</p> Signup and view all the answers

Which of the following variables is an example of a discrete quantitative variable?

<p>Number of Pregnancies (B)</p> Signup and view all the answers

Which of the following BEST exemplifies a qualitative ordinal variable?

<p>Stage of a Disease (D)</p> Signup and view all the answers

A researcher wants to collect data on the ethnicity, religious affiliation, and job title of participants in a study. Which type of qualitative variable applies to this data?

<p>Nominal (C)</p> Signup and view all the answers

In a study examining the relationship between salt intake and hypertension, which variable is MOST likely the dependent variable?

<p>Hypertension (C)</p> Signup and view all the answers

When examining the relationship between obesity and heart disease, a researcher notices that physical activity levels influence both obesity and the risk of heart disease. In this scenario, what would be the MOST accurate classification for physical activity?

<p>Confounding Variable (A)</p> Signup and view all the answers

In a study determining the impact of salt intake on hypertension where hypertension leads to CHD, what is the role of hypertension in the relationship between salt intake and CHD?

<p>Intermediate variable (B)</p> Signup and view all the answers

Which of the following is NOT typically considered a primary source for ongoing, real-time health data?

<p>Scientific Journals (C)</p> Signup and view all the answers

Which method of data collection would be BEST suited to gather in-depth perspectives and detailed experiences from participants?

<p>Interviews (B)</p> Signup and view all the answers

In the context of data presentation, what is the primary purpose of using tables?

<p>To arrange data in rows and columns for easy comparison (D)</p> Signup and view all the answers

Which of the following is a key characteristic that a good table should possess?

<p>It should be self-explanatory. (C)</p> Signup and view all the answers

When creating a table, what is the MOST important consideration for the title?

<p>It should be clear, concise, and answer what, when, and where. (B)</p> Signup and view all the answers

When constructing a simple frequency distribution table for qualitative data such as gender, how do you typically organize the table?

<p>List the variables in the left column and their frequencies in the right column. (A)</p> Signup and view all the answers

In constructing a simple frequency distribution table for a quantitative variable, such as patient age, what is the second critical step after determining the range of values?

<p>Selecting a reasonable number of classes (class interval) (B)</p> Signup and view all the answers

In analyzing the relationship between smoking and lung cancer, what is the MOST common type of table used to present the data?

<p>Contingency Table (A)</p> Signup and view all the answers

If a study categorizes different types of treatment (A, B, C) for a disease and examines the outcomes (cured, improved, died), which type of table is BEST suited for presenting this data?

<p>C x R contingency table (B)</p> Signup and view all the answers

A researcher wants to compare the distribution of mother's age between low-birth-weight babies and normal-weight babies. Which type of table is BEST suited for this analysis?

<p>Comparing Frequency Distribution Table (C)</p> Signup and view all the answers

A researcher aims to classify students by both their weight and height to study patterns in physical health. Which method of data categorization is MOST appropriate?

<p>Two-Way Table (C)</p> Signup and view all the answers

In which scenario is an arithmetic scale line graph MOST appropriate for data representation?

<p>Representing data that change over time (C)</p> Signup and view all the answers

Which of the following statements BEST describes the use of histograms?

<p>Histograms are used only when the table is a simple frequency distribution type. (B)</p> Signup and view all the answers

Which statement BEST describes a feature or characteristic unique to a frequency polygon?

<p>It connects the midpoints of class intervals with straight lines. (A)</p> Signup and view all the answers

When should a scatter diagram be used for data presentation?

<p>To determine the relationship between two variables (C)</p> Signup and view all the answers

What is a key characteristic of a simple bar chart?

<p>It is composed of columns of the same width. (A)</p> Signup and view all the answers

How does a grouped bar chart enhance the information conveyed by a simple bar chart?

<p>By presenting multiple variables within each category simultaneously (A)</p> Signup and view all the answers

Which of the following statments BEST describes a component bar chart?

<p>It presents bars divided into portions representing classifications. (A)</p> Signup and view all the answers

When is a pie chart MOST appropriately used for data visualization?

<p>When the values have a constant sum, such as 100% (B)</p> Signup and view all the answers

Which of the following BEST describes how segments are arranged in a pie chart?

<p>Clockwise from largest to smallest (C)</p> Signup and view all the answers

What is one of the key limitations of spot maps when analyzing disease risk?

<p>They do not account for population density. (C)</p> Signup and view all the answers

How do area maps enhance the information conveyed by spot maps?

<p>By showing presence or absence of an event over a geographic area (B)</p> Signup and view all the answers

In what way is a dot plot similar to a scatter diagram?

<p>They both plot one variable against another. (C)</p> Signup and view all the answers

What information is BEST conveyed using box plots?

<p>Central location (median) and dispersion (inter-quartile range) (D)</p> Signup and view all the answers

Which numerical presentation is considered a MOST important measure of dispersion used in statistics?

<p>Standard Deviation (A)</p> Signup and view all the answers

What is the purpose of numerical presentation in statistics?

<p>To summarize a collection of data into understandable numbers (D)</p> Signup and view all the answers

Which of the following is NOT considered a measure of central tendency?

<p>Range (C)</p> Signup and view all the answers

What is the definition of the arithmetic mean?

<p>The sum of all observations divided by the number of observations (D)</p> Signup and view all the answers

Which of the following is a DISADVANTAGE of using the arithmetic mean?

<p>It can be affected by extreme observations. (D)</p> Signup and view all the answers

How is the median determined when there is an even number of observations in a dataset?

<p>It is the average between the two middle numbers. (A)</p> Signup and view all the answers

When is the mode LEAST likely to be determinable?

<p>When all observations have the same frequency (A)</p> Signup and view all the answers

What does the range measure in a dataset?

<p>Dispersion (C)</p> Signup and view all the answers

What key information does the standard deviation provide about a dataset?

<p>The dispersion around the mean (B)</p> Signup and view all the answers

In a normal distribution curve, approximately what percentage of observations fall within one standard deviation of the mean?

<p>68.3% (A)</p> Signup and view all the answers

What is the MOST essential characteristic of a normal distribution curve?

<p>Symmetrical (B)</p> Signup and view all the answers

Flashcards

What is Statistics?

The science dealing with the collection, analysis, interpretation, presentation, and dissemination of data.

What is Biostatistics?

The application of statistics to biological sciences, medicine, and public health.

What are Vital Statistics?

Branch of biostatistics focused on vital human events like births, deaths, and marriages.

Uses of statistics?

Planning health services, conducting research, causality of disease, and clinical judgment.

Signup and view all the flashcards

What is Data?

Set of values or observations recorded on one or more studied units.

Signup and view all the flashcards

What is a Constant?

A data value that does not vary from person to person, such as the number of eyes.

Signup and view all the flashcards

What are Variables?

Characteristic that varies from person to person or over time and is the main concern of statistics.

Signup and view all the flashcards

What is a Quantitative Variable?

Variables expressed as quantities (continuous or discrete).

Signup and view all the flashcards

What is a Continuous Variable?

Numerical variable that can take any value within a range (e.g., height, weight).

Signup and view all the flashcards

What is a Discrete Variable?

Quantities obtained through enumeration (e.g., number of pregnancies).

Signup and view all the flashcards

What is a Qualitative Variable?

Variables expressed as categories or qualities (ordinal or nominal).

Signup and view all the flashcards

What are Ordinal Variables?

Qualitative data that can be arranged in a specific order (e.g., disease stage).

Signup and view all the flashcards

What are Nominal Variables?

Categories that cannot be meaningfully ordered (e.g., marital status).

Signup and view all the flashcards

What is a Confounding Variable?

A variable that influences both the independent and dependent variables.

Signup and view all the flashcards

What is an Intermediate Variable?

A variable that causally links the independent and dependent variables.

Signup and view all the flashcards

What is a Background Variable?

A variable which may impact the relationship between independent and dependent variables (e.g. age, sex, occupation).

Signup and view all the flashcards

What is a Historical and ongoing recording system?

Census, vital statistics, and hospital records are examples of it.

Signup and view all the flashcards

Tools for Data Collection?

Observation, questionnaires, interviews, and record analysis.

Signup and view all the flashcards

What is Tabular Presentation?

Arrangement of data in rows and columns.

Signup and view all the flashcards

What is a Simple Frequency Distribution Table?

Table for a single variable.

Signup and view all the flashcards

What is a Contingency Table?

A table showing association between two or more variables.

Signup and view all the flashcards

What is a Comparing Frequency Distribution Table?

Distributes two different groups across one variable.

Signup and view all the flashcards

What is a Two Way Table?

Classifies one group according to two variables.

Signup and view all the flashcards

What is Diagrammatic Presentation?

Graphs and charts.

Signup and view all the flashcards

What are Graphs?

Methods using coordinate systems to show quantitative data.

Signup and view all the flashcards

What are Charts?

Using only one coordinate.

Signup and view all the flashcards

What is an Arithmetic Scale Line Graph?

Suitable for representing data over a period of time.

Signup and view all the flashcards

What is a Histogram?

Represents simple frequency distributions with columns.

Signup and view all the flashcards

What is a Frequency Polygon?

Connects midpoints of class intervals with a straight line.

Signup and view all the flashcards

What is a Scatter Diagram?

Shows the relationship between two variables.

Signup and view all the flashcards

What is a Simple Bar Chart?

Composed of columns (bars) with the same width.

Signup and view all the flashcards

What is a Grouped Bar Chart?

Like bar chart, but variables are of each categories are joining.

Signup and view all the flashcards

What is a Component Bar Chart?

Bars are divided into colored or shaded portions.

Signup and view all the flashcards

What is a Pie Chart?

Used when values have a constant sum (100%).

Signup and view all the flashcards

What are Spot Maps?

Uses dots or symbols to show event locations.

Signup and view all the flashcards

What are Area Maps?

Shows event existence over geographic areas.

Signup and view all the flashcards

What are Dot Plots?

Similar to a scatter diagram, plots one variable against another.

Signup and view all the flashcards

What are Box Plots?

Graphs used to compare the distribition of non-continuous variables.

Signup and view all the flashcards

What is Numerical Presentation?

Summarizes data into understandable numbers.

Signup and view all the flashcards

What is the Arithmetic Mean?

Sum of observations divided by the number of observations.

Signup and view all the flashcards

What is the Median?

The value dividing a series of observations into two equal halves.

Signup and view all the flashcards

What is the Mode?

Observation with the highest frequency.

Signup and view all the flashcards

What is the Range?

Difference between the highest and lowest values.

Signup and view all the flashcards

What is the Standard Deviation?

Measures deviation in a dataset.

Signup and view all the flashcards

What are normal curves?

A symmetrical continuous distribuion of infinite range that is bell shaped.

Signup and view all the flashcards

Study Notes

Statistics

  • Statistics is the science dealing with the collection, analysis, interpretation, presentation, and dissemination of data.

Biostatistics

  • Biostatistics applies statistical methods to biological sciences, medicine, and public health.

Vital Statistics

  • Vital Statistics focuses on vital events such as births, deaths, marriages, and fertility rates.

Uses of Statistics

  • Statistics are useful for planning national health services.
  • Statistics are essentail for conducting medical and epidemiological research.
  • Statistics assist in determining the casuality of disease.
  • Statistical reasoning is important for clinical judgment.

Data

  • Data represents a set of values or observations recorded on one or more studied units.

Types of Data

  • Constants do not vary among individuals, making them not applicable for statistics.
  • Variables vary among individuals or over time, the main focus of statistics.

Quantitative Variables

  • Quantitative variables are expressed as quantities or numbers.

Quantitative Variable Subtypes

  • Continuous variables are obtained from measurement, which can take integral or frational amounts.
  • Discrete variables are obtained through enumeration, which can only take integral values.

Qualitative Variables

  • Qualitative variables cannot be expressed as quantities or numbers and take the form of qualities or names.
  • Qualitative variables can be categorized, but no enumerated.

Qualitative Variable Subtypes

  • Ordinal variables have categories that can be arranged in definite order.
  • Nominal variables have categories that cannot be arranged in a definite order.

Variable Transformation

  • Quantitative variables can be transformed into qualitative ordinal variables.

Other Variable Classifications

  • Independent variables are input/cause variables.
  • Dependent variables are outcome/effect variables.
  • Confounding variables are intervening variables.
  • Background variables include age, sex, and occupation.

Sources of Data

  • Historical and ongoing recording systems include census, vital statistics, hospital records and international publications.
  • Surveys: Used to collection of data
  • Experiments: Used to collection of data

Tools for Data Collection

  • Observation
  • Questionnaires
  • Interviews
  • Record analysis

Presentation of Statistical Data

  • Tabular: Data presented in tables
  • Diagrammatic: Data presented in diagrams
  • Numerical: Data presented numerically

Tabular Presentation

  • A table is a data set arranged in rows and columns.

Types of Tables

  • Simple frequency distribution table: for one variable
  • Contingency table: Used for two or three variables.
  • Comparing frequency distribution table
  • Two way table: Two way classification

Characteristics of a Good Table

  • Self-explanatory
  • Titles should be clear, concise, and answer what, when, and where.
  • Titles are separated from the table's body.
  • Rows and columns are labeled clearly.
  • Specific units of measure are provided.
  • Totals should be given.
  • Abbreviations/symbols are explained in footnotes.
  • Simple as possible; small are preferable to single larger ones.
  • The source is given in a footnote for non-original data.

Simple Frequency Distribution Table (Qualitative)

  • Qualitative variables
  • Table consists of 2 columns
  • Left column shows the variable, then categories, then total
  • Right column shows the frequency of observations for each category.
  • Percentage of each included

Simple Frequency Distribution Table (Quantitative)

  • Quantitative variables
  • The range of values are detemrined by difference between the smallest and largest
  • Select a reasonable amount of class intervals [5-20] based on magnitude and table requirements
  • Determine the width of the class interval
  • Range of values / class interval = width of each class

Contingency Tables

  • Contingency tables, also known as cross-tabulations, are used to show the relationship between two or more categorical variables.
  • Data is presented as totals.

Two-by-Two Table

  • Used to show a relationship between two variables, such as exposure and disease.

C x R Table

  • More than 2 columns by 2 or more rows.
  • Illustrate different types of treatments for a disease and their outcomes.

Comparing Frequency Distribution Tables

  • Compare two different groups based on one variable.

Two Way Table

  • Classifies one group according to two variables.

Diagrammatic Presentation

  • Methods that show quantitative data using coordinate systems.
  • X-axis: horizontal and displays the independent variable.
  • Y-axis: vertical and displays the dependent variable/frequency.
  • Charts: Use only one coordinate

Types of Graphs

  • Arithmetic scale line graph
  • Histogram
  • Frequency polygon
  • Scatter diagram

Arithmetic Scale Line Graph

  • Best for representing data with a time variable.

Histogram

  • Suitable for simple frequency distributions type.
  • No spaces between columns signify continuous quantitative data.

Frequency Polygon

  • Connects midpoints of class intervals with a straight line.
  • Best when the variable is a continuous quantitative type.

Scatter Diagram

  • Shows the relationship or association between two variables.

Types of Charts

  • Simple bar chart
  • Component bar chart
  • Grouped bar chart
  • Pie chart
  • Maps (geographic coordinate chart)
  • Dot plots and box plots

Simple Bar Chart

  • Columns have the same width.
  • Categories are shown on the X-axis and frequency on the Y-axis.
  • Best for qualitative data (2 types) and qualitative discrete type.

Grouped Bar Chart

  • A simple bar chart showing more than one variable.

Component Bar Chart

  • Bars are divided into portions.
  • Portions are colored or shaded to denote classification.

Pie Chart

  • Values should have a constant sum.
  • Suitable for variables with 3-10 categories.
  • Segments are arranged in order of magnitude, starting at 12 O'clock.

Maps (Geographic Coordinate Chart)

  • Used to show the location of events.

Spot Maps

  • Use dots/symbols to show where an event or disease occurs.
  • Show the geographical distribution of an event.
  • May use plot or symbols to indicate more than 1 case

Area Maps

  • Show whether an event is occurring over a geographic area.

Dot Plots

  • Similar to a scatter diagram, plotting one variable against another.
  • The x-axis represents discrete categories.

Box Plots

  • Used to compare distributions of non-continuous variables.
  • Illustrate data distributions as box and whiskers diagrams.
  • The box: represents the middle 50% of the data.
  • Whiskers: They extend from minimum to maximum values.
  • A line: Shows the median across the box.
  • The central location (median) and dispersion (inter-quartile range) can be shown

Numerical Presentation

  • Summarizes data into an easily understandable number.

Measures of Central Values

  • Arithmetic mean ± SD
  • Median
  • Mode
  • Midrange
  • Geometric mean

Measures of Dispersion

  • Range
  • Standard deviation
  • Variance
  • Average deviation
  • Co-efficient of variation

Arithmetic Mean (Average)

  • The sum of all observations divided by the number of observations.
  • Computed by summing values and dividing by the count.

Arithmetic Mean Advantages

  • All observations are taken into consideration.
  • Best average for quantitative data to be used in statistical analysis.

Arithmetic Mean Disadvantages

  • Not suitable for qualitative data.
  • Affected by extreme observations.

Median

  • The value that divides a series of observations into two equal halves when they are listed from lowest to highest.
  • If the number is odd: median = (n+1)/2
  • If the number is even: median = (n+1)/2, median = average between 3rd and 4th number = 7+8/2 = 7.5

Mode

  • The observation which has the highest frequency.
  • If there are 2 modes, choose smaller

Mode Advantages

  • Usable for all types of variables.
  • Not affected by extremes.

Mode Disadvantages

  • Cannot be determined if all observations have the same frequency.
  • 2 or more mode are sometimes obtained

Range

  • The difference between the highest and lowest values.
  • It takes the same unit of the original.

Range Disadvantages

  • Provides not information about data scatter.
  • Greatly affected by extreme values.

Standard Deviation

  • Definition: Squared root of the sum of the squared deviations of the values from the mean divided by the number of values minus one.

Standard Deviation Importance

  • Shows the dispersion around the mean.
  • Used to test the difference of 2 values.
  • In normal distribution curve
  • 68.3 % of the observation fall in the area between 1 SD.
  • 95.5 % of the observation fall in the area between 2 SD.
  • 99.7 % of the observation fall in the area between 3 SD.

Normal Curve

  • Symmetrical unimodal continuous distribution of infinite range.
  • It is described by its mean & SD of the population
  • Heights determined by the value of SD.
  • More concentration of observations at middle
  • It approaches infinity in either directions

Characteristics of Normal Curve

  • Bell shaped and Symmetrical
  • In a normally distributed population, mean = median = mode.

Uses of Normal Curve

  • Solves certain questions
  • If mean is known median and mode can be know
  • Estimates the population mean from the sample mean.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Biostatistics: Definition, Types, and Methods
40 questions
Introduction to Biostatistics
20 questions

Introduction to Biostatistics

LikableSydneyOperaHouse574 avatar
LikableSydneyOperaHouse574
Use Quizgecko on...
Browser
Browser