1: Biostats Chapter 1: Intro & Graphs

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Which of the following best describes what individuals are in a dataset?

  • The objects being described in a set of data. (correct)
  • The average measurement taken from the dataset.
  • The properties that characterize the dataset.
  • The count or proportion of each type of variable.

Which of the following is a method for charting categorical data?

  • Dot plot
  • Time plot
  • Bar graph (correct)
  • Histogram

Which statement accurately defines a quantitative variable?

  • A variable that includes categories or groups.
  • A variable that categorizes individuals into discrete groups.
  • A variable that can take on different numerical values. (correct)
  • A variable that describes a property using counts.

When classifying variables, what question should be asked to determine if it is quantitative or categorical?

<p>What characteristic describes each individual? (A)</p> Signup and view all the answers

What does the vertical axis of a histogram represent?

<p>The frequency or relative frequency of the data (C)</p> Signup and view all the answers

When creating histogram classes, which of the following should be avoided?

<p>Using classes with 0 or 1 counts (B)</p> Signup and view all the answers

What is the recommended starting number of classes for a histogram?

<p>5 to10 classes (C)</p> Signup and view all the answers

What is a common mistake made when developing histogram classes?

<p>Creating overly summarized classes that lose information (A)</p> Signup and view all the answers

Which of the following best describes the ideal histogram class characteristics?

<p>Balanced with sufficient information without excessive detail (D)</p> Signup and view all the answers

What should be observed when interpreting histograms?

<p>The overall pattern and deviations from it (C)</p> Signup and view all the answers

What is the role of classes in a histogram?

<p>To divide the range of values into equal-size intervals (B)</p> Signup and view all the answers

Flashcards

Individual

An object or subject being described in a dataset. It can be a person, object, animal, or anything you want to gather data about.

Variable

A characteristic or aspect of an individual that can be measured or categorized. It's what you're collecting information about.

Quantitative Variable

A variable that represents a quantity or number. You can calculate averages and other numerical summaries.

Categorical Variable

A variable that describes a quality or category, not a quantity. You can count how many individuals belong to each category.

Signup and view all the flashcards

Classifying Variables (1 of 2)

Deciding if a variable is quantitative (a number) or categorical (a description) by asking: What is being recorded? Is it a number or a statement?

Signup and view all the flashcards

What is the main focus of BMS 511 Biostats & Statistical Analysis?

This course covers fundamental statistical approaches relevant to biomedical and clinical research. It helps students understand how to formulate research questions, collect data, and apply appropriate statistical methods using real-world health and research scenarios.

Signup and view all the flashcards

What are the core topics addressed in BMS 511?

The course materials cover a wide range of topics including developing research questions, data collection techniques, and applying appropriate statistical methods. The emphasis is on using these skills in real-world health and research settings.

Signup and view all the flashcards

What is the relationship between statistics and medical research in this course?

The course emphasizes the link between statistics and medical research, showing how statistical methods are crucial for interpreting and drawing conclusions from medical studies.

Signup and view all the flashcards

What is the assessment structure in BMS 511?

A combination of assessments is used to evaluate student performance, with a strong emphasis on open-book individual exams and in-class, closed-book quizzes. The course also includes a group project and homework assignments.

Signup and view all the flashcards

What are the practical skills students will learn in BMS 511?

The course covers essential statistical techniques for analyzing and interpreting biomedical and clinical data. Students learn to select the most appropriate statistical methods based on the nature of their research questions and the data available.

Signup and view all the flashcards

Quantitative variable example: Age at death

A variable that is a numerical measurement and can be ordered or ranked.

Signup and view all the flashcards

Categorical variable example: Diagnosis

A variable that categorizes individuals into different groups based on a characteristic.

Signup and view all the flashcards

Quantitative Variable example: Number of mice with metastases

A variable that represents the number of times an event occurs.

Signup and view all the flashcards

Quantitative Variable example: Total number of metastases

A variable that represents a numerical measurement about a characteristic.

Signup and view all the flashcards

Histogram

A type of graph showing the distribution of a quantitative variable by dividing the data into equal-sized intervals called classes, with each class represented by a column whose height corresponds to the frequency (count or percentage) of data points within that interval.

Signup and view all the flashcards

Binning

A statistical procedure for dividing numerical data into equal-sized intervals or classes. The classes are represented along the horizontal axis of a histogram, determining the width of the bars.

Signup and view all the flashcards

Frequency

The number of values in a particular class of a histogram. It represents the frequency of observations within that interval.

Signup and view all the flashcards

Relative Frequency

The percentage of values in a particular class of a histogram, calculated by dividing the frequency of that class by the total number of values in the data set.

Signup and view all the flashcards

Skyscraper Graph

A histogram with very few bars, where the bars are tall and narrow, representing a very concentrated data distribution. This means most data points are grouped within just a few intervals.

Signup and view all the flashcards

Pancake Graph

A histogram with many wide and short bars, representing a very spread-out data distribution where many intervals have only a small number of values. This occurs when the class width is too small.

Signup and view all the flashcards

Choosing Histogram Classes

The process of selecting the appropriate number of classes for a histogram, balancing the need for sufficient detail with the need for clear visualization. Too many classes can obscure the overall pattern, while too few can lose valuable information.

Signup and view all the flashcards

Study Notes

Course Information

  • The course is BMS 511 Biostats & Statistical Analysis, Chapter 1: Intro & Displaying Data with Graphs
  • The instructor is Guang Xu, PhD, MPH
  • The instructor is an assistant professor of Biostatistics and Public Health at the College of Osteopathic Medicine, Marian University
  • The course provides a comprehensive introduction to statistical practices relevant to biomedical and clinical research
  • The course covers experimental questions and approaches to data collection, statistical analysis, basic statistical concepts, and appropriate statistical methodology
  • The course emphasizes the relationship between statistics and medical research
  • Examples from healthcare and research are used to illustrate statistical concepts and methodology.

Required Texts

  • The required textbook is "The practice of statistics in the life sciences" by Baldi, Brigitte, and David S. Moore, 4th edition, Macmillan Higher Education
  • Required supplementary textbook is "Epidemiology: With Student Consult Online Access," 5e, by Gordis, L., WB Saunders Co., Philadelphia (5th edition, 2013), ISBN: 978-1455737338
  • "Intuitive Biostatistics: a nonmathematical guide to statistical thinking" by Motulsky, Harvey, 4th edition, Oxford University Press
  • "Statistics in Medicine" by Robert H. Riffenburgh, 3rd edition, Elsevier

Course Grades

  • Exam I: 16%
  • Exam II: 16%
  • Exam III: 16%
  • Exam IV: 16%
  • In-class Quizzes: 16%
  • Homework (10 total): 10%

Group Project

  • The group project gives students a chance to apply the learned statistics and to think critically
  • The project necessitates an understanding of the story, methods, results, conclusions, future, and personal thoughts.

Office Hours

Learning Objectives

  • Determine and apply methods for depicting data distributions using graphs.
  • Define and utilize individuals and variables.
  • Understand the various types of data: categorical and quantitative
  • Understand different ways to display categorical data (e.g., bar graphs, pie charts)
  • Understand different ways to display quantitative data (e.g., histograms, dot plots)
  • Interpret histograms.
  • Graph time series using time plots.

Variable Types

  • Quantitative Variables: Variables that measure or assess a quantity, allowing for calculating an average for all individuals. Examples include age, blood pressure, leaf length.
  • Categorical Variables: Variables that describe a characteristic or quality of an individual, allowing for counting or calculating the proportion of individuals with that characteristic. Examples include gender, blood type, flower color

Classifying Variables

  • Identify the individuals being studied.
  • Define what is being recorded about those individuals.
  • Determine whether the recorded information is a number (quantitative) or a description (categorical).

Graphing Categorical Data

  • Bar graphs: Each category is represented by a bar. The bar's height represents the frequency or relative frequency of individuals in that category.
  • Pie charts: The whole pie represents all individuals. Each slice represents a category, and the size of the slice corresponds to the proportion of individuals in that category.

Graphing Quantitative Data

  • Histograms: A summary graph for a single numerical variable, useful for understanding variability, especially with large datasets.
  • Dot plots: A graph of raw data, useful to clarify variation patterns, primarily with small datasets.
  • Time plots: A graph employing time on the horizontal axis and a variable on the vertical axis, highlighting changes over time.

Making Histograms

  • Divide the range of the quantitative variable into equal-size intervals.
  • The vertical axis represents either frequency (counts) or relative frequency (percentages of total).
  • For each interval, create a column whose height corresponds to the count or percentage of data points in that interval.

Choosing Histogram Classes

  • The process of selecting classes is iterative (repeated)
  • Avoid too many classes with values of 0 or 1 (pancake graph).
  • Avoid overly summarized classes (skyscraper graph); data is no longer informative.
  • Begin by selecting 5 to 10 intervals, then adjust accordingly.

Interpreting Histograms

  • Shape: Examine the overall pattern and deviations. Look for patterns like unimodal (single peak), bimodal (two peaks), symmetric (similar shape on both sides), skewed (one tail longer).
  • Center: Estimate the approximate midpoint of the data.
  • Spread: Determine the range of values observed.
  • Outliers: Identify values that deviate significantly from the overall pattern and try to explain them.

Common Distribution Shapes

  • Symmetric: The graph of the left half of the data and right half looks identical.
  • Left-Skewed: The left side of the graph, containing extreme values, extends further than the right side.
  • Right-Skewed: The right side of the graph, containing extreme values, extends further than the left side.

Outliers

  • An outlier is a data point that falls outside the overall distribution pattern.
  • Look for outliers and try to explain them.
  • Note that the largest observation isn't always an outlier. It must not be consistent with the rest of the data.

Making Dot Plots

  • Create a single axis representing the variable's range.
  • Place a dot for each data point, positioned according to its value on the axis.
  • Stack dots when multiple data points have the same value.

Graphing Time Series

  • Time plots usually involve plotting a variable against time.
  • Look for overall trends and cyclical variability in the data.

Homework

  • Homework is available under the "Modules" section on Canvas.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser