Chapter 4: Displaying and Describing Categorical Data

SweetFaith avatar
SweetFaith
·
·
Download

Start Quiz

Study Flashcards

9 Questions

What is the first rule of data analysis according to the text?

Make a picture

What does a relative frequency table display?

Proportions or percentages of the total in each category

Pie charts display the whole group as a rectangle.

False

Bar charts display the distribution of a 1 variable.

categorical

Match the country with the percentage of respondents who said 'Yes' to using social networking sites: (Choose one country for each percentage)

Britain = 10.5% Egypt = 6.8% Germany = 15.6% Russia = 23.0% United States = 23.3%

What is the main cause of Simpson’s Paradox?

Inappropriately combining percentages of different groups

In the context of Simpson's Paradox, what can be misleading?

Overall success rate calculations

Who has a better success rate closing sales of paper according to Table 4.7? Peter has a success rate of 90% while Katrina has a success rate of ____%.

95

True or False: In Simpson's Paradox, the overall success rate calculation is always an accurate reflection of individual performances.

False

Study Notes

Displaying and Describing Categorical Data

The Three Rules of Data Analysis

  • Make a picture (visual representation) to reveal things not seen in tables of numbers
  • Show important features and patterns in the data
  • Provide an excellent means to report findings to others

Frequency Tables

  • Organize data by recording counts and category names
  • Categories are labeled in each row of the frequency table
  • May lump values together in an "Other" category if there are too many

Relative Frequency Tables

  • Display proportions or percentages of the total that lie in each category
  • Examples: Table 4.2 shows percentages of Loblaw stores in eastern Canada

Charts

Bar Charts

  • Display the distribution of a categorical variable, showing counts for each category
  • Observe the area principle: the area occupied by a part of the graph corresponds to the magnitude of the value it represents
  • May be drawn in vertical columns or horizontally
  • Examples: Figure 4.4 shows the number of Loblaw stores by province in eastern Canada

Pie Charts

  • Show the whole group as a circle ("pie") sliced into pieces
  • The size of each piece is proportional to the fraction of the whole in each category
  • Example: Figure 4.4 shows the number of Loblaw stores by province in eastern Canada

Before Making a Bar Chart or Pie Chart

  • Ensure everything adds up to 100% (e.g., 100% of Loblaw stores in Canada)
  • Check that categories don't overlap (e.g., no store is in both "Ontario" and "Other" categories)
  • Consider what you want to communicate about the data and choose the proper method

Exploring Two Categorical Variables: Contingency Tables

What a Contingency Table Shows

  • How the values of one variable are contingent on the value of another variable
  • Example: Table 4.3 shows responses to using social networking sites in five countries

Marginal Distributions

  • The marginal distribution of a variable is a frequency or relative frequency distribution of either the row or column variable
  • Row and column totals of the contingency table provide the marginal distributions

Conditional Distributions

  • Restrict variables in a distribution to show the distribution for just those cases that satisfy a specified condition
  • Example: Table 4.6 shows the conditional distribution of Social Networking conditioned on two values of Country

Segmented (or Stacked) Bar Charts

  • Treat each bar as the "whole" and divide it proportionally into segments corresponding to the percentage in each group
  • Example: Figure 4.6 shows the distribution of responses to the question for men and women

Simpson's Paradox

  • Results from inappropriately combining percentages of different groups
  • Example: Table 4.7 shows the success rates of Peter and Katrina in selling different products

What Can Go Wrong?

  • Violate the area principle in charts
  • Keep it honest: avoid confusing percentages and ensure the scale of data is consistent
  • Don't confuse percentages or forget to look at variables separately
  • Use enough individuals or cases in gathering data and providing results
  • Don't overstate your case or use unfair or inappropriate percentages
  • Be aware of Simpson's Paradox when combining percentages of different groups

Learn how to analyze and visualize categorical data using charts and tables, and understand the three rules of data analysis. Discover how to effectively use bar and pie charts to reveal important features in data.

Make Your Own Quizzes and Flashcards

Convert your notes into interactive study material.

Get started for free
Use Quizgecko on...
Browser
Browser