Demographics Survey Quiz
38 Questions
0 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is the purpose of adding Time to the Date dimension in the warehouse?

  • To analyze which promotions drive the most sales
  • To support tracking promotions and their effects on sales
  • To capture transaction information to the minute (correct)
  • To determine the size of the Date_Dim for 10 years of data
  • What is the main reason for creating a Promotion dimension in the warehouse?

  • To analyze the effectiveness of promotions on sales (correct)
  • To capture transaction information in the OLTP Sales Transaction table
  • To determine the size of the Date_Dim for 10 years of data
  • To support tracking Time of Day in the warehouse
  • What is an example of a causal dimension in the warehouse?

  • Date
  • Transaction ID
  • Promotion (correct)
  • Time of Day
  • What is a challenge when analyzing the effectiveness of promotions?

    <p>Guaranteeing that the promotion drove the sales</p> Signup and view all the answers

    What is the grain of the revised schema in the warehouse?

    <p>Store, Date, SKU, Transaction ID, Time, Promotion</p> Signup and view all the answers

    What is an attribute of the Promotion dimension?

    <p>Promotion Code</p> Signup and view all the answers

    What is a potential issue with promotions in the warehouse?

    <p>Cannibalization of sales</p> Signup and view all the answers

    Why is it important to create a 'No Promotion' type in the Promotion dimension?

    <p>To account for non-promotion sales</p> Signup and view all the answers

    What is the primary goal of the theme 'To grow sales across all market segments and product lines'?

    <p>To increase sales in a specific region</p> Signup and view all the answers

    What is the business process being modeled in the Cardinal Merch case study?

    <p>POS retail sales</p> Signup and view all the answers

    What does the 'grain' of the business process convey?

    <p>The level of detail associated with fact table measurements</p> Signup and view all the answers

    What is the reason for choosing the most granular or atomic information captured by the business process?

    <p>To improve data analysis</p> Signup and view all the answers

    What is the primary key in the DateDim table?

    <p>Date</p> Signup and view all the answers

    What is the purpose of declaring the grain of the business process?

    <p>To specify the level of detail associated with fact table measurements</p> Signup and view all the answers

    What is a critical success factor?

    <p>A group of data elements that are central to achieving a goal</p> Signup and view all the answers

    What is the foreign key in the SalesFact table that links to the ProductDim table?

    <p>SKU</p> Signup and view all the answers

    What is the purpose of the query in the context?

    <p>To answer a specific business question</p> Signup and view all the answers

    What is the primary key in the StoreDim table?

    <p>StoreNumber</p> Signup and view all the answers

    What type of data is stored in the SalesFact table?

    <p>Fact data</p> Signup and view all the answers

    What is the final step in the dimensional modeling process?

    <p>Identify the facts</p> Signup and view all the answers

    What is the relationship between the SalesFact table and the DateDim table?

    <p>One-to-many</p> Signup and view all the answers

    What is the purpose of the ProductDim table?

    <p>To store product information</p> Signup and view all the answers

    What is the approximate number of rows in the Product Dimension when accounting for different merchandising schemes across stores and historical products?

    <p>150K</p> Signup and view all the answers

    What is the natural key in the Product Dimension?

    <p>SKU Number</p> Signup and view all the answers

    What is the hierarchy of the Product Dimension?

    <p>SKU=&gt;Brand=&gt;Category=&gt;Department</p> Signup and view all the answers

    What is an example of a geographic hierarchy in the Store Dimension?

    <p>Store=&gt;District=&gt;Region=&gt;State</p> Signup and view all the answers

    What is the formula for Gross Margin?

    <p>Gross Margin = Gross Profit/Sales Dollar Amount</p> Signup and view all the answers

    What should we choose to store as a fact, Gross Profit or Gross Margin?

    <p>Gross Profit</p> Signup and view all the answers

    What is an example of an attribute in the Product Dimension?

    <p>Product Description</p> Signup and view all the answers

    What is the purpose of the Store Dimension?

    <p>To represent the primary geographic dimension</p> Signup and view all the answers

    What is the purpose of a junk dimension?

    <p>To restrict the total potential size of a dimension</p> Signup and view all the answers

    How many rows would a junk dimension have if it had 3 genders, 100 ages, 3 marital statuses, 3 return statuses, and 3 child under 15 statuses?

    <p>8100</p> Signup and view all the answers

    What is a consideration when creating a junk dimension?

    <p>The total potential size of the dimension</p> Signup and view all the answers

    Why might you want to remove some unrealistic combinations from a junk dimension?

    <p>To make the dimension more realistic</p> Signup and view all the answers

    What is an example of an unrealistic combination that might be removed from a junk dimension?

    <p>Customers under 10, married with children</p> Signup and view all the answers

    How many rows are typically appropriate for a junk dimension?

    <p>Up to 20,000 rows</p> Signup and view all the answers

    What is the benefit of using a junk dimension with a smaller number of rows?

    <p>It reduces the total potential size of the dimension</p> Signup and view all the answers

    What is a characteristic of a junk dimension?

    <p>It has a large number of rows</p> Signup and view all the answers

    Study Notes

    DIM_DEMOGRAPHICS

    • Pre-loaded with every combination of responses
    • Columns: DemoKey, Gender, Age, AgeGroup, MaritalStatus, ReturnStatus, Child15Status
    • Contains 8,100 rows (calculated by multiplying the number of potential values for each attribute)

    Junk Dimensions

    • Calculated by taking the total number of potential values for each attribute
    • Should be restricted to a total potential size of up to 20,000 rows
    • May want to remove unrealistic combinations, but this adds complexity to the load process

    Data Model

    • The Query: selects sum of QuantitySold and sum of NetPrice from SalesFact, filtered by DivisionName, CategoryName, and CharDate
    • Critical Success Factors: a central goal, e.g. to grow sales across all market segments and product lines
    • Business Questions: specific questions tied to data, e.g. how many men's statement shirts were sold in the Philippines during NCAA semifinals 2016 and what was the total net sales?

    Dimensional Modeling Process

    • Consists of four main steps: select the business process to model, declare the grain of the business process, choose the dimensions that apply to each fact table row, and identify the facts
    • Dimensional modeling is both science and art

    Cardinal Merch Case Study

    • Step 1: select the business process to model, e.g. POS retail sales
    • Step 2: declare the grain of the business process, specifying the level of detail associated with fact table measurements
    • Choose the most granular or atomic information captured by the business process

    Date Dimension

    • Example attributes: Date, CharDate, etc.
    • Grain conveys the level of detail associated with fact table measurements

    Product Dimension

    • Example attributes: SKU Number, UPC, Product Description, Brand Description, etc.
    • Product dimension will contain about 150K rows when accounting for different merchandising schemes across stores and historical products
    • Product hierarchy: SKU=>Brand=>Category=>Department

    Store Dimension

    • Example attributes: Store Number, Store Name, Store Street Address, etc.
    • Represents primary geographic dimension
    • Store hierarchies include: Store=>State, Store=>District=>Region, etc.

    Additivity

    • If CMI wants to look at gross margin, it's calculated by gross profit/sales dollar amount
    • Should choose to store gross profit or gross margin as a fact

    Simple Warehouse Data Model

    • Consists of DateDim, ProductDim, SalesFact, and StoreDim tables
    • Each table has primary keys and foreign keys to establish relationships

    Time of Day

    • If we add Time to the Date dimension, the size of the Date_Dim for 10 years of data to the minute would be significant
    • Alternatively, we can add it as a new dimension, changing the grain to Store, Date, SKU, Transaction ID, Time

    Promotion Dimension

    • Example attributes: Promotion Code, Promotion Name, Price Reduction Type, etc.
    • We can create a new dimension to support tracking promotions, with a promotion type 'No Promotion' to account for non-promotion sales
    • Revised schema includes the Promotion dimension, making the grain Store, Date, SKU, Transaction ID, Time, Promotion

    Studying That Suits You

    Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

    Quiz Team

    Description

    Test your knowledge of demographics with this quiz, covering gender, age, marital status, and more!

    More Like This

    EU27 Demographic Survey Analysis
    18 questions
    DEI Survey Study Notes
    8 questions
    Sectoral Representation Survey Analysis
    45 questions
    Socioeconomic Survey of Awwalpur Village
    24 questions
    Use Quizgecko on...
    Browser
    Browser