Business Intelligence Chapter 2 Summary
10 Questions
0 Views

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What is a primary characteristic of structured data?

  • It may contain any combination of textual and web content.
  • It is only relevant for observational data.
  • It is easily accessible but lacks organization.
  • It is in a standardized format and has a well-defined structure. (correct)
  • Which of the following statements best defines data integrity?

  • The accuracy, completeness, consistency, and validity of data. (correct)
  • The ability to access data when needed.
  • The match between the actual and expected data.
  • The comprehensiveness of the data available for analysis.
  • What does data granularity refer to in the context of data readiness for analytics?

  • The level of detail at which variables and data are defined. (correct)
  • The ability to quickly access data from different sources.
  • The relevance of the data variables to the study.
  • The completeness of the available data.
  • Which of the following accurately describes unstructured data?

    <p>Any combination of textual, imagery, voice, and web content.</p> Signup and view all the answers

    What is one of the critical aspects related to data security according to the nature of data?

    <p>Data should only be utilized by authorized individuals.</p> Signup and view all the answers

    Which of the following best describes the primary difference between nominal and ordinal data?

    <p>Nominal data has no intrinsic order while ordinal data does.</p> Signup and view all the answers

    What is a defining feature of ratio data compared to interval data?

    <p>Ratio data provides meaningful comparisons between measured values like income, while interval data does not.</p> Signup and view all the answers

    Which of the following processes is NOT typically included in data preprocessing?

    <p>Data segmentation</p> Signup and view all the answers

    In the context of categorical data, what is the primary characteristic of ordinal variables?

    <p>Ordinal variables convey a rank order among categories.</p> Signup and view all the answers

    What problem does data reduction mainly address during data preprocessing?

    <p>Reducing the volume of data while retaining its integrity.</p> Signup and view all the answers

    Study Notes

    Business Intelligence, Analytics, and Data Science: A Managerial Perspective - Chapter 2 Summary

    • Data: A collection of facts, usually obtained from experiences, observations, or experiments. Data can be numbers, words, images, etc. It's the fundamental building block for information and knowledge. Data quality and integrity are critical for analysis. Data integrity includes accuracy, completeness, consistency, and validity of an organization's data.

    Nature of Data

    • Data types include structured, unstructured, and semi-structured.

      • Structured data: Standardized format, well-defined structure, follows a consistent order, and easily accessed. Examples include names, dates, addresses, credit card numbers, stock information, geolocation. Usually easily understood by machine language.
      • Unstructured data: Any combination of textual, imagery, voice, and web content—not organized in a predefined format.
      • Semi-structured data: Data that doesn't conform to a rigid schema but has certain organizational elements (like tags or labels). Examples include XML, JSON, HTML, log files.
    • Data is categorized by properties:

      • Categorical data: Represents different types or groups.
        • Nominal: Labels variables without quantitative value. (e.g., male/female, hair color, nationality).
        • Ordinal: Variables have an order, but the differences between the values are not meaningful. Examples include Likert scales.
      • Numerical data: Represents values that can be measured and ordered.
        • Interval: Concerned with both the order and difference between variables. (e.g., classifying people as teenager, youth, etc.)
        • Ratio: Deals with the order of variables, differences between them, and has a meaningful zero point. (e.g., income, height, weight)

    Metrics for Analytics-Ready Data

    • Data source reliability: Is the source dependable?
    • Data content accuracy: Does the data match the task/job?
    • Data accessibility: Is the data easy to access when needed?
    • Data security and data privacy: Who has access to the data and under what conditions?
    • Data richness : Is the data complete or near complete?
    • Data consistency: Is the data collected and combined accurately?
    • Data currency/timeliness: Is the data up-to-date, recent, and new?
    • Data validity: Does the data match the expected data?
    • Data relevancy: Are the variables in the data relevant to the study?
    • Data granularity: Are the variables and data defined at the lowest level of detail possible?

    Data Preprocessing

    • The real-world data is often not ready for analysis due to its dirty, misaligned, complex, inaccurate nature.

    • Data preprocessing techniques are needed to ready the data for analysis— including data consolidation, cleaning, transformation, and reduction.

    • Data reduction: Focuses on reducing the dimensionality (number of features) or the volume of data.

      • Variable selection: Selecting relevant variables from a large set.
      • Dimensional reduction: Reducing the number of variables.
      • Sampling: Selecting a representative subset of data.
      • Balancing/stratification: Reducing imbalances in data sets. (e.g., stratified sampling)
    • Discretization: Converting continuous data into discrete intervals or categories (e.g., age groups, income brackets).

    • Data normalization: Reorganizing data to remove unstructured and redundant information, standardizing the data format.

    • Creating attributes: Define attributes to store more information (like priority levels).

    Statistical Modeling for Business Analytics

    • Descriptive Analytics: Describes the data as it is (e.g., using mean, median, mode, histograms , skew, kurtosis)

    • Predictive Analytics: Focuses on understanding trends and patterns to predict future outcomes.

    • Prescriptive Analytics: Recommends actions based on predictions.

    • Statistics: A collection of mathematical techniques to characterize and interpret data

    • Descriptive statistics: Describing the data. (mean, median, mode, measures of dispersion, histograms, skew, kurtosis)

    • Arithmetic Mean

    • Median

    • Mode

    • Range

    • Variance

    • Standard Deviation

    • Mean Absolute Deviation (MAD)

    • Inferential statistics: Drawing inferences about the population based on sample data (e.g., regression modeling).

    Regression Modeling for Inferential Statistics

    • Regression: A technique in inferential statistics used to identify relationships between explanatory (input) and response (output) variables. Can used for hypothesis testing and forecasting.

    Business Reporting

    • Report: Information leading to decision-making, a communication artifact conveying specific information. Reports fulfill multiple functions, ensuring proper departmental functioning, providing information, providing analysis results, persuading action, and creating organizational memory.

    Types of Business Reports

    • Metric management reports: Use KPIs and metrics to manage business performance.
    • Dashboard-type reports: Graphical displays of performance indicators.
    • Balanced scorecard-type reports: Management system that improves business processes by analyzing internal and external outcomes.

    Data Visualization

    • The use of visual representations to explore, make sense of, and communicate data. Involves aggregation, summarization and contextualization of data. Related to information graphics scientific visualization and statistical graphics.
    • Using Charts, graphs, illustrations, etc.
    • Different types of graphs and charts include histograms, pie charts, scatter plots, line graphs.

    Performance Dashboards

    • Common in BPM software and BI platforms
    • Provide visual displays of important information on a single screen for quick understanding and drilling down for analysis.
    • Effective dashboards should be designed with clear presentation of data and exceptions enabling quick assimilation.
    • Include design elements like prioritizing alerts, clear use of visuals, information layers (monitoring, analysis, and management).

    Studying That Suits You

    Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

    Quiz Team

    Description

    Dive into Chapter 2 of 'Business Intelligence, Analytics, and Data Science: A Managerial Perspective,' where we explore the fundamental concepts of data. Learn about the different types of data, including structured, unstructured, and semi-structured data, and why data quality and integrity are essential for effective analysis.

    More Like This

    Data Quality Quiz
    3 questions

    Data Quality Quiz

    NobleSardonyx avatar
    NobleSardonyx
    Data Quality Management Quiz
    5 questions
    Dimensions of Data Quality Quiz
    3 questions
    Healthcare Data Quality and Accuracy
    10 questions
    Use Quizgecko on...
    Browser
    Browser