Business Intelligence Chapter 2: Descriptive Analytics

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

Which of the following types of data is represented by subjects such as marital status and hair color?

  • Interval data
  • Ordinary data
  • Nominal data (correct)
  • Ratio data

What aspect is least important in ordinal data compared to the order of variables?

  • Categories of the variables
  • Types of variables
  • Differences between the values (correct)
  • Examples of the data

Which data preprocessing step involves removing inaccuracies from the dataset?

  • Data transformation
  • Data cleaning (correct)
  • Data reduction
  • Data consolidation

In the context of numerical data, which of the following is an example of ratio data?

<p>Income levels (C)</p> Signup and view all the answers

Which aspect of real-world data is highlighted as a major challenge before it can be used for analytics?

<p>Data is frequently inaccurate and complex (B)</p> Signup and view all the answers

What is the primary characteristic of structured data?

<p>It is highly organized and easily understood by machine language. (A)</p> Signup and view all the answers

Which of the following best describes data integrity?

<p>The accuracy, completeness, consistency, and validity of data. (D)</p> Signup and view all the answers

What does data granularity refer to?

<p>The level of detail at which variables and data are defined. (D)</p> Signup and view all the answers

Which of the following factors does NOT contribute to the quality of analytics-ready data?

<p>Data diversity across different formats (A)</p> Signup and view all the answers

In the context of data metrics, what does data currency refer to?

<p>The currentness and timeliness of the data. (B)</p> Signup and view all the answers

Flashcards

Data

Facts collected through experiences, observations, or experiments. It's the foundation for information and knowledge. It can be numbers, words, images, etc.

Structured Data

Data that follows a specific structure, is organized, and easily understood by computers. Examples include names, dates, addresses, and financial information.

Unstructured Data

Data that lacks a defined structure and is difficult for computers to process. Examples include images, videos, audio recordings, and social media posts.

Data Integrity

The accuracy, completeness, consistency, and validity of an organization's data. It's crucial for reliable analytics.

Signup and view all the flashcards

Analytics-Ready Data

Data that is reliable, accurate, accessible, secure, and relevant to the task at hand. It's essential for effective analytics.

Signup and view all the flashcards

Categorical Data

Data types which can be sorted into groups, like gender or age categories.

Signup and view all the flashcards

Numerical Data

Data representing values that can be measured and arranged in a logical order, like height or weight.

Signup and view all the flashcards

Ordinal Data

A type of numerical data where the order matters. Examples include a Likert Scale for measuring satisfaction.

Signup and view all the flashcards

Interval Data

A type of numerical data where both the order and difference between values are important. Examples include temperature scales.

Signup and view all the flashcards

Ratio Data

A type of numerical data where the order, difference, and ratio between values are important. Examples include weight or income.

Signup and view all the flashcards

Study Notes

Business Intelligence, Analytics, and Data Science: A Managerial Perspective - Chapter 2

  • Descriptive Analytics I: This chapter covers the nature of data, statistical modeling, and visualization.

  • Data: A collection of facts; usually obtained from experiences, observations, or experiments. Data can include numbers, words, images, etc.

  • Data Quality and Integrity: Critical to analytics. Data integrity includes accuracy, completeness, consistency, and validity of an organization's data.

  • Data Types:

    • Structured Data: Standardized format, well-defined structure, complies with a data model, and follows a persistent order, easily accessed. Examples: names, dates, addresses, credit card numbers, stock info, geolocation.
    • Unstructured Data: Any combination of textual, imagery, voice, and web content.
    • Semi-structured Data: Extensible markup language (XML), hypertext markup language (HTML), JavaScript Object Notation (JSON), log files, etc.
  • Metrics for Analytics Ready Data:

    • Data source reliability
    • Data content accuracy
    • Data accessibility
    • Data security and data privacy
    • Data richness
    • Data consistency
    • Data currency/data timeliness
    • Data validity and relevance
    • Data granularity
  • Categorical Variables:

    • Nominal Data: Used to label variables without quantitative value. Examples: gender, hair color, nationality, marital status.
    • Ordinal Data: Variables ordered based on their relative position. The differences between the values aren't necessarily consistent or meaningful (e.g., Likert scales: very likely, likely, neutral, unlikely, very unlikely).
  • Numerical Variables:

    • Interval Data: Variables with order and difference. Examples: Classification of people (teenager, youth, middle-age, etc.)
    • Ratio Data: Order and difference between variables, with a true zero point. Examples: income, height, weight, sales, etc.
  • Data Preprocessing: The process of preparing data for analytics. This includes data consolidation, data cleaning, data transformation, and data reduction.

  • Data Reduction Techniques:

    • Variables: Dimensional reduction, variable selection
    • Cases/samples: Sampling, balancing/stratification, discretization
  • Dimensionality Reduction: A process to reduce the number of dimensions (features) in a dataset while preserving the most important properties. This is useful for large, high-resolution images.

  • Discretization: Converting continuous data into discrete intervals or categories. Typically applied to large datasets to simplify analysis or model building (e.g., age groups, income brackets).

  • Data Normalization Reorganizing data to remove unstructured or redundant information, enabling a standardized data format. This creates an organized data system.

  • Data Visualization Using visual representations for exploring, understanding and communicating data. Often incorporates charts, graphs, illustrations.

  • Statistical Modeling for Business Analytics: Uses statistics to analyze data. Includes:

    • Descriptive Statistics: Describing data (e.g., mean, median, mode, range, variance).
    • Inferential Statistics: Drawing conclusions about a population based on a sample (e.g., hypothesis testing, regression, various graphs/charts).
  • Regression Modeling: A statistical technique to characterize the relationship between explanatory and response variables. Used for hypothesis testing and forecasting.

  • Business Reporting:

    • Report: Information to drive decisions, acting as a communication artifact.
    • Report Composition: Includes sources, format (text, table, graphs), distribution channels (in-print, email, portal).
    • Dashboard Design Considerations: The design should present data in a clear and concise manner so information is easily accessible, enabling rapid assimilation.
  • Performance Dashboards: Used in Business Process Management (BPM) and Business Intelligence (BI) platforms. Provide visual displays of information, combined on a single screen for quick analysis. These dashboards facilitate drilling into data and exploration.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

More Like This

Use Quizgecko on...
Browser
Browser