Data Management Overview

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

Why is high-quality data accessibility crucial for modern enterprises?

  • It reduces the need for employee training programs.
  • It enhances marketing campaigns through targeted advertising.
  • It ensures compliance with environmental regulations.
  • It enables effective decision-making and operational efficiency. (correct)

What is the primary aim of data management in an organization?

  • To minimize the volume of data stored.
  • To ensure all data is stored offline for security purposes.
  • To restrict data access to a select few employees.
  • To transform raw data into high-quality, usable information. (correct)

Which factor contributes most significantly to the difficulty and expense of modern data management?

  • The presence of strict government regulations on data handling.
  • The lack of skilled data entry personnel.
  • The limited availability of data storage hardware.
  • The exponential increase in data volume and its scattered distribution. (correct)

What is the role of Master Data Management (MDM) in an organization's data strategy?

<p>To integrate data from various sources into a unified view. (D)</p> Signup and view all the answers

What is the primary function of a Document Management System (DMS)?

<p>To manage, archive, and purge electronic documents. (B)</p> Signup and view all the answers

How does data synchronization in the healthcare sector contribute to improved supply chain efficiency?

<p>By facilitating easier and faster product sourcing through consistent information. (A)</p> Signup and view all the answers

In what way has the implementation of database management systems (DBMS) improved data handling compared to file-based systems?

<p>By solving data problems related to redundancy, inconsistency, and isolation. (A)</p> Signup and view all the answers

Which best describes the function of database management software (DBMS)?

<p>Software that controls the creation, maintenance, and use of databases. (A)</p> Signup and view all the answers

What is a key advantage of a distributed database over a centralized database?

<p>Enhanced security and reliability through data replication. (D)</p> Signup and view all the answers

What is a major drawback of using a centralized database system?

<p>Reduced scalability due to a single point of access. (D)</p> Signup and view all the answers

Which function of a Database Management System (DBMS) involves ensuring data accuracy and consistency?

<p>Data Integrity and Maintenance (C)</p> Signup and view all the answers

What is the purpose of data mining within a data warehouse?

<p>To identify patterns that can be used for predictions. (D)</p> Signup and view all the answers

Which type of database is best suited for storing detailed, transaction-specific data?

<p>Operational (B)</p> Signup and view all the answers

In the database development process, what is the main objective of 'Data Planning'?

<p>Developing a model of business processes. (D)</p> Signup and view all the answers

What is the focus of the 'Logical Design' phase in database development?

<p>Translating conceptual models into a data model of a DBMS. (D)</p> Signup and view all the answers

What distinguishes Online Transaction Processing (OLTP) from Online Analytics Processing (OLAP)?

<p>OLTP manages transaction data, designed to break down complex information into simpler data tables while OLAP is used for organizing large business databases (B)</p> Signup and view all the answers

What is a key characteristic of 'dirty data'?

<p>It lacks integrity, validation, and is incomplete or inaccurate. (C)</p> Signup and view all the answers

According to the principle of diminishing data value, how does the value of data change over time?

<p>It diminishes as it ages, especially after a few months. (C)</p> Signup and view all the answers

In the context of data management, what is meant by 'data silos'?

<p>Isolated data stores that are not easily accessible by other systems. (C)</p> Signup and view all the answers

What are the four essential elements of a Database Management System (DBMS)?

<p>Data structure, data modeling language, data query language, and transaction mechanisms. (A)</p> Signup and view all the answers

Which data structure is characterized by a hierarchical arrangement of records, resembling a tree with a root and sub-ordinates, indicating a one-to-many relationship?

<p>Hierarchical (B)</p> Signup and view all the answers

A data model that allows multiple records to be linked to the same parent, what kind of data language model?

<p>Network (A)</p> Signup and view all the answers

What is Structured Query Language (SQL) primarily used for?

<p>Querying and managing data in a database. (A)</p> Signup and view all the answers

In the process of moving data to a data warehouse, what does the transformation step involve?

<p>Standardizing formats, cleaning the data, and integrating it. (D)</p> Signup and view all the answers

What is the primary function of an active data warehouse (ADW)?

<p>Capturing transactions in real-time and integrating them into the warehouse. (D)</p> Signup and view all the answers

Which term refers to a single store of structured, semi-structured, and unstructured enterprise data stored in its natural or raw format?

<p>Data lake (C)</p> Signup and view all the answers

In the context of data governance, what does 'data governance' primarily ensure?

<p>The availability, usability, integrity, and security of data. (A)</p> Signup and view all the answers

What is the main goal of Master Data Management (MDM)?

<p>To integrate data from various sources to create a unified view. (D)</p> Signup and view all the answers

Which of the following refers to data that describes other data, providing information about data's properties?

<p>Metadata (A)</p> Signup and view all the answers

When organizations began using computers, they started with one application at a time, usually accounting, billing, and payroll. What did each app primarily function as?

<p>Stand-alone system (A)</p> Signup and view all the answers

What is the primary role of an Enterprise Data Warehouse (EDW)?

<p>Pulling together data from disparate sources for analysis and reporting (D)</p> Signup and view all the answers

Which process involves interpreting words and concepts within their context to extract meaningful information?

<p>Text Mining (B)</p> Signup and view all the answers

Which of the following describes the function of Electronic Records Management (EMR)?

<p>Managing electronic documents and image paper documents (C)</p> Signup and view all the answers

According to the stages of record lifecycle highlighted in materials, what is the first step in the record lifecycle?

<p>Create/Receive (C)</p> Signup and view all the answers

What is the primary purpose of 'e-discovery' in the context of electronic records management?

<p>To gather electronically stored information for legal or regulatory actions. (A)</p> Signup and view all the answers

What could be the result of not adhering to the e-discovery requested in the court of law?

<p>Charges of obstructing justice (D)</p> Signup and view all the answers

In the context of file management systems, how is data organized in a computer system?

<p>In a hierarchy consisting of bits, bytes, fileds, records, files, and databases. (C)</p> Signup and view all the answers

Which problem is often associated with using a file environment for storing and managing data?

<p>Data redundancy, inconsistency, and isolation (D)</p> Signup and view all the answers

Which file access stores sequentially stores record in a sequence?

<p>Sequential Access (B)</p> Signup and view all the answers

Flashcards

Goal of Data Management?

Providing the infrastructure and tools to transform raw data into usable information of the highest quality.

Master Data Management (MDM)

Processes to integrate data from various sources and enterprise apps to create a unified view.

Document Management System (DMS)

Hardware and software to manage, archive, and purge files and e-documents.

Green Computing

Efforts to conserve natural resources and reduce effects of computer usage on the environment.

Signup and view all the flashcards

Database Management System (DBMS)

Software that controls the creation, maintenance, and use of databases

Signup and view all the flashcards

Centralized Database

A database where all data is stored in a single location.

Signup and view all the flashcards

Distributed Database

A database where data is spread across multiple locations.

Signup and view all the flashcards

Data Filtering and Profiling

Inspection of data for errors, inconsistencies, redundancies, and incomplete information.

Signup and view all the flashcards

Data integrity and maintenance

Correcting, standardizing, and verifying the consistency and integrity of data.

Signup and view all the flashcards

Data Synchronization

Integrating, matching, or linking data from different sources.

Signup and view all the flashcards

Data Warehouse

An integrated collection of data.

Signup and view all the flashcards

Data Marts

Databases that hold subsets of data from the data warehouse.

Signup and view all the flashcards

Data Centers

Facilities containing mission-critical ISs and components that deliver data and IT services.

Signup and view all the flashcards

Data Mining

Searching data within a warehouse to identify useful patterns.

Signup and view all the flashcards

Data Dictionary

Catalog or directory containing metadata.

Signup and view all the flashcards

Metadata

Data about data.

Signup and view all the flashcards

Operational Database

Stores detailed data, e.g. customer or inventory databases.

Signup and view all the flashcards

Distributed Database

Replicates/distributes copies of databases to network servers.

Signup and view all the flashcards

External database

Accesses wealth of information online; newspapers, magazines, etc

Signup and view all the flashcards

Hypermedia database

Uses hyper text and hyper media documents stored on websites.

Signup and view all the flashcards

Data Planning

Administrators work with corporate to develop a business process of the enterprise.

Signup and view all the flashcards

Data Modeling

Process where the relationships between data elements are identified

Signup and view all the flashcards

Online Transaction Processing (OLTP)

Manage volatile transactional data, which are simplified into tables.

Signup and view all the flashcards

Online Analytics Processing (OLAP)

Organizing large business databases for cube analysis.

Signup and view all the flashcards

Dirty Data

Data that lacks integrity/validation.

Signup and view all the flashcards

Principle of Diminishing Data Value

Data's value diminishes as it ages.

Signup and view all the flashcards

Principle of 90/90 Data Use

90% of data is seldom accessed after 90 days.

Signup and view all the flashcards

Data Silos

Stand-alone data stores not accessible by other systems.

Signup and view all the flashcards

Data Modeling Language

Hierarchical, network, relational, and object-oriented.

Signup and view all the flashcards

Hierarchical Model

A hierarchical database forms a tree-like structure.

Signup and view all the flashcards

Network Model

A network model has many-to-many relationships, creating a web.

Signup and view all the flashcards

Relational Data Model

Relational data elements are stored in simple tables with rows and columns.

Signup and view all the flashcards

Multi-dimensional database

Database to organized data to express the relationships between data

Signup and view all the flashcards

Network Database

A data model that allows multiple records to be linked to the same parent.

Signup and view all the flashcards

Relational database model

Managing data using collection of data elements to organize groups

Signup and view all the flashcards

Structure Query Language (SQL)

The declarative language to simplify data access by that users want

Signup and view all the flashcards

Enterprise Data Warehouses (EDW)

Data warehouses are EDWs that pull data together from disparate databases.

Signup and view all the flashcards

Business Analytics

A function of applying technologies, algorithms and expertise in software

Signup and view all the flashcards

Text Mining

Broad category involving interpreted words and concepts in context

Signup and view all the flashcards

Business Records

Documentation of a business event, action, decision, or transaction.

Signup and view all the flashcards

Data Governance

The overall management of the availability, usability, integrity, and security of data.

Signup and view all the flashcards

Study Notes

Why Data Management Matters

  • Enterprises need high-quality, accessible data to be effective
  • Data should be complete and in context to be reliable
  • It is estimated that organizations with at least 1,000 knowledge workers lose approximately $5.7 million annually due to employees spending time reformatting data across different applications

Goal of Data Management

  • Data management focus on providing the infrastructure and tools necessary to convert raw data into high-quality, usable information

Data Management Challenges

  • Exponentially increasing volumes of data
  • The scattering of data throughout organizations
  • Creation and usage of data offline without quality control
  • Possible redundancy and staleness of data

Current Key Issues in Data Management

  • Master Data Management (MDM) involves integrating data from various sources and enterprise applications to create a unified view
  • Document Management System (DMS) offers hardware and software for file management, archiving, and purging electronic documents
  • Green computing is about conserving resources and minimizing the environmental impact of computer usage

Costs of Data Errors

  • Healthcare administrators waste 24%-30% of their time correcting data errors
  • Each incorrect transaction can cost $60-$80 to fix
  • Approximately 60% of invoices among supply chain partners contain errors, each costing $40-$400 to reconcile
  • Billions of dollars are lost annually in the healthcare supply chain due to data disconnects

Benefits of Data Synchronization

  • Faster and simpler product sourcing because of accurate and consistent item information
  • Reduced fraud and unauthorized purchasing
  • Reduced unnecessary stock
  • Lower prices due to purchase volume awareness
  • Increased patient safety

Stand-alone Systems

  • Systems in isolation lead to redundancy, inconsistency, and isolation in data

Database Management Systems (DBMS)

  • DBMS is a software for controlling the creation, maintenance, and usage of databases

Database Types

  • Centralized database: a database located, stored, and maintained in a single location
  • Distributed database: a database stored on multiple computers and locations

Distributed Databases: Advantages

  • Security
  • Reliability
  • Speed
  • Scalability

Distributed Databases: Disadvantages

  • Availability
  • Expense
  • Security

Centralized Databases: Advantages

  • Better control of data quality
  • Ease of use
  • Better IT security
  • Better data integrity

Centralized Databases: Disadvantages

  • Transmission delay
  • Security
  • Reliability
  • Scalability

Data Warehouses, Data Marts, and Data Centers

  • Data warehouses are integrated collections of data
  • Data marts hold subsets of data from a data warehouse
  • Data centers are facilities that house mission-critical ISs and components for data and IT service delivery

Current Database Types

  • Operational: Stores detailed data; for example, a customer database uses Microsoft Access database management software
  • Distributed: Replicates and distributes copies/parts of databases to network servers; for example, data on the World Wide Web, intranets, and extranets
  • External: Provides access to online information, such as search engines like Google and Yahoo, and news outlets and magazines
  • Hypermedia: Uses hypertext and hypermedia documents stored on websites, for example, graphics, images, photos, and video clips

Data Planning and Modeling

  • Database administrators and designers work with corporate and end-user management to develop an enterprise model
  • Data modeling is identifying the relationships between data elements
  • Logical elements are data elements and relationships among data
  • Physical - data are stored and accessed on the storage devices of a computer system

Online Transaction Processing (OLTP)

  • OLTP is designed for managing transaction data
  • Volatile and break down complex information into simpler data tables
  • Balances transaction-processing efficiency and query efficiency
  • Not optimized for data mining

Online Analytics Processing (OLAP)

  • OLAP includes organizing large business databases
  • Data is divided into one or more cubes that fit business needs

DBMSs (mid-2014)

  • Oracle's MySQL
  • Microsoft's SQL Server
  • PostgreSQL
  • IBM's DB2
  • Teradata Database

Trend Toward NoSQL Systems

  • Provides higher performance
  • Allows easy data distribution across different nodes
  • Enables scalability and fault tolerance
  • Offers greater flexibility
  • Easier to administrate

Dirty Data Explained

  • Lacks integrity/validation, reduces user trust
  • Incomplete, out of context, outdated, inaccurate, inaccessible, or overwhelming

Principle of Diminishing Data Value

  • Data value decreases as it ages
  • Blind data spots lasting 30 days or longer inhibit optimal performance
  • Global financial service institutions need real-time data to hit peak performance

Principle of 90/90 Data Use

  • Up to 90% of data is rarely accessed after 90 days, except for auditing
  • Approximately 90% of data loses most of its value after three months

Data Life Cycle

  • Internal and external data is combined into a central data warehouse
  • Personal expertise and judgement is used to manipulate the data for analysis
  • Results will be visualized
  • Can be used in SCM, CRM, e-commerce etc

Data Silos

  • Data silos are stand-alone data stores not accessible by other information systems or departments
  • Information requirements include: understandable, relevant, timely accurate and secure

Elements of a DBMS

  • Four important elements: data structure, data modeling language, data query language, and transaction mechanisms

Data Structure

  • A specialized format for organizing and storing data
  • General data structures include: files, record, table, tree

Data Hierarchy

  • Bit: the smallest unit
  • Byte: eight bits/one character
  • Field: one item within record
  • Example - last name
  • Record: Set of related fields Example – employee #, name, pay rate, etc.
  • File: Set of related records
  • Database: Entire collection of files

File Access

  • Sequential access: stores record in a sequence
  • Random access: stores record in random order
  • Indexed Sequential Access Method (ISAM): stores sequentially but also stores randomly

Limitations of the File Environment

  • File processing has become cumbersome, costly, and inflexible in supplying needed data
  • This makes data management more difficult
  • There is also duplication of data
  • This leads to data inconsistency and data isolation
  • There are also data integration issues
  • This leads to difficulty and time constraits

Structure Query Language (SQL)

  • The most widely-used mainstream declarative language that works with any database
  • Simplifies data access by requiring users to declare what data they want instead of telling the DBMS how to get it

Three Steps to move from Database to Data Warehouse or Data Mart

  • Extracted from designated databases
  • Transformed by standardizing formats, cleaning the data, and integrating them
  • Loaded into a data warehouse

Active Data Warehouse (ADW)

  • The ADW captures transactions as they change and integrates them into the warehouse with batch or scheduled cycle refreshes
  • Companies using ADW can interact with customers for better service, respond to business events in near real-time, and share up-to-date data among merchants, vendors, customers, and associates

Data Lakes

  • Data lakes provide a single store of structured, semi-structured, and unstructured enterprise data stored in its natural format

Data Governance

  • Data governance are rules over availability, usability, integrity, and security of data
  • Master data is a term used to describe business-critical information on customers, products, locations, employees and other things needed for operations and transactions
  • Master data management (MDM) integrates data from various sources and enterprise applications for a unified view

Enterprise data warehouses (EDW)

  • Data warehouses pull together data from disparate sources and databases
  • Data Warehouses are the primary source of cleansed data for analysis, reporting, and Business Intelligence (BI)
  • Data marts may subsidize warehouses high costs

Active Data Warehouse (ADW)

  • Real-time data warehousing and analytics
  • Transform by standardizing formats, cleaning the data, integration

Business Analytics

  • The entire function of applying technologies, algorithms, human expertise, and judgment

Data Mining

  • Software that enables users to analyze data from various dimensions or angles
  • Helps categorize them and find correlative patterns among fields in the data warehouse

Data and Text Mining Concepts

  • Text Mining: Consists of broad category involving interpreted words and concepts in context
  • Sentimental Analysis: Understanding the consumer intent

Electronic Records Management

  • Business Records is a documentation of a business event, action, decision, or transaction
  • Electronic Records Management (EMR) are tools, scanners, and databases that manage and archive electronic documents, and image paper documents
  • Index and store documents according to company policy or legal compliance
  • Success depends on partnering with key players

Business Value of E-Records Management

  • It is important for companies to abide by legal actions against them
  • E-discovery is the process of gathering electronically stored information for a trail, legal, investigation or administrative action
  • A company must produce what is requested and risk charges of obstructing justice

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser