Data Management & Database Systems

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson
Download our mobile app to listen on the go
Get App

Questions and Answers

Which of the following best describes the role of a Data Management system within an organization?

  • Overseeing the physical security of database servers.
  • Managing the flow of data from creation to obsolescence. (correct)
  • Creating marketing strategies based on data analysis.
  • Developing new software applications for data entry.

What is the primary role of a Data Warehouse, as opposed to a standard database?

  • To ensure data is stored in a completely unorganized manner.
  • To support complex analysis and decision-making. (correct)
  • To provide real-time transaction processing.
  • To manage day-to-day operational data.

Which of the following scenarios would benefit most from using a Data Mart instead of a Data Warehouse?

  • An organization wants to implement a new customer relationship management (CRM) system.
  • A large corporation needs to integrate data from all departments for enterprise-wide analysis.
  • A company needs to process high volumes of real-time transactions.
  • A small marketing team wants to analyze campaign performance data. (correct)

A financial transaction involves transferring funds from one account to another. According to ACID principles, what does 'Atomicity' ensure in this transaction?

<p>Either the entire transaction is completed, or no part of it is. (C)</p>
Signup and view all the answers

What is the key advantage of a distributed database architecture compared to a centralized database?

<p>Enhanced scalability and reliability. (B)</p>
Signup and view all the answers

Which operation is NOT typically associated with CRUD (Create, Read, Update, Delete) operations in a DBMS?

<p>Archiving (C)</p>
Signup and view all the answers

An e-commerce website needs to handle a large number of concurrent transactions while maintaining data integrity. Which type of database system is most suitable for this?

<p>Online transaction processing (OLTP) (A)</p>
Signup and view all the answers

What is the primary function of a database query language?

<p>To find and retrieve data from a database. (D)</p>
Signup and view all the answers

In a hierarchical database model, how are data elements related?

<p>Through a one-to-many parent-child relationship. (B)</p>
Signup and view all the answers

Which database model is most suitable for representing complex relationships where a record can have multiple parent records?

<p>Network data model (D)</p>
Signup and view all the answers

Why is the relational data model widely used in database management?

<p>It uses a simple table structure to organize data. (B)</p>
Signup and view all the answers

Which of the following is a key characteristic of a declarative language like SQL?

<p>It focuses on what result is desired, not how to achieve it. (D)</p>
Signup and view all the answers

According to the CAP theorem, what is a fundamental limitation in distributed databases?

<p>A distributed database can only satisfy at most two of the three guarantees i.e. Consistency, Availability and Partition tolerance. (D)</p>
Signup and view all the answers

What is the primary goal of Extract, Transform, Load (ETL) processes?

<p>To move, clean, and integrate data into a data warehouse. (C)</p>
Signup and view all the answers

Which of the following best describes the purpose of 'Change Data Capture' (CDC)?

<p>To automatically detect and apply changes made in data sources. (B)</p>
Signup and view all the answers

What is the primary benefit of data deduplication?

<p>Reduced storage costs and improved data consistency. (A)</p>
Signup and view all the answers

What is the main advantage of using a cloud data warehouse (CDW)?

<p>Lower initial investment and easier scalability. (A)</p>
Signup and view all the answers

What does data governance encompass?

<p>The overall mangement of data availability, integrity and security (D)</p>
Signup and view all the answers

Why is master data management (MDM) crucial for organizations?

<p>It creates a unified view of data across different systems. (B)</p>
Signup and view all the answers

What is the primary focus of 'customer-centric' approach in business?

<p>Providing a positive customer experience. (C)</p>
Signup and view all the answers

Flashcards

Data Management

Managing data flow from creation to deletion.

Database

A systematic collection of data sets or records.

Database Management System (DBMS)

Software for organized data storage and management, used for processing and querying.

Data Warehouse

A large dataset integrating data for complex analysis and decision-making.

Signup and view all the flashcards

Data Mart

A small-scale data warehouse supporting a specific function or department.

Signup and view all the flashcards

Transaction

A single unit of work that accesses and modifies database contents.

Signup and view all the flashcards

Atomicity

Ensures all modifications are completely made or discarded.

Signup and view all the flashcards

Consistency

Ensuring data remains consistent at the end of a transaction.

Signup and view all the flashcards

Isolation

A transaction operates separately from others.

Signup and view all the flashcards

Durability

A completed transaction is permanent and irreversible.

Signup and view all the flashcards

CRUD

Operations a DBMS performs: Create, Read, Update, Delete.

Signup and view all the flashcards

Online Transaction Processing (OLTP)

Database design breaking complex info into simple tables to balance processing efficiency.

Signup and view all the flashcards

Online Analytics Processing (OLAP)

Optimized data warehouses and data marts for complex data analysis.

Signup and view all the flashcards

Structured Query Language (SQL)

A standardized query language for accessing databases.

Signup and view all the flashcards

Not Only SQL (NoSQL)

A non-relational database query language.

Signup and view all the flashcards

Extract, Transform, Load (ETL)

Moving data into a data warehouse through extraction, transformation, and loading.

Signup and view all the flashcards

Change Data Capture (CDC)

Capturing and applying changes made to data sources across enterprise data stores.

Signup and view all the flashcards

Data Deduplication

Removes duplicate data and standardizes formats.

Signup and view all the flashcards

Cloud Data Warehouse (CDW)

Database delivered as a managed service in the public cloud.

Signup and view all the flashcards

Active Data Warehouse (ADW)

Captures transactions as they change and integrates them into the data warehouse.

Signup and view all the flashcards

Study Notes

  • Biz-tech ecosystem needs business to treat IT as an equal partner and IT needs to perform well.
  • Data management describes managing data from creation and initial storage to deletion.
  • Data management goals include mitigating risks/costs of regulations, meeting legal requirements, maintaining data accuracy/availability, certifying data consistency, and ensuring data conforms to organizational best practices.
  • A database is a collection of data sets/records stored systematically.
  • A database management system (DBMS) integrates with data collection systems to systematically store, access, and manage data optimized for fast transaction processing and querying.
  • Four elements of a DBMS are data structure, data modeling language, data query language, and transaction mechanisms.
  • Data structure is a specialized format for organizing/storing data and includes files, records, tables, and trees.
  • Data warehouse integrates large data sets from multiple databases/data silos for complex analysis, discovery, and decision-making.
  • Data mart is a small-scale data warehouse for a single function/department and is better for analyzing complex data for enterprises that cannot afford a large investment in data warehousing.
  • A transaction is a single logical unit of work that accesses/modifies database contents.
  • ACID test ensures data in a database meets atomicity, consistency, isolation, and durability criteria.
  • Atomicity means all database modifications are completely made or discarded.
  • Consistency refers to maintaining data integrity by leaving all data in a consistent state at the end of a transaction.
  • Isolation separates a transaction, making it independent of others.
  • Durability requires a completed transaction to become a permanent, irreversible part of the database.
  • Centralized database stores all data in one physical location with better control of data quality, ease of use, IT security, and data integrity. Transmission delay and security are cons, and it is not always reliable.
  • Distributed database stores portions of the database on multiple computers within a network, offering security, reliability, speed, and scalability; availability, expense, and security are cons.
  • Scalability is the system's ability to grow in size to handle increasing data and user loads.
  • CRUD operations are Create, Read, Update, and Delete.
  • Online transaction processing (OLTP) system is how DBMS is referred. The design breaks down complex information into simpler tables.
  • Online analytics processing (OLAP) optimizes data warehouses and data marts and analyzes complex data from a data warehouse.
  • Database query language retrieves data from a database and info system via user queries.
  • Hierarchical database has a tree-like structure based on one-to-many parent-child relationships.
  • Network data model allows multiple records to link to the same parent.
  • Relational data model manages data using tables to collect groups of elements into relations.
  • Object-oriented database management system models data as objects containing both data and relationships.
  • Blockchain is a distributed ledger represented by a chain of data blocks, it records transactions, establishes user identity and establishes contracts.
  • Hash is a function that takes input and outputs a unique fixed-size hexadecimal key for the data.
  • Query is an ad hoc (unplanned) user request for data.
  • Structured Query Language (SQL) accesses databases using a standardized query language.
  • Declarative language is a high-level programming language, allowing users to express what they want without detailing how to get it.
  • Not Only SQL (NoSQL) is a non-relational database query language without a predefined schema that is designed for corporations with large numbers of daily transactions.
  • CAP theorem guides NoSQL databases, which can only meet two of the three components: Consistency, Availability, and Partition.
  • Fault tolerance means no single failure causes any loss of service.
  • NewSQL combines SQL reliability with NoSQL speed and performance, offering better functionality, services, and ACID performance guarantees.
  • Transaction mechanism helps ensure integrity and concurrency and eliminate multiplicity via committing or rollback operations.
  • Committing occurs when permanent changes are performed within the transaction
  • Rollback occurs when any changes to data that have been performed within an uncommitted transaction are undone
  • Factors to consider when evaluating DBMS performance are: data latency, volatility, data consistency, query response time, predictability, and processing capabilities.
  • Data Latency is the time between data creation and availability for querying.
  • Immediate Consistency means new queries reflect updated data immediately, while eventual consistency means not all queries reflect changes uniformly, but it operates well at a low latency.
  • Data warehouses are the primary source of cleansed data for analysis, reporting, and business intelligence.
  • Data mart is a small-scale data warehouse supporting a single function or department.
  • Extract, Transform, Load (ETL) moves data into a data warehouse by extracting, standardizing and cleaning, and loading it.
  • Change data capture (CDC) processes capture changes made to data sources and applies the changes across the enterprise to keep stores synchronized.
  • Data deduplication removes duplicates and standardizes data formats.
  • Cloud data warehouse (CDW) is a database in a public cloud managed for analytics, scale, and ease of use.
  • Active Data warehouse (ADW) captures transactions when they change and integrates them into the warehouse for real-time data warehousing and analytics.
  • Enterprise data warehouses (EDW) integrate data from many databases across an enterprise.
  • Data lake is a single store of raw structured, semi-structured, and unstructured enterprise data in its natural format.
  • Data Governance is the management of data availability, usability, integrity, and security in an enterprise.
  • Master data is business-critical info on customers, products, vendors, locations, employees, etc., serving as a layer between transactional and analytical data.
  • Master data management (MDM) integrates data from sources to create a unified view.
  • Data Entity is anything real/abstract a company wants to collect/store data about.
  • Master file stores records describing main database entities like customers or products and is usually periodically updated.
  • Customer-centric focuses on providing a positive customer experience to drive profit and competitive advantage.
  • Information management is the process of collecting, storing, managing, and maintaining accurate, timely, reliable, valid, available, unique, and relevant data.
  • Data life cycle begins with database storage, then data is loaded into a data warehouse for analysis, and then is used by knowledge workers or in business apps.
  • Principle of diminishing data value states that the value of data decreases as it ages.
  • Principle of 90/90 data use says that 90% of stored data is seldom accessed after 90 days.
  • Data silos are stand-alone data stores inaccessible by other ISs or departments.
  • Dirty data is of such poor quality that it cannot be trusted for decisions.
  • Cost of poor quality data = Lost Business + Cost to Prevent Errors + Cost to Correct Errors.
  • Electronic document is any actively worked on paper or electronic form, file, email, contract, or lease.
  • Electronic record is any finalized document that is not altered.
  • Electronic content includes documents, records, and unstructured data like audio, video, and multimedia files.
  • Electronic document management electronically stores, maintains, and tracks electronic documents and images.
  • Electronic records management system (ERMS) electronically manages record creation and maintenance using classification schemes, retention/disposal schedules, and access controls.
  • Enterprise content management captures, stores, retrieves, and manages digital assets including documents, records, emails, and multimedia.
  • Enterprise content management system (ECMS) captures, preserves, and manages structured and unstructured assets and secures them in compliance with policies.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Database Management Systems (DBMS)
17 questions
Database Management Systems (DBMS)
40 questions

Database Management Systems (DBMS)

AppropriateGyrolite7060 avatar
AppropriateGyrolite7060
Data Management and Database Systems
44 questions
Use Quizgecko on...
Browser
Browser