Podcast
Questions and Answers
Which element is part of technical metadata?
Which element is part of technical metadata?
- Source of the data
- Business definition
- Meaning
- Mapping (correct)
What is NOT a characteristic of business metadata?
What is NOT a characteristic of business metadata?
- Size of the data (correct)
- Source of the data
- Business definition
- Description from a business perspective
In the context of data quality, which activity is part of monitoring data quality?
In the context of data quality, which activity is part of monitoring data quality?
- Planning quality assessments
- Defining data integrity measures
- Auditing existing data processes (correct)
- Implementing quality measures
Which phase is NOT included in the lifecycle of Business Insights & Analytics?
Which phase is NOT included in the lifecycle of Business Insights & Analytics?
Which component is essential for improving data quality?
Which component is essential for improving data quality?
What does the phase of 'Sourcing' in the data life cycle primarily involve?
What does the phase of 'Sourcing' in the data life cycle primarily involve?
Which of the following is NOT an activity associated with the 'Protection & Usage' phase?
Which of the following is NOT an activity associated with the 'Protection & Usage' phase?
What is the primary purpose of the 'Archiving' phase in the data life cycle?
What is the primary purpose of the 'Archiving' phase in the data life cycle?
In the context of the data life cycle, what is meant by 'Sharing'?
In the context of the data life cycle, what is meant by 'Sharing'?
Which of the following correctly represents a characteristic of 'good data'?
Which of the following correctly represents a characteristic of 'good data'?
Which phase in the data life cycle is primarily concerned with the proper disposal of data?
Which phase in the data life cycle is primarily concerned with the proper disposal of data?
Which of the following is an essential skill for managing data throughout its lifecycle?
Which of the following is an essential skill for managing data throughout its lifecycle?
What is the primary goal of data governance as defined in DMBOK?
What is the primary goal of data governance as defined in DMBOK?
Which area of the DMBOK framework involves the processes of planning and maintaining data?
Which area of the DMBOK framework involves the processes of planning and maintaining data?
What is a key process involved in the 'Storage & Protection' phase of the data life cycle?
What is a key process involved in the 'Storage & Protection' phase of the data life cycle?
Which of the following best describes the DMBOK framework?
Which of the following best describes the DMBOK framework?
What is one requirement for the naming of XML element names?
What is one requirement for the naming of XML element names?
Which statement is true regarding XML attribute values?
Which statement is true regarding XML attribute values?
What does proper nesting in XML refer to?
What does proper nesting in XML refer to?
What does an XML processor do with extra white space in the file?
What does an XML processor do with extra white space in the file?
Why is indentation recommended for XML documents?
Why is indentation recommended for XML documents?
What is a prohibited practice when naming XML elements?
What is a prohibited practice when naming XML elements?
Which special character in XML is replaced by " & lt; "?
Which special character in XML is replaced by " & lt; "?
What character type is not recommended in XML element names?
What character type is not recommended in XML element names?
Which of the following is a valid comment format in XML? (Ignore quotation marks.)
Which of the following is a valid comment format in XML? (Ignore quotation marks.)
What should be included at the beginning of each XML file?
What should be included at the beginning of each XML file?
In XML, which of the following is a valid way to represent a date?
In XML, which of the following is a valid way to represent a date?
What does & represent in XML?
What does & represent in XML?
Which statement about white space in XML is accurate?
Which statement about white space in XML is accurate?
Which of these tag names can be used in XML?
Which of these tag names can be used in XML?
What is the primary difference between XML and JSON presentation style?
What is the primary difference between XML and JSON presentation style?
How are special characters represented in XML?
How are special characters represented in XML?
What is the first stage in the data life cycle?
What is the first stage in the data life cycle?
Which of the following activities is part of the Protection & Usage stage?
Which of the following activities is part of the Protection & Usage stage?
What does archiving typically involve?
What does archiving typically involve?
Why do enterprises engage in data destruction?
Why do enterprises engage in data destruction?
In which part of the data life cycle would you find data transformation and synthesis?
In which part of the data life cycle would you find data transformation and synthesis?
Which of the following is NOT a function of the Sharing stage?
Which of the following is NOT a function of the Sharing stage?
Which step comes after the Archiving process in the data life cycle?
Which step comes after the Archiving process in the data life cycle?
What is involved in the Sourcing stage of the data life cycle?
What is involved in the Sourcing stage of the data life cycle?
Flashcards
Sourcing (Data Life Cycle)
Sourcing (Data Life Cycle)
The process of acquiring data from various sources, often referred to as data capture or acquisition.
Storage & Preparation (Data Life Cycle)
Storage & Preparation (Data Life Cycle)
Involves storing, maintaining, and preparing data for use. This stage ensures that the data is organized, cleaned, and ready for analysis.
Protection & Usage (Data Life Cycle)
Protection & Usage (Data Life Cycle)
This involves securely using data for organizational tasks while protecting it from unauthorized access or misuse.
Sharing (Data Life Cycle)
Sharing (Data Life Cycle)
Signup and view all the flashcards
Archiving (Data Life Cycle)
Archiving (Data Life Cycle)
Signup and view all the flashcards
Business Metadata
Business Metadata
Signup and view all the flashcards
Business Definition
Business Definition
Signup and view all the flashcards
Technical Metadata
Technical Metadata
Signup and view all the flashcards
Data Quality
Data Quality
Signup and view all the flashcards
Data Governance
Data Governance
Signup and view all the flashcards
Data Destruction
Data Destruction
Signup and view all the flashcards
Sourcing
Sourcing
Signup and view all the flashcards
Storage & Preparation
Storage & Preparation
Signup and view all the flashcards
Protection & Usage
Protection & Usage
Signup and view all the flashcards
Sharing
Sharing
Signup and view all the flashcards
Archiving
Archiving
Signup and view all the flashcards
Data Publication
Data Publication
Signup and view all the flashcards
Copying data into archive & Removing archived data
Copying data into archive & Removing archived data
Signup and view all the flashcards
Data Life Cycle
Data Life Cycle
Signup and view all the flashcards
Preparation
Preparation
Signup and view all the flashcards
Storage & Protection
Storage & Protection
Signup and view all the flashcards
Usage
Usage
Signup and view all the flashcards
Destruction
Destruction
Signup and view all the flashcards
Unique attribute names
Unique attribute names
Signup and view all the flashcards
Indentation in XML
Indentation in XML
Signup and view all the flashcards
Nesting elements in XML
Nesting elements in XML
Signup and view all the flashcards
Root element in XML
Root element in XML
Signup and view all the flashcards
XML declaration
XML declaration
Signup and view all the flashcards
Case sensitivity in XML
Case sensitivity in XML
Signup and view all the flashcards
Valid XML tag names
Valid XML tag names
Signup and view all the flashcards
Attribute values in XML
Attribute values in XML
Signup and view all the flashcards
What is XML?
What is XML?
Signup and view all the flashcards
What makes XML self-describing?
What makes XML self-describing?
Signup and view all the flashcards
What are tags and elements in XML?
What are tags and elements in XML?
Signup and view all the flashcards
What are attributes in XML?
What are attributes in XML?
Signup and view all the flashcards
How are special characters handled in XML?
How are special characters handled in XML?
Signup and view all the flashcards
Is XML flexible in defining tags?
Is XML flexible in defining tags?
Signup and view all the flashcards
What is JSON?
What is JSON?
Signup and view all the flashcards
What is the structure of JSON?
What is the structure of JSON?
Signup and view all the flashcards
Study Notes
Lesson 2: Data Life Cycle
- Learning Objectives:
- Understand the phases of the data lifecycle
- Identify the processes and activities within each phase
- Understand the DAMA Framework knowledge areas
- Interpret context diagrams
- Understand how analytics relates to the DAMA framework
- Distinguish good from bad data
- Interpret XML data formats
Data Life Cycle
- Phases: Sourcing, Storage & Preparation, Protection & Usage, Sharing, Archiving, Destruction
- Processes & Activities (Description):
- Sourcing: Collecting and capturing data from various sources, also known as 'data capture' or 'data acquisition'.
- Storage & Preparation (Storage & maintenance): Storing, maintaining, and preparing data for usage.
- Protection & Usage (Permitted use of data): Applying data to tasks needed to operate the enterprise, while protecting the data.
- Sharing: Sending data to users and entities inside and outside the enterprise for specific purposes (publication).
- Archiving: Archiving data that is no longer actively used for a specific retention period.
- Destruction: Removing every copy of data item from the enterprise, also known as 'purging' or 'permanently destroying'.
- Processes & Activities (Specific processes):
- Sourcing: Obtain external data, create/enter data, receive/capture data signals
- Storage & Preparation: Move and store data, cleanse/enrich data, transform/synthesise data, integrate data from multiple sources
- Protection & Usage: Apply data to enterprise tasks, protect/monitor/audit usage, search/classify/explore data, model/analyse data.
- Sharing: Data publication, data visualization, data sharing, data movement/copying, delivering data products to customers
- Archiving: Copying data into archive, removing archived data from active environments.
- Destruction: Permanently destroying data
Data Governance (DMBOK Definition & Activities)
- Definition: Planning, oversight, and control over the management of data and its related resources.
- Processes & Activities (Activities List):
- Enforcing consistent definitions, rules, business metrics, policies for data use, and establishing reference data and data ownership.
Data Architecture (DMBOK Definition & Activities)
- Definition: The overall structure of data and data-related resources as an integral part of the enterprise architecture.
- Processes & Activities (Activities List):
- Defining data needed to meet business needs, data facts and dimensions, logical data models, enterprise data flows. Examine completeness and correctness of data sources.
Context Diagram - Example
- A visual representation of the relationship between different systems and processes within an organization.
Data Modeling & Design (DMBOK Definition & Activities)
- Definition: Analysis, design, building, testing, and maintenance of data structures
- Processes & Activities (Activities List):
- Design and build: Conceptual, logical and physical data modeling , master data modeling, modeling and design for different architectures (data warehouse, data lake, cloud data storage).
Data Storage & Operations (DMBOK Definition & Activities)
- Definition: Deployment and management of structured physical data assets storage
- Processes & Activities (Activities List):
- Manage: Building and operating data storage solutions, performance management, backup & recovery of data assets, monitoring, archiving, and purging of data assets.
Data Security (DMBOK Definition & Activities)
- Definition: Ensuring privacy, confidentiality, and appropriate access to data
- Processes & Activities (Activities List):
- Define: Privacy and security, access management, security governance, data protection (encryption).
Data Integration & Interoperability (DMBOK Definition & Activities)
- Definition: Acquisition, extraction, transformation, movement, delivery, replication, federation, virtualization, and operational support of data assets.
- Processes & Activities (Activities List):
- Manage: Data acquisition and movement, transformation, interoperability and integration, data migration and conversion.
Documents & Content (DMBOK Definition & Activities)
- Definition: Storing, protecting, indexing, and enabling access to data found in unstructured sources and making it available for integration with structured data.
- Processes & Activities (Activities List):
- Govern: Content management, managing physical documents, managing electronic records.
Reference & Master Data (DMBOK Definition & Activities)
- Definition: Managing shared data to reduce redundancy and ensure better data quality through standardized definition and use of data values.
- Processes & Activities (Activities List):
- Govern: Establishing and managing systems of record for business, spatial, or market data, acquiring or creating systems of reference, defining data business rules.
Data Warehousing & Business Intelligence (DMBOK Definition & Activities)
- Definition: Managing analytical data processing and enabling access to decision support data for reporting and analysis
- Processes & Activities (Activities List):
- Govern: Data profiling and warehousing, data discovery, searching, and querying, operational and analytical reporting, analytics.
Metadata (DMBOK Definition & Activities)
- Definition: Collecting, categorizing, maintaining, integrating, controlling, managing, and delivering metadata
- Processes & Activities (Activities List):
- Manage: Business glossary / data dictionary, data classification.
Metadata: Information about data
- Metadata describes the data, as it's created, stored, transformed, accessed, and used across the entire enterprise.
- Business metadata describes data from a business perspective.
- Technical metadata describes the data as interpreted by software tools.
Qualities of Good Data (Five C's)
- Clean: Accurate, no missing data points, conforms to a format, no invalid entries.
- Consistent: Follows the same standards, definitions, and codes. Uses the same meaning
- Conformed: Data that can be shared across the same dimensions using the same business meaning.
- Current: Data as recent as required for business purposes.
- Comprehensive: Sufficient and complete for the intended purpose
XML Data Format
- A text-based format used to share data.
- Uses tags to describe data pieces.
- A metalanguage that allows for custom markup languages.
- XML is a specification for storing information with clear structure, following predefined rules.
- A root element is required -- this root element contains all the other elements in the document
XML vs JSON
- XML (Extensible Markup Language) is a markup language
- JSON (JavaScript Object Notation) is a lightweight data-interchange format.
Additional Topics
- Discussion on 'good/bad' data, examples of its origins.
- Discussion on why enterprises use data purging (destruction).
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
Test your knowledge on data management concepts and the data life cycle in this quiz. Explore various phases such as Sourcing, Sharing, and Archiving, along with essential skills and characteristics of good data. Understand the importance of data governance in managing data effectively.