Information Categories and Metadata Quiz

Choose a study mode

Play Quiz
Study Flashcards
Spaced Repetition
Chat to Lesson

Podcast

Play an AI-generated podcast conversation about this lesson

Questions and Answers

What are three categories that describe the nature of information resources?

  • Internal Information, External Information, Proprietary Information
  • Structured Information, Unstructured Information, Semi-structured Information (correct)
  • Digital Information, Analog Information, Mixed Information
  • Primary Information, Secondary Information, Tertiary Information

Give an example of each category of information.

Structured information: sales transaction records with clearly defined fields, such as customer number, item number, and amount. Unstructured information: a manila folder containing assorted items about a lawsuit, such as photos, notes, and articles. Semi-structured information: a web page with a title, subtitle, content, and a few images.

How do you characterize the relationships within each category of information?

Structured information: relationships are straightforward and easily identifiable, such as a customer's order being directly linked to the customer record. Unstructured information: relationships are difficult to identify, and the information is often stored in a disorganized manner. Semi-structured information: relationships are easier to query and combine than unstructured information, but they lack the strong structure of fully structured data.

What is Metadata?

<p>Metadata is data about data, and it clarifies the nature of the information.</p> Signup and view all the answers

What does Metadata describe for structured information?

<p>For structured information, metadata describes the definitions of each of the fields, tables, and their relationships. For example, in a database, metadata might describe that a field called &quot;BirthDate&quot; is of type &quot;Date/Time&quot; and has a specific format like &quot;MM/DD/YYYY&quot;.</p> Signup and view all the answers

What are the characteristics of information that affect quality (select all that apply)?

<p>Duplication (A), Consistency (B), Completeness (C), Accuracy (D), Precision (E), Timeliness (F), Bias (G)</p> Signup and view all the answers

What were the early design approaches to managing information resources?

<p>Before databases, information was stored in computer files, and each department maintained its own records. This approach led to problems such as data redundancy, lack of integration, inconsistent data definitions, and data dependence. This was called the &quot;File Processing Systems&quot; approach. Later, hierarchical databases, network databases, and relational databases were developed to address these limitations.</p> Signup and view all the answers

What are the advantages of Hierarchical Databases?

<p>Simple and efficient for one-to-many relationships. (A)</p> Signup and view all the answers

What are the advantages of Network Databases?

<p>Supports many-to-many relationships. (A)</p> Signup and view all the answers

What are the advantages of Relational Databases?

<p>Flexible for complex relationships. (B), Supports many-to-many relationships. (C), Highly flexible, easy to query, and supports complex relationships. (E), Requires careful planning and normalization to avoid redundancy. (F)</p> Signup and view all the answers

What are the steps in planning a relational data model?

<p>The steps include: 1. Identify Entities and Attributes, 2. Define Primary Keys, 3. Normalize the Data Model, 4. Establish Relationships.</p> Signup and view all the answers

What is the benefit of planning a relational data model?

<p>Proper planning ensures the database is well-structured, reduces redundancy, and makes it easier to maintain and query. It also minimizes the need for costly changes down the line.</p> Signup and view all the answers

What is a Primary Key?

<p>A field (or group of fields) that makes each record unique in a table. For example, an employee ID uniquely identifies each employee in an employee table.</p> Signup and view all the answers

What is a Foreign Key?

<p>A field in one table that is the primary key in another table, used to link records between tables. For example, a department ID in the employee table links to the department table, where department ID is the primary key.</p> Signup and view all the answers

How are Primary and Foreign Keys used to create links between tables in a relational database?

<p>Primary keys help to uniquely identify records within a table. Foreign keys act as bridges, linking records in one table to the primary key in another table, effectively establishing relationships and enabling data retrieval across tables.</p> Signup and view all the answers

What is the typical strategy to access a database?

<p>Most people access a database through an application interface with user-friendly web-based forms. These forms enable users to securely enter, edit, delete, and retrieve data.</p> Signup and view all the answers

How do users access an Access Database?

<p>Users can access an Access database through forms and reports generated by the software. Access provides tools for creating forms and reports, making it easy for users to interact with the database.</p> Signup and view all the answers

What are other strategies for accessing a database?

<p>All of the above (E)</p> Signup and view all the answers

What is the role of the Database Administrator (DBA)?

<p>The DBA is responsible for the efficient operation of the company's databases. They perform essential tasks such as monitoring and optimizing performance, troubleshooting bottlenecks, setting up new databases, enhancing security, planning capacity requirements, designing backup and disaster recovery plans, and working with department heads and the IT team to resolve problems and build innovative applications.</p> Signup and view all the answers

What is the career outlook for Database Administrators?

<p>The job outlook for database administrators is strong, with a projected 10-year job growth of 31% and a median salary of $87,200. The role is critical as organizations increasingly rely on data for decision-making.</p> Signup and view all the answers

What is SQL?

<p>SQL, or Structured Query Language, is a standard query language used to manipulate information in relational databases.</p> Signup and view all the answers

How is SQL used to query a database?

<p>SQL is used to create, read, update, and delete records in a database. For example, a simple SQL query to retrieve the last name and first name of an employee with the last name &quot;Park&quot; would look like this: SELECT LastName, FirstName FROM Employees WHERE LastName = &quot;Park&quot;;</p> Signup and view all the answers

What is a Shadow System?

<p>A shadow system is a smaller database or information system developed by individuals or departments outside of the IT department. These systems focus on specific information requirements and are not managed by central IT staff.</p> Signup and view all the answers

Why are Shadow Systems used?

<p>Shadow systems are often created because employees want to get their jobs done more efficiently and quickly, especially when the central IT systems are slow to adapt or lack specific functionality.</p> Signup and view all the answers

How are Shadow Systems managed?

<p>Shadow systems are typically managed by the individuals or departments that create them. They are often built using tools like Microsoft Access or Excel.</p> Signup and view all the answers

What are the advantages of Shadow Systems?

<p>Shadow systems provide quick solutions to specific problems and allow departments to customize systems to their needs.</p> Signup and view all the answers

What is Master Data Management (MDM)?

<p>Master Data Management (MDM) is an approach that addresses inconsistencies in how employees use data by achieving uniform definitions for entities and their attributes across all business units. It ensures that everyone in the organization uses the same definitions for terms like &quot;employee,&quot; &quot;sale,&quot; or &quot;student.&quot;</p> Signup and view all the answers

What is a Data Steward?

<p>A Data Steward is a person responsible for ensuring that people adhere to the definitions for master data in their organizational units. They act as watchdogs and bridge builders to maintain data consistency.</p> Signup and view all the answers

What is the role of Master Data Management in an organization's integration strategy?

<p>Master Data Management is critical for integrating data from multiple sources, especially during mergers, and when combining systems from different departments. It ensures that reports and summaries are consistent and accurate.</p> Signup and view all the answers

What is a Data Warehouse?

<p>A Data Warehouse is a central data repository containing information drawn from multiple sources, used for analysis, intelligence gathering, and strategic planning.</p> Signup and view all the answers

What are the three steps in building a Data Warehouse?

<p>The three steps are: 1. Extract, 2. Transform, 3. Load.</p> Signup and view all the answers

What is meant by "Extract" in a Data Warehouse?

<p>Extract refers to extracting data from its home database, which may include operational systems or external sources.</p> Signup and view all the answers

What is meant by "Transform" in a Data Warehouse?

<p>Transform involves cleaning and transforming the extracted data to conform to common data definitions and ensure data consistency.</p> Signup and view all the answers

What is meant by "Load" in a Data Warehouse?

<p>Load refers to the process of loading the cleaned and transformed data into the data warehouse, which may be a single or ongoing process.</p> Signup and view all the answers

What are examples of internal sources of data for a data warehouse?

<p>Internal sources include operational data from the company's own systems, such as customer records, transactions, inventory, and human resource information.</p> Signup and view all the answers

What are four examples of Data Warehouse Architectures?

<p>Data Cubes (A), Relational Database (B), Data Warehouse Appliance (C), Virtual Federated Warehouse (D)</p> Signup and view all the answers

Which Data Warehouse Architecture is suitable to meet today's growing demand for real-time information?

<p>The Virtual Federated Warehouse is suitable for real-time information because it extracts and transforms data in real time rather than taking periodic snapshots.</p> Signup and view all the answers

What are the defining features of Big Data (select all that apply)?

<p>Velocity (B), Volume (C), Variety (D)</p> Signup and view all the answers

What is Data Mining?

<p>Data Mining is a type of intelligence gathering that uses statistical techniques to explore large data sets, hunting for hidden patterns and relationships that are undetectable in routine reports.</p> Signup and view all the answers

What is the difference between Data Mining and Data Dredging?

<p>Data Dredging refers to finding relationships that might occur by accident and have little value, whereas Data Mining focuses on discovering meaningful and actionable insights.</p> Signup and view all the answers

What is the goal of Data Mining?

<p>The goal of Data Mining is to uncover patterns and relationships that can help organizations make better decisions, predict future trends, and gain a competitive advantage.</p> Signup and view all the answers

What are examples of Databases Without Boundaries?

<p>Examples include: Craigslist, Instagram, and Google Person Finder.</p> Signup and view all the answers

How do ownership issues affect information management?

<p>Ownership issues arise because people often view information resources protectively, even when the organization claims ownership. For example, salespeople may want to protect access to their sales leads, or departments may want to control who can modify their records. These issues can lead to conflicts and delays in making changes to the database.</p> Signup and view all the answers

How do information management needs differ among stakeholder groups (select all that apply)?

<p>Customers want simple, reliable user interfaces and quick access to information. (A), Top-Level Management needs strategic information from big data. (B), Government Agencies require compliance reports using their own definitions. (C), Operating Units need transaction-level reports and systems that support fast-moving business requirements. (D)</p> Signup and view all the answers

Flashcards

Structured Information

Information that is ordered and easily organized into hierarchies.

Unstructured Information

Information that lacks inherent structure and is difficult to organize.

Semi-Structured Information

Information that has some structure but lacks full organization.

Metadata

Data that describes other data, clarifying its nature and properties.

Signup and view all the flashcards

Accuracy

The degree to which information is free from errors.

Signup and view all the flashcards

Precision

The specificity of information, avoiding vagueness.

Signup and view all the flashcards

Completeness

The extent to which all necessary data is included.

Signup and view all the flashcards

Consistency

Uniformity of data across different records and systems.

Signup and view all the flashcards

Timeliness

The relevance of information based on how current it is.

Signup and view all the flashcards

Bias

A subjective slant in information that affects objectivity.

Signup and view all the flashcards

Duplication

Redundant data entries that can mislead or distort information.

Signup and view all the flashcards

File Processing Systems

Early data management systems storing files for each department independently.

Signup and view all the flashcards

Hierarchical Database

A database structure that organizes data in a tree-like format.

Signup and view all the flashcards

Network Database

A database allowing flexible relationships between records.

Signup and view all the flashcards

Relational Database

Organizes data into tables that relate to one another via keys.

Signup and view all the flashcards

Primary Key

A unique identifier for records in a table.

Signup and view all the flashcards

Foreign Key

A field in one table that links to a primary key in another table.

Signup and view all the flashcards

SQL

Structured Query Language used for managing and querying relational databases.

Signup and view all the flashcards

IVR

Interactive Voice Response; a technology for voice or keypad input accessing databases.

Signup and view all the flashcards

Shadow System

An unofficial database created by individuals outside IT for specific needs.

Signup and view all the flashcards

Master Data Management (MDM)

An approach ensuring consistent definitions for data across an organization.

Signup and view all the flashcards

Data Warehouse

A central repository for data analysis from various sources.

Signup and view all the flashcards

Big Data

Large, rapidly growing datasets that are challenging to store and analyze.

Signup and view all the flashcards

Data Mining

Intelligence gathering that uncovers hidden patterns in large datasets.

Signup and view all the flashcards

Databases Without Boundaries

Databases mainly managed by external users rather than the organization.

Signup and view all the flashcards

Ownership Issues

Conflicts arising from differing perceptions of data ownership within an organization.

Signup and view all the flashcards

Data Warehouse Architectures

Different structures for organizing data warehouses to meet needs.

Signup and view all the flashcards

Data Steward

A person responsible for enforcing master data definitions within units.

Signup and view all the flashcards

Study Notes

Information Categories

  • Structured information is organized and ordered, broken down into parts like facts or data. A sales transaction with fields for date, customer, item, and amount is an example. Relationships are straightforward, like a customer order relating to a customer record.
  • Unstructured information has no inherent order or structure. Examples include assorted documents or items about a legal case (e.g., photos, handwritten notes, and articles). Relationships are difficult to identify.
  • Semi-structured information has some structure but less than fully structured information. Web pages, with titles, subtitles, and content are examples. Relationships are easier to query and combine than unstructured data, but lack the strong structure of structured data.

Metadata

  • Metadata describes data about data (clarifies information).
  • Structured metadata for information describes the field definitions, tables, and relationships in information. Eg. a field called BirthDate is of data type Date/Time and format MM/DD/YYYY.
  • Unstructured metadata describes properties of a document or resource. For a photo, metadata might include the date taken, photographer, keywords like beach or vacation.

Information Quality

  • Accuracy: errors in dates, spelling, or prices affect quality (e.g., incorrect zip code).
  • Precision: estimations with varying degrees of accuracy; "about 2 miles" for property surveys is inappropriate.
  • Completeness: omission of certain information may need to be accounted for (e.g., house number to process an order)
  • Consistency: conflicting definitions across different data gathering methods regarding information, resulting in incorrect or incomplete summaries.
  • Timeliness: outdated data has less value than up-to-date data (e.g stock trading with a fraction of a second).
  • Bias: lack of objectivity, may lead to misleading data like a manager trying to make a summary report seem greater, by including canceled orders
  • Duplication: repetitive information, for example, a customer listed more than once in a database due to changes in address.

Early Database Management Approaches

  • File processing systems: each department manages and stores its own records (e.g., payroll, HR). Problems arise from data redundancy, inconsistent definitions, and a lack of data integration.
  • Hierarchical databases: tree-like structure for one-to-many relationships. Difficult to manage many-to-many relationships.
  • Network databases: more flexible than hierarchical ones. They can handle many-to-many relationships more easily but are complex to manage.
  • Relational databases: most widely used, organize data in tables linked by matching fields. Highly flexible and simple to manage complex relationships. More common in current systems.

Database Access Strategies

  • Typical strategy: application interface (web-based forms) for users to enter, edit, or retrieve data securely.
  • Other techniques: query languages (SQL), IVR systems, mobile apps, or natural language interfaces for specific tasks.

Database Administrator (DBA) Role

  • Responsible for the efficient operation and management of databases. The key duties of DBAs include monitoring, optimizing, and troubleshooting systems, setting up new databases, and handling security issues. The responsibilities include resolving problems, building innovative applications and adhering to the company's database needs.
  • Career outlook is strong, with projected growth and a median salary of $87,200.

Structured Query Language (SQL)

  • Standard language for manipulating data in relational databases.
  • Used for creating, reading, updating, and deleting information in databases.

Databases Without Boundaries

  • Databases managed by parties outside the enterprise (e.g., Craigslist, Instagram, Google Person Finder).

Information Ownership Issues

  • Conflicts arise from information ownership perspectives of different stakeholder groups,
  • Top-level management requires high-level data insights.

Studying That Suits You

Use AI to generate personalized quizzes and flashcards to suit your learning preferences.

Quiz Team

Related Documents

More Like This

Use Quizgecko on...
Browser
Browser