Module 1 and 2.pdf
Document Details
Uploaded by ModernFermium
Tags
Related
Full Transcript
Fundamentals of Information Management (module 1) What is Data? Data refers to the collected resource itself, which are the basic data discovered, investigated, collected and created in real world. It can be used in a variety of forms like text, numbers, media, bytes, etc. It...
Fundamentals of Information Management (module 1) What is Data? Data refers to the collected resource itself, which are the basic data discovered, investigated, collected and created in real world. It can be used in a variety of forms like text, numbers, media, bytes, etc. It can be stored in pieces of paper or electronic memory, etc. What is Information? Information refers to various data that are organized, classified, analyzed, and systematized for certain purpose according to certain rules. Data vs Information Data: raw, unprocessed facts without context Information: data that has been processed and organized to provide meaning Information Management (IM) IM is a field that holds the collection, storage, organization, retrieval, and dissemination of information with an organization or system. Information Management Strategies IMS is are essential for optimizing how the information is handled within an organization. They help to ensure that the information is accurate, accessible, secure, and used effectively. Information Management Strategies 1. Information Governance Strategy a) Policy Development b) Roles and Responsibilities c) Compliance Information Management Strategies 2. Data Management Strategy a) Data Quality Management b) Data Integration c) Data Governance Information Management Strategies 3. Information Security Strategy a) Access Control b) Encryption c) Incident Response Information Management Strategies 4. Information Architecture Strategy a) Taxonomy Design b) Information Structure c) User Experience Information Management Strategies 5. Information Lifecycle Management a) Creation and Collection b) Storage and Archiving c) Disposal and Retention Activity 1 – Reflection Paper Database Definition and Structure (module 2) What is Database? Database is an organized collection of related data in a way that is stored and accessed electronically. Data processing sample What is DBMS? a software that allows creation, definition and manipulation of database, allowing users to store, process and analyze data easily. provides us with an interface or a tool, to perform various operations like creating database, storing data in it, updating data, creating tables in the database and a lot more. provides protection and security to the databases. Characteristics of DBMS 1. Data Abstraction – how data is physically stored, what data is stored and the relationships between them, and how the data is presented to the users. 2. Data Independence – changes in data structure or storage do not affect the application programs. 3. Data Integrity – ensures the accuracy and consistency of data through constraints, validation rules, and normalization to reduce data redundancy. 4. Data Security – control user access and implements user authentication, authorization, and encryption to safeguard data. 5. Data Redundancy and Consistency – Reduces data duplication and maintains consistency through centralized control and normalization. Characteristics of DBMS (conti...) 7. Transaction Management - ensures reliable transaction processing with ACID properties (Atomicity, Consistency, Isolation, Durability). 8. Concurrency Control - manages simultaneous data access by multiple transactions with locking mechanisms and isolation levels to prevent conflicts. 9. Backup and Recovery - provides mechanisms for data backup and recovery to protect against data loss and corruption. 10. Scalability - supports both vertical and horizontal scaling to handle increasing loads and data volumes. 11. Data Modeling - enables schema design and supports various data models (e.g., relational, document) based on the type of DBMS. Characteristics of DBMS (conti...) 13. Multi-User Support - allows multiple users to access and manipulate the database simultaneously with proper user management. 14. Data Recovery - maintains transaction logs and uses checkpointing to facilitate data recovery in case of system failures. 15. Administrative Tools - provides tools for database creation, maintenance, and monitoring, along with user interfaces for administration. 16. Data Distribution - supports replication and sharding to improve data availability, Advantages of DBMS Controls database redundancy: It can control data redundancy because it stores all the data in one single database file and that recorded data is placed in the database. Data sharing: In DBMS, the authorized users of an organization can share the data among multiple users. Easily Maintenance: It can be easily maintainable due to the centralized nature of the database system. Reduce time: It reduces development time and maintenance need. Backup: It provides backup and recovery subsystems which create automatic backup of data from hardware and software failures and restores the data if required. Multiple user interface: It provides different types of user interfaces like graphical user interfaces, application program interfaces Disadvantages of DBMS Cost of Hardware and Software: It requires a high speed of data processor and large memory size to run DBMS software. Size: It occupies a large space of disks and large memory to run them efficiently. Complexity: Database system creates additional complexity and requirements. Higher impact of failure: Failure is highly impacted the database because in most of the organization, all the data stored in a single database and if the database is damaged due to electric failure or database corruption then the data may be lost forever. History of Database System 1968 - File-Based database were introduced. Data was maintained in a flat file. 1968-1980 - was the era of the Hierarchical Database. Prominent hierarchical database model was IBM's first DBMS. It was called IMS (Information Management System). early 1960s - Charles Bachman developed the first DBMS at Honeywell called Integrated Data Store (IDS) but it was standardized in 1971 by the CODASYL group (Conference on Data Systems Languages). 1970 - Present - It is the era of Relational Database and Database Management. The Database Users Database Administrators Database Administrator or DBA is the one who manages the complete database management system. DBA takes care of the security of the DBMS, it's availability, managing the license keys, managing user accounts and access etc. Application Programmer or Software Developer This user group is involved in developing and designing the parts of DBMS. End User Types of DBMS 1. Relational DBMS (RDBMS) RDMBS uses tables to store data, with relationships define between them. (ex. MySQL, PostgreSQL, Oracle) Figure of DBMS and SQL 2. NoSQL DBMS NoSQL designed for unstructured or semi-structured data, often used in big data and real-time applications (ex. MongoDB, Cassandra, Redis) 3. Object-Oriented DBMS Object-Oriented DBMS stores data in objects, similar to object-oriented programming, (ex. ObjectDB) Data Models 1. Hierarchical Model The hierarchical database model organizes data into a tree-like structure, with a single root, to which all the other data is linked. The hierarchy starts from the Root data, and expands like a tree, adding child nodes to the parent nodes. In this model, a child node will only have a single parent node. This model efficiently describes many real-world relationships like the index of a book, etc. IBM's Information Management System (IMS) is based on this model. 1. Hierarchical Model (conti..) Data is organized into a tree-like structure with a one-to-many relationship between two different types of data. Advantages/Disadvantages of the Hierarchical Model Because it has one-to-many relationships between different types of data so it is easier and fast to fetch the data. But the Hierarchical model is less flexible. And it doesn't support many-to-many relationships. 2. Network Model The Network Model is an extension of the Hierarchical model. In this model, data is organized more like a graph, and allowed to have more than one parent node. In the network database model, data is more related as more relationships are established in this database model. Also, as the data is more related, hence accessing the data is also easier and fast. This database model uses many-to-many data relationships. Integrated Data Store (IDS) is based on this database model. This was the most widely used database model before Relational Model was 2. Network Model (conti..) The implementation of the Network model is complex, and it's very difficult to maintain and modify. You may want to explore this if you are developing some social networking applications, although the Graph Database model is new and is far better than the Network Database model. Advantages of the Network Model It supports complex relationships It allows more flexibility 3. Document Model In this model, data is stored as documents, often in JSON or XML Format (used in NoSQL database). Documents databases have flexible schema, meaning that not all document in a collection need to have the same fields. Advantages of the Document Model The document model is ubiquitous and enables rapid software development. The flexible schema allows for the data model to change as an application's requirements change. 4. Relational Model In this model, data is organized in two-dimensional tables and the relationship is maintained by storing a common field. This model was introduced by E.F Codd in 1970, and since then it has been the most widely used database model. The basic structure of data in the relational model is tables. All the information related to a particular type is stored in rows of that table. Hence, tables are also known as relations in the relational model. You can design tables, normalize them to reduce data redundancy, and use Structured Query language or SQL to access data from the tables. 4. Relational Model (conti..) Some of the most popular databases are based on this database model. For example, Oracle, MySQL, etc. Advantages of the Relational Model It's simple and easy to implement. Poplar database software is available for this database model. It supports SQL using which you can easily query the data. DBMS Queries DBMS Queries 1. SQL (Structured Query Language) – used to query in relational databases. Common commands include ‘SELECT’, ‘INSERT’, ‘UPDATE’, ‘DELETE’. Sample query select * students where status = ‘reg’; 2. NoSQL Query Language – vary by database type, such as MongoDB’s query language for document databases. Sample query const cursor = db.collection(‘students').find({ status: ‘reg’ }); Activity 2 – Infographic Creation Illustrate the importance of database and their structures. (2 models)