Podcast
Questions and Answers
What is a database?
What is a database?
A database is an organised collection of data.
Who is the module coordinator for Full Time?
Who is the module coordinator for Full Time?
Which day and time is the Full-Time Group CT class scheduled?
Which day and time is the Full-Time Group CT class scheduled?
Data are discrete objective facts that have meaning in isolation.
Data are discrete objective facts that have meaning in isolation.
Signup and view all the answers
The module will cover topics including _____ Basics, Data Models, Query Processing in the Relational Model.
The module will cover topics including _____ Basics, Data Models, Query Processing in the Relational Model.
Signup and view all the answers
What will students be able to do at the end of this module?
What will students be able to do at the end of this module?
Signup and view all the answers
Which Database Management Systems (DBMS) will be used in the labs?
Which Database Management Systems (DBMS) will be used in the labs?
Signup and view all the answers
What is the estimated amount of data in the digital universe as of 2017?
What is the estimated amount of data in the digital universe as of 2017?
Signup and view all the answers
What do Knowledge and Wisdom help with?
What do Knowledge and Wisdom help with?
Signup and view all the answers
Which statement is true regarding knowing EName?
Which statement is true regarding knowing EName?
Signup and view all the answers
Knowing Manager uniquely identifies DName.
Knowing Manager uniquely identifies DName.
Signup and view all the answers
Is knowing Salary enough to identify DName?
Is knowing Salary enough to identify DName?
Signup and view all the answers
Dept# uniquely gives DName, which gives Manager.
Dept# uniquely gives DName, which gives Manager.
Signup and view all the answers
What is an example of a problem with a file-based approach?
What is an example of a problem with a file-based approach?
Signup and view all the answers
What is the advantage of the database approach over the file-based approach?
What is the advantage of the database approach over the file-based approach?
Signup and view all the answers
The database is a shared collection of ______________.
The database is a shared collection of ______________.
Signup and view all the answers
Study Notes
Module Information
- The module is called “Introduction to Databases” and is referred to as COM106.
- The module is taught by Dr. Matthew Burns for full-time students and Dr. Hanif Ullah for part-time students.
- The module is supported by Dr. Naveed Khan for lectures and labs, and Dr. Jose Santos for labs.
Module Delivery
- Lectures are held on Mondays from 15:15 to 17:05 in BC-LG-211.
- Part-time students attend lectures on Mondays from 17:15 to 19:05 in BC-02-303 and BC-03-303.
- Full-time students in IC, CS, and SE attend lectures on Fridays from 09:15 to 12:05 in BC-02-303, BC-03-303, BC-03-305, and BC-03-311.
- Full-time students in CT attend lectures on Fridays from 12:15 to 15:05 in BC-03-305.
Module Content
- The module will cover the following topics:
- Database Basics
- Data Models
- Query Processing in the Relational Model
- Data Management Issues
- Running SQL in Jupyter Notebooks
- Combining SQL and Python
Learning Outcomes
- Students will be able to explain concepts relating to the design, structure, implementation, manipulation, and management of relational databases.
- Students will be able to apply models, processes, and techniques in the design, implementation, and use of database systems to meet business needs.
- Students can construct a database from a given relational schema using a commercial database management system.
- Students can perform basic data manipulation and information retrieval using SQL.
Database Management Systems
- Microsoft SQL Server 2017 or Azure SQL Databases are utilized as the primary database management systems (DBMS) in labs.
- Jupyter Notebooks will be introduced.
Understanding Data
- Data refers to facts and statistics collected for reference or analysis.
- Computer data can be information processed or stored by a computer, in various forms, including bits, bytes, text, images, audio, software programs, or other types of data.
Databases and Importance
- Databases are organized collections of data.
- They allow for easy access, management, and updating of information.
- They are of fundamental importance in a digital world, especially when needing organized data to build knowledge and make decisions.
Data, Information, Knowledge, and Wisdom
- Data are discrete, objective facts lacking meaning in isolation.
- Information is data with relevance and purpose.
- Knowledge is information placed in context based on facts and meaning; it is actionable.
- Wisdom involves understanding and using knowledge for effective decision-making.
Digital Universe Data
- There is an immense amount of data in the digital universe, estimated at 16 Zettabytes (1021 bytes) in 2017.
- 90% of this data was created in the last two years.
Importance of Data Analysis
- There is a growing need to analyze data in near real-time to derive value.
Key Terms
- Data Analytics: The process of examining raw data to extract meaningful insights and patterns.
- Big Data: Extremely large and complex data sets that are difficult to process using traditional methods.
- Data Mining: The process of discovering patterns and insights from large datasets.
- Zettabyte: A unit of data equal to one trillion gigabytes.
Data Dependencies
-
A data dependency exists when one or more attributes in a table determine another attribute(s). For example, knowing the EName (Employee Name) uniquely identifies the Manager in the given data table.
-
Knowing the Manager uniquely identifies the Dept# (Department Number) in the data table.
-
Knowing the Salary does not uniquely identify the DName (Department Name) in the table.
-
Knowing the DName (Department Name) does not uniquely identify the Manager in the table.
-
Knowing the EName (Employee Name) uniquely identifies the Dept# (Department Number) in the data table.
Data Duplication
- The data table showcases data duplication. For example, the Department name ("Admin") and the Manager ("Whyte") are repeated for multiple employees, causing redundancy and inconsistency in the data.
Problems with File-Based Approach
- In a file-based approach, each department or entity maintains its own independent data, leading to data redundancy, inconsistencies, and potential issues with data format compatibility.
University Example
-
The university example illustrates the problems of the file-based approach.
- Each department (Student Accommodation, Staff Records, Student Registration, Payroll) maintains its own data.
- This leads to data duplication, for example, student names and addresses are repeated in multiple departments.
Centralized Database Approach
-
The database approach overcomes many of the problems associated with the file-based approach.
-
A database is a collection of logically related data, stored in a structured format, enabling efficient interaction with data and extraction of meaningful information.
Studying That Suits You
Use AI to generate personalized quizzes and flashcards to suit your learning preferences.
Related Documents
Description
Test your knowledge of the key concepts from the Introduction to Databases module (COM106). This quiz covers essential topics such as database basics, data models, query processing, and the integration of SQL with Python. Perfect for both full-time and part-time students looking to review their understanding of database management.