DS311 Advanced Databases Lecture Notes PDF
Document Details
Uploaded by ReliableChalcedony7557
Arab Academy for Science and Technology and Maritime Transport, College of Artificial Intelligence, El Alamein
Tags
Summary
These lecture notes cover an introduction to DS311 Advanced Databases, including course information, data scientist salaries in Egypt 2024, course objectives, aims, and an outline of the lectures and labs.
Full Transcript
بـسـم اللـه الـرحـمـن الرحـيم Arab Academy for Science &Technology & Maritime Transport College of Artificial Intelligence, El Alamein Data Science Department DS311 Advanced Databases Lecture 01: Course Introduction Lecturer: Prof. Ossama Mohamed Badawy ...
بـسـم اللـه الـرحـمـن الرحـيم Arab Academy for Science &Technology & Maritime Transport College of Artificial Intelligence, El Alamein Data Science Department DS311 Advanced Databases Lecture 01: Course Introduction Lecturer: Prof. Ossama Mohamed Badawy Course Information Instructor: Prof. Ossama M. Badawy Contact: [email protected] & [email protected] Office: Room 321 Building 3 Office Hours: Sunday, 09:00 –11:00, and by appointment Class Times: Wednesday, 11:00 am Prerequisites: Database Systems Credit: 3 UG credit hours Teaching Assistant: Eng. Mazen Aziz ; [email protected] Data scientist salaries in Egypt 2024 Factors Affecting Data Scientist Salaries in Egypt Experience: As with other regions, experience is a significant factor in determining data scientist salaries in Egypt. S Location: Cairo offers higher salaries compared to others. Company Size: Larger multinational corporations and tech startups in Egypt offer competitive salaries to attract top talent. Industry: Tech companies, financial institutions, and telecommunications firms tend to pay higher salaries. Skills: such as machine learning, deep learning, and big data analytics, are likely to command higher salaries. Data scientist salaries in Egypt 2024 Career Opportunities for Data Scientists in Egypt Data Analyst: Responsible for collecting, cleaning, and analyzing data to identify trends and insights. Data Engineer: maintaining data infrastructure and pipelines. Machine Learning Engineer: Develops and deploys machine learning models for various applications. Data Scientist: Combines data analysis, machine learning, and domain expertise to solve complex problems. Business Intelligence Analyst: Provides data-driven insights to support strategic decision-making. Data scientist salaries in Egypt 2024 Companies Hiring Data Scientists in Egypt Etisalat Egypt Orange Egypt TBC Bank Credit Agricole Egypt IBM Egypt Microsoft Egypt. Entry-level data scientists (1-3 years of experience) can expect to earn an average salary of around EGP 222,171/year Data scientist salaries in Egypt 2024 Coursera Data science jobs are increasing in demand as big data and technology industries grow. Data scientist salaries in 2024 have continued to rise due to the increasing demand for data-driven insights. Data Scientist Salary Ranges in Egypt Entry-level: EGP 15,000 - 30,000 per month Mid-level: EGP 30,000 - 60,000 per month Senior-level: EGP 60,000+ per month Key skills for data scientists Recruiters will be seeking: The ability to approach a problem analytically Excellent communication skills Teamwork skills Investigative skills Attention to detail and the ability to identify patterns Commercial awareness. DS311 Advanced Databases Course Objectives Theory: Students will be able to demonstrate Knowledge of: 1. Big Data and NoSQL Databases 2. Web Databases Practice: Students will be able to demonstrate Knowledge of: 3. Apply database theories to practical data science applications. 4. Implementing software applications 5. Investigative Study Course Aims The course introduces fundamental issues and novel techniques of Big Data & NoSQL databases, and Web databases, We will survey applications and provide an opportunity for hands-on experimentation with NoSQL databases specifically MongoDB and Neo4j, such as data modelling and their query languages. Web databases using advanced software packages from leading industrial vendors. Lecture Outline Table shows class meeting days, and topics. W Lecture Day & Date 1 Introduction to the Course; Why Adv. DB for DS? W 02/10/2024 2 Introduction to Big Data and map-reduce W 09/10/2024 3 NoSQL Databases: Introduction W 16/10/2024 4 Document-Based NOSQL Systems and MongoDB W 23/10/2024 5 MongoDB Data Modelling 1 W 30/10/2024 6 MongoDB Data Modelling 2 W 06/11/2024 7 7th Week Exam W 13/11/2024 8 MongoDB for Data Sc app: Query, Indexing, & Aggregation W 20/11/2024 9 MongoDB for DS: Loading Datasets Using Python W 27/11/2024 10 MongoDB for DS: Machine Learning & Data Visualization W 04/12/2024 11 Web Databases for Data Science W 11/12/2024 12 Web App Using MongoDB & Python W 18/12/2024 13 NOSQL DBs for Cloud Applications W 25/12/2024 14 Neo4j NoSQL Graph database W 01/01/2025 15 Course Projects Presentation W 08/01/2025 Lab Outline W Lab Day & Date 1 SQL recap part 1 W 02/10/2024 2 SQL recap part 2 W 09/10/2024 3 Entity Relationship Diagram (ERD) recap W 16/10/2024 4 Intro. To MongoDB compass & inserting queries W 23/10/2024 5 MongoDB finding queries part 1 W 30/10/2024 6 MongoDB finding queries part 2 & sorting and limiting documents W 06/11/2024 7 MongoDB operators W 13/11/2024 8 7th Lab quiz W 20/11/2024 9 MongoDB nested documents & update and delete queries W 27/11/2024 10 MongoDB aggregate functions W 04/12/2024 11 MongoDB & Python for Machine Learning W 11/12/2024 12 Advanced services by MongoDB & Intro. To Neo4J DB W 18/12/2024 13 Neo4J DB part 1 W 25/12/2024 14 Neo4J DB part2 W 01/01/2025 15 Final project Discussions W 08/01/2025 Representative Topics List ❑ I never specify exactly what material will be covered on any particular week and reserve the right to modify the presentation order of materials. ❑ This is for your benefit. Course progress will be based on feedback from students. ❑ The course schedule is subject to change with appropriate notice. 1-14 Course Assessment Coursework Percentage Class Participation 10% Quizzes\Tests 20% Labs and Projects 30% Final Examination 40% TOTAL 100% +10% Extra Credit Teaching methods Duration: 16 weeks, 64 hours in total Lectures: 32 hours (2 hours per week) Lab : 32 hours (2 hours per week) Expected workload On average students need to spend 2 hours of study and preparation for each 50-minute lecture. Hardware And Software Requirements You are required to provide your – Own suitable laptop & software – I recommend that your laptop have: at least 16 GB RAM, a multi-core Intel Core-i7 CPU, and an SSD (with over 100 GB free space). – If feasible, please add more RAM, an SSD, or both to your laptop (but backup your data first in case anything goes wrong). – Software: MongoDB, Python, MS SQL., & Neo4J Textbook and References ✓ Perkins, L., Redmond, E., Wilson, J.: Seven databases in seven weeks: a guide to modern databases and the NoSQL movement. 2nd ed., Pragmatic Bookshelf, 2018. ✓ Shannon Bradshaw, and Eoin Brazil, MongoDB: The Definitive Guide, O’Reilly Media, Inc., 2020. ✓ Beginning Neo4j: Create relationships and grow your application with Neo4j, Chris Kemper, Apress, 2020. ✓ Wilfried Lemahieu , Seppe vanden Broucke , Bart Baesens, Principles of Database Management, Cambridge University Press, 2018. ✓ Avi Silberschatz, Henry F. Korth, S. Sudarshan, Database System Concepts, 7th Ed., 2019. ✓ Nenad Jukic, Susan Vrbsky, Svetlozar Nestorov, Database Systems: Introduction to Databases and Data Warehouses, Prentice Hall, 2016. ✓ Catherine M Ricardo, Susan D Urban, Databases Illuminated (3rd edition), Jones & Bartlett Learning, LLC, an Ascend Learning Company, 2017. Literature and Resources Final Project ▪ The main component of this course will be the final group project. Students will organize into groups and choose to implement a project that is ▪ relevant to the Databases for “real-world” Data Science Applications materials discussed in class. ▪ We will discuss this more in-depth during class. ▪ The project involves design, implementation and verification of applications. ▪ Project is graded based on the design, performance, and correctness, including documentation. Means for Achieving Learning Outcomes Active Learning: Teaching will be through in-class activities and discussion, and a project. Class Discussions: You are expected to participate in discussions about the lecture material as well as the research presentations. Project: There will be an integrative application development project. Students will need to reserve time outside of class to work on the project. Course Requirements The following are required for this course: Class discussion and participation, assignments, mid-term exam, final exam, and project participation. Statement on Collaboration, Academic Honesty, & Plagiarism ✓ We encourage working together whenever possible. Talking about the course material is a great way to learn. ✓ An unacceptable form of dealing with homework is to copy a solution that someone else has written. Here will be a zero- tolerance policy for Cheating/Copying HW’s. ✓ If you are caught for a second time, you will fail the course. ✓ Presenting another's work as if it was your own, or cheating in exams will not be tolerated. ✓ In the Final Exam AASTMT rules are applied 1-23 Classroom Rules Of Conduct ✓If you cannot wait until the break and must temporarily leave the classroom to make an emergency phone call or to use the bathroom, etc., please be as undisruptive as you can. ✓Personal entertainment/personal are to be turned off in class. ✓Please turn off cell phones and iPods before entering class. ✓Once class has started, do not sign onto a classroom computer. ✓Non-compliance will have a negative impact on your participation grade. ✓If you have any questions or concerns, let me know as soon as possible. Classroom Rules Of Conduct ✓Be respectful of other people’s opinion in discussions. ✓Do not take naps or fall asleep during class. It is disrespectful toward the other students and the instructor. ✓Music listening, web-browsing, cell phone use, smart phone use, text-messaging, tweeting, social-networking & e-mail activities are prohibited in class, unless authorized by instructor under special circumstances. ✓Photography and audio/video recording of instructional activities are strictly prohibited unless prior permission is given. Netiquette Guidelines All opinions and experiences, no matter how different or controversial they may be perceived, must be respected. You are encouraged to comment, question, or critique an idea but you are not to attack an individual. Do not use offensive language. Present ideas appropriately. Be cautious in using Internet language. Popular emoticons such as can be helpful to convey your tone but do not overuse them. Never make fun of someone’s ability to read or write. Think and edit before you push the “Send” button. Tips for Success Make yourself a calendar with all of your due dates across ALL of your courses. Plan for when you will work on each one for completion in advance of the due dates. For this course, you should plan to work a few hours per course lecture as we move through the materials. Be sure to plan your time accordingly. Plan Ahead!! Study as you go instead of at the last minute! Course Policies Computer Account and Email Students should have a AAST email and Moodle accounts. This will allow you to log into the course's Moodle site. All AAST students automatically receive these accounts. Students should activate their e-mail accounts and check them every day. All students in this course are responsible for making sure they have working accounts prior to the first assignment. Course Policies Posting Questions and Getting Help When sending a message to me, please allow a minimum of 24 hours for a response. Most of the time I will respond much faster, but sometimes meetings take over my schedule. Be Proactive in Communication with Instructor If you have any trouble with assignments or other aspects of the course, make sure you let me know as early as possible. Make sure that you are proactive in informing me when difficulties arise during the semester so that we can help you find a solution. Course Policies Tests and Make-ups If a situation has occurred that requires your time and attention which will prevent submitting your graded activities on time, please notify your instructor 24 hours before the scheduled due date. It is the student's responsibility to give the instructor a written excuse and to arrange for any makeup work to be done. A makeup exam may be different (and possibly more difficult) than the regularly scheduled exam. Course Policies Attendance – I expect you to attend class and to arrive on time. Your grade may be affected if you are consistently tardy. – If you have to miss a class, you are responsible checking the course classroom to find any assignments or notes you may have missed. – Students may leave after 20 minutes if the instructor or a guest lecturer does not arrive in that time. Penalties Poor Attendance ✓Students who do not attend at least 80% of lectures AND labs will be BARRED from taking the final exam. ✓Lectures are crucial to understanding the course material and passing the exam. ✓The notes alone will not be sufficient. ✓Make sure your name has been called for attendance purposes or you will be counted as being absent. ✓Full attendance credit will be given to students who miss no more than two (2) total class periods Absenteeism ✓Do not be absent for any of the assessments and final examination ✓Make up tests will NOT be considered unless you have a very good reason and proof. ✓Even then, the one you take, will have to be different from the one everyone else did. ✓All letters must be handed to me within 3 days of the last absenteeism or it will not be accepted. Penalties Late homework policy ✓Homework submitted late without permission will be penalized according to the following formula: ✓(Penalized score) = (Your raw score) * (1 - 0.1 * (# of days past deadline)) ✓This formula will apply for up to Four days, after which the homework will not be accepted and you will receive a grade of zero. ✓Avoid invoking these penalties by starting early and seeking extra help. Lecture Sessions ✓All lectures will be conducted at the times and places scheduled. ✓Students are advised to seek help and ask relevant questions during lecture hours, if necessary. ✓All lectures should be completed by the end of the semester. Lecture Notes ✓A copy of the lecture slides will be made available to you at the web folder. ✓Please download the latest versions because they are frequently updated. ✓Contents of lectures are based on the textbook, recommended text and supplementary material. ✓Please read the textbook and any supplementary material on the subject you can find Final Examination ✓Final Examination questions will be based on ALL the topics covered in the course ✓Some questions will be based on the slides and textbook ✓Some questions will require you to think beyond the confines of the slides and the textbook(s) ✓Basically, you need to be able to piece together the various aspects and knowledge you have studied and apply them in new situations. Course Policies Withdrawal A syllabus constitutes an agreement between the student and the course instructor about course requirements. Participation in this course indicates your acceptance of its teaching focus, requirements, and policies. If you believe that the nature of this course does not meet – your interests, needs or expectations, – if you are not prepared for the amount of work – you should drop the class by the drop/add deadline. Meeting The Lecturer ✓This course is not difficult. If you put in the effort, you should do fine. ✓If you do not understand any of the topics covered in the lecture, please ask questions. ✓Students can meet the lecturer during consultation hours or make an appointment. Please Use the Link to access Google Classroom For lecture: ngwnuth https://classroom.google.com/c/NzEzMjU0NzQ0MzY1?cjc=ngwnuth Then, please fill the student data form: DS311 Student Data For Lab: kzdpdik https://classroom.google.com/c/NzE4NDgzNzk1OTQz?cjc=kzdpdik Lecture Assignment 1 Make a PowerPoint presentation about Why Should I Study databases for data science? Technical and Non-technical skills required for Data Scientist. The PowerPoint consists of (15-25) slides. It must include outline & references. The font size for the title 40 and for the text 28. 41 The End of the Lecture