Cloud-Based Big Data Analytics Assignment

Questions and Answers

What is the main focus of the assignment?

  • Understanding cloud computing principles
  • Learning programming languages
  • Practical implementation of Big Data technologies (correct)
  • Exploring theoretical concepts of data analytics

    The Harvard Referencing System should not be used for this assignment.

    False

    What are the two main Big Data technologies required for this assignment?

    Apache Spark and Hadoop

    The assignment submission date is on __________.

    07/02/2025

    Match the following tasks with their respective descriptions:

    • Problem Definition = Identifying real-world business issues for Big Data analytics
    • Data Collection = Justifying the use of a relevant dataset for the project
    • Report Writing = Summarizing insights derived from data analysis
    • Feedback = Receiving evaluation comments on the assignment

    What percentage of total marks is allocated to the Problem Definition and Business Context?

    15%

    Students are allowed to reference Wikipedia for their work.

    False

    What must the dataset used for the project be greater than in size?

    10GB

    What is characterized by a 'good understanding of techniques applicable to their own research or advanced scholarship'?

    Good understanding

    An exceptional understanding of techniques involves no limitations and ambiguities.

    True

    What type of understanding is indicated by 'limited understanding of techniques applicable to their own research'?

    Limited understanding

    A person with __________ understanding tends to have little to no understanding of advanced techniques.

    limited

    Match the level of understanding with its corresponding description.

    • Excellent = Very good understanding of techniques applicable to advanced scholarship.
    • Competent = Good understanding of techniques.
    • Low = Limited understanding of techniques.
    • Exceptional = Outstanding mastery of techniques without ambiguities.

    Which of the following describes a person with a very good understanding of techniques?

    They can apply techniques effectively in their research.

    Advanced techniques are applicable solely to theoretical scholarship.

    False

    Someone with __________ understanding is characterized by the ability to work with techniques under certain limitations.

    good

    Which phrase best describes 'Excellent conceptual understanding'?

    Very good conceptual understanding with publishable quality

    Limited conceptual understanding is characterized by strong arguments and critical evaluation.

    False

    What is a key element of 'very good conceptual understanding'?

    Critical insight into advanced scholarship.

    A student with __________ conceptual understanding can critically evaluate and synthesize a wide range of views.

    good

    What does 'Low conceptual understanding' imply?

    Weakly constructed arguments and low critical evaluation

    Match the level of conceptual understanding with its characteristic:

    • Exceptional = Publishable quality with advanced engagement
    • Very good = Critical insight into scholarship
    • Limited = Weakly constructed arguments
    • Competent = Basic understanding with some critical evaluation

    Descriptive explanations contribute to strong argumentation.

    False

    What is one outcome of a 'conceptual understanding that enables the student to display originality'?

    The ability to critically evaluate and synthesize alternative views.

    What ability best describes the skill of critically appraising a wide range of sources?

    Established interpretation techniques

    Demonstrating ethical awareness is not important when interpreting knowledge in the discipline.

    False

    What is required for original creative or artistic application in a specific area of study?

    Creative or artistic skills related to that study

    The ability to _________ relevant points from sources is critical in advancing academic arguments.

    extract

    Match the skills with their academic relevance:

    • Ethical awareness = Important for integrity and trust
    • Creative skills = Enhances originality in projects
    • Analytical techniques = Critical for evaluating data
    • Established methods = Guide research practices

    Which of the following describes the importance of depth and breadth in academic study?

    Demonstrates critical thinking and broad understanding

    Good technical skills are sufficient for advancing work without the need for creativity.

    False

    Identify the key benefit of using well-established techniques in academic research.

    They help ensure validity and reliability in findings.

    What level of expression indicates competent terminology and minimal errors in spelling and syntax?

    Competent expression

    Very good expression includes many errors in spelling, grammar, and syntax.

    False

    What is necessary for excellent expression in decision-making?

    Exceptional expression with appropriate vocabulary and no errors in spelling, grammar, and syntax.

    Low use of appropriate terminology indicates a ______ level of expression.

    limited

    Match the following expression levels with their characteristics:

    • Competent expression = Good expression with some errors that do not affect understanding
    • Very good expression = Minimal errors in spelling and grammar
    • Limited expression = Little to no appropriate terminology
    • Exceptional expression = No errors in vocabulary or syntax

    Which level of digital literacy indicates little to no competency?

    Low evidence of digital literacy

    Good evidence of numeracy suggests high use of appropriate terminology.

    False

    An expression level with many errors in spelling, grammar, and syntax is identified as ______.

    limited

    Which level represents an excellent ability to manage learning while exercising initiative and personal responsibility?

    Exceptional ability to manage learning

    A person with a low ability to exercise initiative has good skills in decision-making in complex situations.

    False

    What is required for employment that necessitates the exercise of initiative?

    Transferable skills

    A person with very good ability to manage learning demonstrates _____ and exercises initiative.

    ethical and personal responsibility

    Match the team role levels with their descriptions:

    • Little to no ability = Limited ability to manage learning
    • Competent ability = Good ability to manage learning and exercise initiative
    • Very good ability = Systematic management of learning
    • Exceptional ability = Manage learning on own with initiative

    What characterizes a person with a 'good ability to manage learning'?

    Systematic management of learning and ethical decision-making

    Everyone who possesses very good ability in managing learning also has an excellent ability to make decisions.

    False

    What type of responsibility is emphasized in skills necessary for employment?

    Personal responsibility

    Study Notes

    Module Information

    • Degree: MSc Data Analytics
    • Module: Big Data Analytics
    • Assignment Title: Cloud-Based Big Data Analytics with Apache Spark and Hadoop
    • Assignment Type: Report
    • Word Limit: 3000 words (+/- 300)
    • Weighting: 100%
    • Issue Date: 19/11/2024
    • Submission Date: 07/02/2025
    • Feedback Date: 28/02/2025

    Plagiarism

    • Students must submit their own original work for assessment
    • Submissions will be electronically checked for plagiarism
    • Students must adhere to guidelines and regulations regarding plagiarism on InterActive/Canvas

    Learner Declaration

    • Students must sign a declaration stating the work submitted is their own and research sources are acknowledged.

    Harvard Referencing

    • The Harvard Referencing System must be used
    • Wikipedia, UKEssays.com, and similar websites are not allowed as sources

    Learning Outcomes

    • LO1: Understand basic concepts of Big Data and its importance in business.
    • LO2: Explain Hadoop and HDFS components within the Big Data ecosystem.
    • LO3: Summarize Big Data analytics using YARN, HDFS, and MapReduce.

    Assignment Tasks

    1. Problem Definition and Business Context (15% of total marks)

    • Identify: A real-world business problem suitable for Big Data analysis.
    • Write: A report (500-800 words) explaining the business context, the need for Big Data, and how Big Data analytics can provide value in solving the problem.
    • Suggest: A relevant, publicly available dataset (over 10GB) for the project.

    2. Cloud Environment Setup and Data Ingestion (25% of total marks)

    • Choose: A cloud platform (AWS, Google Cloud, or Azure) and set up a Big Data processing environment (EMR, Dataproc, or HDInsight).
    • Document: Steps taken to configure the cluster, including instance types, scaling options, and cost considerations.
    • Upload: The dataset to HDFS.
    • Explain: Data ingestion process, including file formats (CSV, JSON, Parquet), and ensuring proper data distribution.
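
When documenting data distribution, it can help to estimate how a file will be split once it is uploaded to HDFS. The following is a minimal sketch, assuming the stock Hadoop defaults of a 128 MB block size and a replication factor of 3; a managed cluster (EMR, Dataproc, or HDInsight) may configure different values, so check `dfs.blocksize` and `dfs.replication` on the actual cluster. The function name `hdfs_footprint` is hypothetical.

```python
BLOCK_SIZE_BYTES = 128 * 1024 ** 2  # assumption: default dfs.blocksize (128 MB)
REPLICATION_FACTOR = 3              # assumption: default dfs.replication


def hdfs_footprint(file_size_bytes,
                   block_size=BLOCK_SIZE_BYTES,
                   replication=REPLICATION_FACTOR):
    """Return (number of HDFS blocks, raw bytes stored across the cluster)."""
    if file_size_bytes < 0:
        raise ValueError("file size must be non-negative")
    # Ceiling division: a final partial block still counts as one block.
    blocks = -(-file_size_bytes // block_size)
    # Every byte is stored once per replica.
    raw_bytes = file_size_bytes * replication
    return blocks, raw_bytes


# A single 10 GiB file, the minimum dataset size the assignment asks for.
dataset_bytes = 10 * 1024 ** 3
blocks, raw_bytes = hdfs_footprint(dataset_bytes)
print(f"{blocks} blocks, {raw_bytes / 1024 ** 3:.0f} GiB of raw storage")
```

Under these assumptions the block count also gives a rough upper bound on how many map tasks can read the file in parallel, which is one way to justify the chosen number of worker nodes.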

    3. Data Processing with Spark and Hadoop (30% of total marks)

    • Implement: Two data processing tasks: a Hadoop MapReduce job (e.g., word frequencies, anomaly detection) and an Apache Spark job (e.g., advanced data transformations, EDA).
    • Evaluate: Performance of both tasks, comparing MapReduce with Spark concerning speed, scalability, and ease of use.
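
The word-frequency example mentioned for the MapReduce job can be sketched locally before it is written for the cluster. The code below is a plain-Python illustration of the map, shuffle, and reduce phases, not Hadoop code; the function names and the sample review lines are hypothetical, and a real job would read its input from HDFS and let the framework perform the shuffle.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs):
    """Shuffle: group emitted values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reducer: sum all counts collected for one word."""
    return word, sum(counts)


def word_frequencies(lines):
    """Run map, shuffle, and reduce over an iterable of text lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(word, counts)
                for word, counts in shuffle(pairs).items())


reviews = ["great product great price", "poor product"]  # hypothetical sample
print(word_frequencies(reviews))
```

Keeping the mapper and reducer as separate pure functions mirrors how the two phases are deployed independently on a cluster, and it makes the logic easy to check on a small sample before paying for cluster time.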

    4. Advanced Analytics and Machine Learning (30% of total marks)

    • Implement: A machine learning algorithm (e.g., classification, regression, or clustering) on the dataset using Apache Spark MLlib.
    • Detail: The model selection process, including data preprocessing and feature selection, model training, and evaluation.
    • Visualize: Results, highlighting any business insights.
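
For the evaluation step, it is useful to be clear about what the reported metrics mean. The sketch below computes accuracy, precision, recall, and F1 for a binary classifier in plain Python, assuming labels are encoded as 1 (positive) and 0 (negative); the function name and the sample data are hypothetical. On the cluster, the same figures would normally be obtained from Spark MLlib's evaluator classes rather than computed by hand.

```python
def binary_classification_metrics(labels, predictions):
    """Return accuracy, precision, recall, and F1 for 0/1 labels and predictions."""
    if len(labels) != len(predictions) or not labels:
        raise ValueError("labels and predictions must be non-empty and equal length")

    pairs = list(zip(labels, predictions))
    tp = sum(1 for y, p in pairs if y == 1 and p == 1)  # true positives
    tn = sum(1 for y, p in pairs if y == 0 and p == 0)  # true negatives
    fp = sum(1 for y, p in pairs if y == 0 and p == 1)  # false positives
    fn = sum(1 for y, p in pairs if y == 1 and p == 0)  # false negatives

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Hypothetical sentiment labels (1 = positive review) and model predictions.
labels = [1, 1, 1, 0, 0, 0, 1, 0]
predictions = [1, 1, 0, 0, 0, 1, 1, 0]
print(binary_classification_metrics(labels, predictions))
```

Reporting precision and recall alongside accuracy matters when the classes are imbalanced, which is common in review data where positive ratings tend to dominate.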

    Data Source

    • Dataset: Amazon Customer Reviews (E-commerce Dataset).
    • Information: Product reviews, customer sentiments, product popularity
    • Size: Over 10GB
    • Source: AWS Public Dataset.
    • Use Case: Sentiment analysis, customer behavior, product trends
    • Data Link: https://github.com/futurexskill/bigdata

    Submission Instructions

    • Compile: A comprehensive project report or presentation addressing all tasks.
    • Report Includes: Steps to set up the Hadoop cluster, data ingestion into HDFS, MapReduce/Spark job code, job submission, results analysis.
    • Format: Use the appropriate BSBI template (available on Canvas), Harvard Referencing style, and follow all specified submission instructions on Canvas.


    Description

    This assignment focuses on the analysis and implementation of cloud-based big data analytics using Apache Spark and Hadoop. Students are required to explore the foundational concepts of big data and its business significance while adhering to academic integrity through original work and proper referencing.
