Assignment 1 Quiz 2 CSE5BDC T5 2023

7 Questions

What is the benefit of using subqueries in Hive versus writing multiple separate queries to solve a problem?

There is a potential for fewer MapReduce jobs to be executed.

Which of the following statements is true of Hive?

SORT BY only sorts the data within each reducer.
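To make the answer above concrete, here is a minimal sketch in plain Python that simulates the difference between SORT BY (sorted within each reducer) and ORDER BY (one total order). It does not call Hive. The function names and the round-robin assignment of rows to reducers are assumptions made purely for illustration; Hive decides reducer assignment differently.

```python
# Simulation only: round-robin partitioning is an illustrative assumption,
# not how Hive actually assigns rows to reducers.

def sort_by(rows, num_reducers):
    """Sort rows within each simulated reducer, as SORT BY does."""
    partitions = [[] for _ in range(num_reducers)]
    for i, row in enumerate(rows):
        partitions[i % num_reducers].append(row)
    return [sorted(part) for part in partitions]

def order_by(rows):
    """Produce a single total order over all rows, as ORDER BY does."""
    return sorted(rows)

values = [5, 1, 4, 2, 3, 0]
per_reducer = sort_by(values, num_reducers=2)

# Concatenating the reducer outputs is not globally sorted.
concatenated = [x for part in per_reducer for x in part]
globally_sorted = order_by(values)
```

Each reducer's output is sorted on its own, but the concatenated result is only guaranteed to be in total order when a single reducer is used.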

Suppose you are given a table with the following columns: City Name, Country Name, Population. The table contains every city for every country in the world and their populations. You are asked to output the city with the highest population among all cities in Australia. Which Hive commands would you use?

SELECT, WHERE, ORDER BY, LIMIT
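The clause combination in the answer above can be sketched as a query. Hive is not available in a standalone snippet, so this example runs the query through Python's built-in sqlite3 module instead. The table name cities, the column names, and the rows are hypothetical placeholders, not real data.

```python
import sqlite3

# In-memory stand-in for the Hive table; all names and rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE cities (city_name TEXT, country_name TEXT, population INTEGER)"
)
conn.executemany(
    "INSERT INTO cities VALUES (?, ?, ?)",
    [
        ("CityA", "Australia", 300),
        ("CityB", "Australia", 500),
        ("CityC", "Japan", 900),
    ],
)

# WHERE keeps Australian cities, ORDER BY ranks them, LIMIT keeps the top one.
query = """
SELECT city_name, population
FROM cities
WHERE country_name = 'Australia'
ORDER BY population DESC
LIMIT 1
"""
largest_city = conn.execute(query).fetchone()
```

No GROUP BY is needed here because each row is already a single city; the filter and the sort are enough.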

Why does Hive allow users to define complex data types like structs, whereas traditional relational databases do not allow it?

Traditional relational databases perform schema on write, whereas Hive applies the schema on read.

Suppose you are asked to output the url, ipAddress and time of the most recent log entry in the mytraffic table of lab 3, where each row of the mytraffic table represents a log entry. Which of the following sets of Hive commands best solves this problem?

SELECT, ORDER BY, LIMIT
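As a rough illustration of the answer above, the sketch below runs the same clause order against an in-memory sqlite3 table rather than Hive. The column types are assumptions: the lab's actual mytraffic schema is not given here, so time is stored as an integer where a larger value means a more recent entry, and the sample rows are placeholders.

```python
import sqlite3

# Stand-in for the lab's mytraffic table; column types and rows are assumed.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE mytraffic (url TEXT, ipAddress TEXT, time INTEGER)")
conn.executemany(
    "INSERT INTO mytraffic VALUES (?, ?, ?)",
    [
        ("/a", "192.0.2.1", 100),
        ("/b", "192.0.2.2", 300),
        ("/c", "192.0.2.3", 200),
    ],
)

# Sort newest first, then keep only the first row.
query = """
SELECT url, ipAddress, time
FROM mytraffic
ORDER BY time DESC
LIMIT 1
"""
most_recent = conn.execute(query).fetchone()
```

No WHERE clause is required because every row is a candidate; the ordering alone identifies the most recent entry.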

Which statement about the MapReduce job tracker is false?

The job tracker is responsible for balancing the workload across the different task trackers.

Suppose you are given a table with the following columns: City Name, Country Name, Population. The table contains every city for every country in the world and their populations. You are asked to list the top 10 countries in descending order of total population using Hive. Which Hive commands would you use?

SELECT, GROUP BY, ORDER BY, LIMIT
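The answer above can be sketched as an aggregation query. As before, this uses Python's sqlite3 module as a stand-in for Hive, and the table name, column names, and rows are hypothetical placeholders.

```python
import sqlite3

# In-memory stand-in for the Hive table; all names and rows are made up.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE cities (city_name TEXT, country_name TEXT, population INTEGER)"
)
conn.executemany(
    "INSERT INTO cities VALUES (?, ?, ?)",
    [
        ("CityA", "CountryA", 300),
        ("CityB", "CountryA", 500),
        ("CityC", "CountryB", 900),
        ("CityD", "CountryC", 100),
        ("CityE", "CountryC", 50),
    ],
)

# GROUP BY collapses cities into countries, SUM totals each country,
# ORDER BY ranks the totals, and LIMIT keeps at most ten rows.
query = """
SELECT country_name, SUM(population) AS total_population
FROM cities
GROUP BY country_name
ORDER BY total_population DESC
LIMIT 10
"""
top_countries = conn.execute(query).fetchall()
```

Unlike the single-city question, this one needs GROUP BY because the population must be totalled per country before ranking.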
