Data Viz Final AI Study Notes PDF
Uploaded by FrugalThorium8635
Summary
This document contains a set of questions on data visualization, SQL and data analysis. It includes questions on various topics, such as different types of data, different types of charts for data visualization and more. It also includes definitions of related concepts such as 'data' and 'data visualization'.
Full Transcript
Data viz final AI study notes

Which of the following is an example of structured data and semistructured data and unstructured data?

True or False: In a relational database, a foreign key is a unique identifier for each row in a table.

Q. What is the definition of data?
A. Recorded descriptions or measurements of something in the real world

Quiz 2:

If we only want to see or view a sample of the data, what SQL function is used?

_____ is used to request all columns when extracting data in SQL.

Quiz 3:

Which among the following is the correct syntax for creating a table?

What does the following statement in SQL do? DROP TABLE student;

Datetime data type is used to store time-based information. True / False?

How many operations are considered to be the most basic SQL operations?

What are the most basic SQL operations?

In SQL, the _________ wildcard represents any number of characters, including zero characters.

In SQL, the _________ wildcard represents…

What does the IN operator do in SQL?

Which of the following is not a valid SQL wildcard character?
a. Percentage "%"
b. Underscore "_"
c. Square brackets "[]"
d. Dollar sign "$" (correct answer)

The _________ JOIN returns all records from the left table and the matched records from the right table.

In a self join, a table is joined with _________.

The _________ operator in SQL is used to combine the result-set of two or more SELECT statements.

Which clause is used to sort a UNION result as a whole?

To combine multiple retrievals, we write several SELECT statements and put the keyword between them. What is the keyword?

Which of the following statements is correct?
a. UNION and UNION ALL do the exact same thing
b. UNION does not remove duplicates
c. UNION ALL does not remove duplicate rows (correct answer)
d. UNION ALL removes duplicate rows

Gilbert Furniture has initiated a new marketing campaign for its high-end desk lamp.
The analyst for e-commerce, Lauren Stevens, has been tracking the progress of the campaign and has collected the following data based on an email sent to the customer list: 68% opened the email, 29% clicked on the web link in the email, 11% added the desk lamp to their cart, and 9% purchased the lamp. Which of the following is the most appropriate chart for these data?
a. Funnel chart
b. Line chart
c. Scatter plot
d. Bar graph

Types of Analytics. Indicate which type of analytics (descriptive, predictive, or prescriptive analytics) each of the following represents.
a. data dashboard
b. a model that finds the production schedule that minimizes overtime
c. a model that forecasts sales for the next quarter
d. a bar chart
e. a model that allocates your financial investments to achieve your financial goal

5. House Price and Square Footage. Suppose we want to better understand the relationship between house price and square footage of the house, and we have collected house price and square footage for 75 houses in a particular neighborhood of Cincinnati, Ohio, from the Zillow website on January 3, 2021. LO 2, 3
a. Are these data quantitative or categorical?
b. Are these data cross-sectional or time series?
c. Which of the following types of chart would provide the best display of these data? Explain your answer.
i. Bar chart
ii. Column chart
iii. Scatter chart

U.S. Netflix Subscribers. Refer to the previous problem. Suppose that in addition to the total number of Netflix subscribers, we have the number of those subscribers by year for the years 2010–2019 who live in the United States. Our message is to emphasize how much of the growth is coming from the United States. Which of the following types of charts would best display the data? Explain your answer. LO 2, 3
i. Bar chart
ii. Clustered column chart
iii. Stacked column chart
iv. Stock chart

Academic Makeup of Departments.
You are conducting an analysis of the makeup of the departments in your firm. Your goal is to compare the departments’ mixes of academic backgrounds. You have defined the following categories for academic background: Business, Engineering, and Other. You have the percentage of employees in each category for each of the four departments as shown in the table below. What type of chart is best suited to display these data?

When to use a column vs bar chart
When to use funnel vs bar chart
When to use stacked column vs clustered column

R&D Project Portfolio. The Ajax Company uses a portfolio approach to manage their research and development (R&D) projects. Ajax wants to keep a mix of projects to balance the expected return and risk profiles of their R&D activities. Consider a situation in which Ajax has six R&D projects as characterized in the following table. Each project is given an expected rate of return and a risk assessment, which is a value between 1 and 10, where 1 is the least risky and 10 is the riskiest. Ajax would like to visualize their current R&D projects to keep track of the overall risk and return of their R&D portfolio. Which of the following chart types would be the most appropriate for these data? Explain your answer.
i. Stacked bar chart
ii. Line chart
iii. Bubble chart
iv. Funnel chart

Choosing the Best Chart Type. Choose the most appropriate chart type (bar chart, bubble chart, choropleth map, line chart) for each data set described below. Use each chart type exactly once. LO 3
a. Advertising budget, number of salespeople, percent of market share for 10 products
b. Annual demand for potato chips (in tons) in the United States by state
c. Annual sales in millions of dollars for seven regional salespeople
d. Population of the United States for each year 1900–2020

Quiz 8: *** Chapter 3

For the description below, provide the name of the Gestalt principle that is being described.
Objects with like characteristics such as color, shape, size, etc.
are seen as belonging to the same group.
a. Proximity
b. Connection
c. Similarity
d. Enclosure

Which of the following types of memory is used to process pre-attentive attributes?
a. Random Access Memory
b. Hard Drive Memory
c. Iconic Memory
d. Long-term Memory

The portion of a data visualization that is devoid of markings is known as ______

Which of the following is an example of a common serif font?

The small end-of-stroke features that are visible in the characters created using this specific font are known as _____

Memory Used for Preattentive Attributes. Which of the following types of memory is used to process preattentive attributes?
i. Iconic memory
ii. Short-term memory
iii. Long-term memory
iv. Random access memory

Preattentive Attributes in a Data Visualization. Which of the following statements about the use of preattentive attributes in a data visualization are true? (Select all that apply.) LO 1
i. The use of preattentive attributes reduces the cognitive load required by the audience to interpret the information conveyed by a data visualization.
ii. Preattentive attributes can be used to draw the audience’s attention to certain parts of a data visualization.
iii. Overuse of preattentive attributes can lead to clutter and can be distracting to the audience.
iv. Preattentive attributes include attributes such as proximity and enclosure.

3. Descriptions of Gestalt Principles. For each description below, provide the name of the Gestalt principle that is being described. LO 2
a. Objects that are physically close to one another are seen as belonging to the same group.
b. Objects that are linked in some way are seen as belonging to the same group.
c. Objects that are physically bound together are seen as belonging to the same group.
d. Objects with like characteristics such as color, shape, size, etc. are seen as belonging to the same group.

Scatter with Straight Lines and Markers Chart in Excel.
Using a Scatter with Straight Lines and Markers Chart in Excel makes use of which Gestalt principle? LO 2
i. Similarity
ii. Proximity
iii. Enclosure
iv. Connection

Increasing the Data-Ink Ratio on a Chart. Which of the following changes to a chart would increase the data-ink ratio? (Select all that apply.) LO 3
i. Removing unnecessary gridlines.
ii. Removing a legend on a bar chart where each bar is already labeled with the same information.
iii. Adding axes labels to a chart where the units used in each axis are not clear from the chart title.
iv. Adding data labels for each point on a scatter chart when the audience does not need to know exact values.

6. Pie Charts versus Bar or Column Charts. Which of the following reasons accurately describe why bar or column charts are often preferred to pie charts for a data visualization? (Select all that apply.) LO 4
i. Bar and column charts utilize the Gestalt principle of proximity while pie charts use the Gestalt principle of connection.
ii. Using a legend for a pie chart creates unnecessary eye travel that can often be reduced by using a bar or column chart that does not require a legend.
iii. Bar and column charts use length rather than size to make comparisons, and length is much easier for the audience to interpret than size.
iv. Pie charts often use different colors to differentiate each piece of the pie that can create unnecessary clutter compared to bar or column charts that can display the same information without the use of multiple colors.

Identify whether a categorical color scheme, a sequential color scheme, or a diverging color scheme should be applied *** (pg 160) (176)

Categorical color scheme vs Sequential color scheme vs Diverging color scheme better suited for an audience that needs detailed insights or high-level insights?

Different chart types and their purposes
Box plot vs Slope chart vs Dot matrix chart vs Clustered column chart
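
SQL practice sketch. For the SQL questions in Quiz 2 and Quiz 3 above, it may help to run the statements rather than memorize them. The following is a minimal sketch using Python's built-in sqlite3 module. The table contents, the advisor table, and the column names are hypothetical, and LIMIT is the SQLite spelling for viewing a sample of rows (other SQL dialects may use a different keyword). It illustrates SELECT *, the % and _ wildcards, IN, LEFT JOIN, UNION versus UNION ALL, and DROP TABLE.

```python
import sqlite3

# Hypothetical in-memory tables; names and rows are made up for illustration.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
cur.executescript("""
CREATE TABLE advisor (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE student (id INTEGER PRIMARY KEY, name TEXT, advisor_id INTEGER);
INSERT INTO advisor VALUES (1, 'Lee'), (2, 'Park');
INSERT INTO student VALUES (1, 'Ana', 1), (2, 'Ben', 1), (3, 'Cal', NULL);
""")

# SELECT * requests every column; LIMIT (SQLite syntax) returns only a sample.
sample = cur.execute("SELECT * FROM student ORDER BY id LIMIT 2").fetchall()

# '%' matches any number of characters, including zero.
percent_match = cur.execute(
    "SELECT name FROM student WHERE name LIKE 'A%'").fetchall()

# '_' matches exactly one character.
underscore_match = cur.execute(
    "SELECT name FROM student WHERE name LIKE '_en'").fetchall()

# IN tests whether a value appears in a list of values.
in_match = cur.execute(
    "SELECT name FROM student WHERE id IN (1, 3) ORDER BY id").fetchall()

# LEFT JOIN keeps every row of the left table (student) and fills in NULL
# where the right table (advisor) has no match.
left_join = cur.execute("""
    SELECT s.name, a.name
    FROM student AS s
    LEFT JOIN advisor AS a ON s.advisor_id = a.id
    ORDER BY s.id
""").fetchall()

# UNION removes duplicate rows; UNION ALL keeps them.
# A single ORDER BY after the last SELECT sorts the combined result.
union_rows = cur.execute("""
    SELECT advisor_id FROM student WHERE advisor_id IS NOT NULL
    UNION
    SELECT id FROM advisor
    ORDER BY 1
""").fetchall()
union_all_rows = cur.execute("""
    SELECT advisor_id FROM student WHERE advisor_id IS NOT NULL
    UNION ALL
    SELECT id FROM advisor
    ORDER BY 1
""").fetchall()

# DROP TABLE removes the table definition together with all of its rows.
cur.execute("DROP TABLE student")
remaining_tables = cur.execute(
    "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
).fetchall()

print(union_rows, union_all_rows, remaining_tables)
```

Comparing union_rows with union_all_rows shows the duplicate-handling difference directly, and left_join shows the student without an advisor still appearing in the result.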