std_skewness_questions.csv
Uploaded by momogamain
Full Transcript
Question,Answer A,Answer B,Answer C,Answer D,Correct Answer,Explanation
"You have a dataset of daily temperatures over a year, and you want to understand the spread of temperature fluctuations. Which measure should you calculate?",Mean,Standard deviation,Median,Mode,B,"Standard deviation measures how much temperatures vary from the mean, providing insight into temperature fluctuations."
A dataset of house prices is highly skewed to the right due to a few luxury mansions. Which visualization would best show the skewness and outliers?,Histogram,Pie chart,Scatter plot,Line graph,A,A histogram effectively displays the skewness and highlights the presence of outliers in house prices.
"You have collected data on the monthly sales of a product, which are highly variable. Which visualization would best help you understand the variability over time?",Box plot,Time plot,Bar chart,Histogram,B,"A time plot shows how sales fluctuate over time, helping to identify patterns and variability."
A researcher has a dataset of patient cholesterol levels and wants to see if there are extreme cases. Which measure of spread should they look at?,Mean,Interquartile Range (IQR),Mode,Range,B,The IQR describes the spread of the middle 50% of the data and gives a baseline for flagging values that fall far outside it as potential outliers.
You are analyzing customer purchase amounts and find that the mean is significantly higher than the median. What does this suggest about the data distribution?,The data is normally distributed,The data is negatively skewed,The data is positively skewed,The data has no outliers,C,"When the mean is well above the median, the data is positively skewed: a right tail of high values pulls the mean upward."
You have a dataset of ages in a retirement community. Which measure of center would be most robust if a few very young visitors were included in the dataset?,Mean,Median,Mode,Standard deviation,B,"The median is robust and barely changes when a few young outliers are included, unlike the mean."
"A data scientist is comparing the distribution of exam scores between two classes. One class has a small standard deviation, while the other has a large one. What does this imply?",The first class has scores that are spread out widely,The second class has scores that are closely packed around the mean,The first class has scores that are tightly clustered around the mean,Both classes have the same score distribution,C,"A small standard deviation indicates that scores are closely clustered around the mean, while a large one indicates more spread." You are given a dataset of annual rainfall in different cities and want to visualize both the distribution and outliers. What is the best visualization to use?,Box plot,Time plot,Pie chart,Bar chart,A,"A box plot shows the distribution, quartiles, and potential outliers, making it ideal for this purpose." "In a dataset of heights of basketball players, you notice a right skew due to a few very tall players. Which measure would best represent the typical height?",Mean,Median,Mode,Standard deviation,B,The median is less affected by extremely tall players and would better represent the typical height. A financial analyst is examining the returns of a stock over 10 years. The returns are highly volatile. What measure should they use to report the volatility?,Mean,Standard deviation,Median,Mode,B,"Standard deviation quantifies the volatility of the stock returns, giving a measure of how much the returns deviate from the average." You are analyzing the distribution of daily step counts from a fitness app. The data is roughly symmetrical. Which visualization would best show the spread and shape?,Pie chart,Box plot,Histogram,Scatter plot,C,A histogram is effective for visualizing the spread and shape of a roughly symmetrical distribution. 
"If a dataset of house prices has a standard deviation of \$50,000, what does this tell you about the variation in house prices?",The house prices are all close to the average,The house prices vary widely around the average,The house prices are exactly the same,The mean and median are equal,B,A large standard deviation indicates that house prices vary widely around the mean. A dataset of monthly electricity usage has a mean of 500 kWh and a large standard deviation. What does this imply about household electricity usage?,Households use electricity at similar levels each month,There is significant variation in electricity usage among households,All households use less than 500 kWh,The mean is not a reliable measure,B,A large standard deviation indicates significant variation in electricity usage among households. You have a dataset of test scores with no outliers. Which measure of spread would be most appropriate to summarize the variability?,Range,Interquartile Range (IQR),Standard deviation,Median,C,"Standard deviation is appropriate for summarizing variability when there are no outliers, as it measures average spread around the mean." A researcher is analyzing the distribution of incomes in a city and wants to report a measure that is not affected by extreme high incomes. Which measure should they use?,Mean,Median,Mode,Range,B,The median is not affected by extreme values and is more reliable for skewed income distributions.