classic_model.csv
Uploaded by momogamain
Question,A,B,C,D,CorrectAnswer,Explanation
"A dataset of test scores is heavily skewed to the right, with a few very high scores. Which measure of central tendency is most appropriate to describe the average performance of the class?",Mean,Median,Mode,Range,B,The median is less affected by extreme values or skewness and better represents the central tendency of the data.
"You have a dataset with the following five numbers: [10, 12, 14, 18, 100]. Which value would most likely be considered an outlier using the IQR method?",10,12,18,100,D,The value 100 is far from the rest of the data. The IQR method would likely flag this as an outlier.
"In a dataset where most values are clustered around a central point but there are a few extreme outliers, which measure of spread should you use?",Range,Standard Deviation,Interquartile Range (IQR),Mean Absolute Deviation,C,The IQR is largely unaffected by outliers and gives a better representation of spread when the data has extreme values.
"A real estate analyst is comparing house prices in two neighborhoods. Neighborhood A has a median price of $200,000 and an IQR of $50,000, while Neighborhood B has a median price of $300,000 and an IQR of $100,000. What can you infer about the variability in house prices?",House prices in Neighborhood A are more spread out.,House prices in Neighborhood B are more spread out.,House prices have the same variability in both neighborhoods.,House prices are skewed in both neighborhoods.,B,"Neighborhood B has a larger IQR, indicating more variability in house prices."
"When using a box plot to compare the performance of three investment portfolios, what would a longer box in one portfolio indicate compared to the others?",The portfolio has higher average returns.,The portfolio has a wider spread in returns.,The portfolio has fewer outliers.,The portfolio has a skewed distribution.,B,"A longer box indicates a wider spread, meaning more variability in returns for that portfolio."
"A set of data has a mean of 50 and a standard deviation of 5. If a data point is 70, how many standard deviations away from the mean is it?",2,3,4,5,C,The data point is (70 - 50) / 5 = 4 standard deviations from the mean.
Why would you choose the median over the mean to describe a dataset of employee salaries at a company?,Because the median considers every salary equally.,Because the median is less affected by extremely high or low salaries.,Because the median is the arithmetic average.,Because the median shows the total sum of all salaries.,B,"The median is less influenced by outliers, making it a better measure when salaries have extreme values."
"If the whiskers of a box plot are very unequal in length, what does this indicate about the data distribution?",The data is normally distributed.,The data has no outliers.,The data is skewed.,The data is uniformly distributed.,C,"Unequal whisker lengths indicate that the data is skewed, either to the right or left."
"In a financial report, a company's daily stock returns are analyzed. Most returns are between -1% and +1%, but there are a few days with returns of -10% and +15%. Which measure of spread would best summarize the variability?",Range,Standard Deviation,IQR,Mean,C,"The IQR would provide a more robust measure of variability, as it is less affected by the extreme returns."
"A scatter plot shows a clear upward trend between years of experience and salary. However, there are a few data points where salaries are much lower than expected given the experience. What should you do next?",Ignore the low salaries.,Investigate these outliers to understand if there are special circumstances.,Assume there is no relationship between experience and salary.,Replace these salaries with the mean value.,B,Investigating outliers helps understand if they are due to errors or have meaningful explanations.
A dataset is normally distributed with a mean of 100 and a standard deviation of 15. What percentage of data falls within one standard deviation of the mean?,50%,68%,95%,99.7%,B,"In a normal distribution, approximately 68% of the data lies within one standard deviation of the mean."
"If you have a dataset with extreme outliers, what effect do these outliers have on the mean compared to the median?",The mean is more affected than the median.,The mean is less affected than the median.,Both the mean and median are equally affected.,The outliers have no effect on either the mean or the median.,A,"The mean is more sensitive to extreme values, while the median remains relatively stable."
You are analyzing income data for a large city and notice a right-skewed distribution. What does this imply about the mean and median?,The mean is less than the median.,The mean is equal to the median.,The mean is greater than the median.,The mean and median cannot be compared.,C,"In a right-skewed distribution, the mean is pulled in the direction of the skew, making it larger than the median."
"When analyzing a dataset, you find that the IQR is 20 and the mean is 100. If a value is 200, is this an outlier based on the IQR method?",Yes,No,Cannot determine without more information,It depends on the range,C,You need Q1 and Q3 to calculate the exact outlier boundaries using the IQR method.
A box plot of monthly sales shows several outliers at the high end. What might this suggest about the company's sales strategy or performance?,Consistent and predictable sales,A few months had significantly higher sales than usual,Sales are declining overall,No unusual sales activity,B,"High outliers suggest some months had unusually high sales, which could be due to special promotions or market trends."
"You are comparing two datasets using box plots. If one box plot has a much larger IQR than the other, what does this imply?",The data points in the first dataset are more concentrated.,The data points in the first dataset are more spread out.,The medians of both datasets are equal.,Both datasets have the same variability.,B,A larger IQR indicates that the data points are more spread out.
What does it mean if a dataset has a negative skew?,Most data points are on the higher end with a few low outliers.,Most data points are on the lower end with a few high outliers.,The data is perfectly symmetrical.,The mean and median are equal.,A,"A negative skew indicates a long tail on the lower end, with most data points being higher."
"A data analyst uses the IQR method to identify outliers. If the lower boundary is -5 and the upper boundary is 20, which of the following values is an outlier?",0,15,25,10,C,The value 25 is above the upper boundary of 20 and would be considered an outlier.
Why might you choose a scatter plot over a box plot when analyzing a dataset with two continuous variables?,To compare the spread of a single variable,To identify relationships or correlations between two variables,To display the median and quartiles,To identify outliers within one variable,B,A scatter plot shows how two continuous variables are related and can reveal trends or correlations.
"When examining a box plot, what does a line in the middle of the box represent?",The mean of the dataset,The mode of the dataset,The median of the dataset,The IQR of the dataset,C,"The line in the middle of the box represents the median, which divides the dataset into two equal parts."
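
Worked check (not part of the CSV data): several items above rely on the IQR outlier rule and on counting standard deviations from the mean. The sketch below shows one way to verify those calculations. It assumes the common 1.5 x IQR fence convention and Python's "inclusive" quartile method; the quiz does not say which quartile convention it intends, and other conventions give slightly different fences. The function names iqr_fences and z_score are hypothetical helpers introduced for illustration.

```python
from statistics import quantiles


def iqr_fences(values, k=1.5):
    """Return (lower, upper) outlier fences using the k * IQR rule.

    Assumption: quartiles use the 'inclusive' method; other quartile
    conventions can shift the fences.
    """
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


def z_score(x, mean, std_dev):
    """Number of standard deviations that x lies from the mean."""
    return (x - mean) / std_dev


# Five-number dataset from the outlier question.
scores = [10, 12, 14, 18, 100]
lower, upper = iqr_fences(scores)
outliers = [v for v in scores if v < lower or v > upper]
print(lower, upper, outliers)

# Mean 50, standard deviation 5, data point 70.
print(z_score(70, 50, 5))
```

Under these assumptions the fences for the five-number dataset are 3.0 and 27.0, so only 100 is flagged, and the data point 70 lies (70 - 50) / 5 = 4 standard deviations above the mean.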