Full Transcript

Based on the gender distribution pie chart, what percentage of the students are male? A. 50% B. 40% C. 60% D. 80% ANSWER: A 2. From the histogram showing age distribution, what can be inferred about the ages of students? A. All students are the same age. B. Students are evenly distribu...

Based on the gender distribution pie chart, what percentage of the students are male? A. 50% B. 40% C. 60% D. 80% ANSWER: A 2. From the histogram showing age distribution, what can be inferred about the ages of students? A. All students are the same age. B. Students are evenly distributed between two age groups. C. Most students are younger than 14 years old. D. There are no students aged 15. ANSWER: B 3. In the box plot for hours studied by gender, which gender has an outlier for hours studied? A. Male B. Female C. Both D. Neither ANSWER: A 4. Based on the scatter plot of hours studied vs. scores, which of the following best describes the trend observed? A. There is no correlation between hours studied and scores. B. There is a negative correlation between hours studied and scores. C. There is a positive correlation between hours studied and scores. D. Scores decrease as hours studied increase. ANSWER: C 5. What does the regression plot of hours studied vs. scores with a shaded area represent? A. The exact scores of each student B. The average hours studied by all students C. The confidence interval for the regression line D. The difference in study habits between genders ANSWER: C 1. Which of the following preprocessing steps is used to handle missing values in a dataset? A. Dropping irrelevant variables B. Removing duplicates C. Filling missing values with the mean or median D. Converting categorical variables to numerical ANSWER: C 2. To remove columns that do not contribute to your analysis, which preprocessing step should you use? A. Imputing missing values B. Dropping irrelevant variables C. Scaling numerical features D. Encoding categorical variables ANSWER: B 3. When you encounter duplicate rows in your dataset, which action should you take? A. Drop irrelevant variables B. Remove duplicates C. Convert categorical variables D. Fill missing values ANSWER: B 4. Which method is commonly used to convert categorical variables into numerical values? A. Normalization B. Standardization C. One-hot encoding D. Imputation ANSWER: C 5. How can you read a specific column from a pandas DataFrame? A. df.read\_column('column\_name') B. df\['column\_name'\] C. df.drop('column\_name') D. df.loc\[:, 'column\_name'\] ANSWER: B 6. To get a summary of the dataset including count, mean, and standard deviation, which function do you use? A. df.info() B. df.describe() C. df.head() D. df.tail() ANSWER: B 7. Which method would you use to get the number of rows and columns in a DataFrame? A. df.shape B. df.size C. df.len D. df.dimensions ANSWER: A 8. What does the df.info() method provide? A. Summary statistics of the dataset B. The first few rows of the dataset C. The data types and non-null counts of each column D. The last few rows of the dataset ANSWER: C 9. How do you drop a column named 'age' from a DataFrame? A. df.remove\_column('age') B. df.drop('age', axis=0) C. df.drop('age', axis=1) D. df.delete('age') ANSWER: C 10. To read the first 5 rows of a DataFrame, which method would you use? A. df.head() B. df.tail() C. df.sample() D. df.first() ANSWER: A

Use Quizgecko on...
Browser
Browser