Mastering the Stratified Approach for Machine Learning
40 Questions

Questions and Answers

What is the most popular regression metric used to describe the distance between predicted and actual values?

  • Mean Squared Logarithmic Error (MSLE)
  • Mean Square Error (MSE) (correct)
  • Mean Absolute Error (MAE)
  • Root Mean Square Error (RMSE)
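
The four metrics listed in this question can be computed side by side. The following is a minimal sketch in plain Python; the function name `regression_errors` and the sample values are illustrative assumptions, not part of the lesson.

```python
import math

def regression_errors(actual, predicted):
    """Return MSE, RMSE, MAE and MSLE for two equal-length sequences."""
    n = len(actual)
    residuals = [p - a for a, p in zip(actual, predicted)]
    mse = sum(r * r for r in residuals) / n
    mae = sum(abs(r) for r in residuals) / n
    # MSLE compares log(1 + value), so it assumes non-negative inputs.
    msle = sum((math.log1p(p) - math.log1p(a)) ** 2
               for a, p in zip(actual, predicted)) / n
    return {"mse": mse, "rmse": math.sqrt(mse), "mae": mae, "msle": msle}

# Illustrative values only.
scores = regression_errors([3.0, 5.0, 2.0], [2.0, 5.0, 4.0])
```

Because MSE squares each residual, the single miss of 2 contributes four times as much as the miss of 1, whereas MAE weights them proportionally.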

What is the purpose of using a histogram or KDE plot in regression?

  • To calculate Mean Directional Accuracy
  • To calculate Mean Absolute Scaled Error
  • To calculate Median Absolute Error (MedAE)
  • To visualize the distribution of errors (correct)

Why is it more difficult to make a correct assessment of a classification model?

  • Classification models do not have evaluation metrics
  • It is easier to make a correct assessment of a regression model
  • Classification models are not used in business outcomes
  • It requires more knowledge and abstract thinking (correct)

What is the symmetric version of Mean Absolute Percentage Error (MAPE)?

    sMAPE
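
Several sMAPE definitions are in circulation. The sketch below assumes one common variant, which divides each absolute error by the mean of the absolute actual and predicted values; the zero-denominator handling is also an assumption.

```python
def smape(actual, predicted):
    """Symmetric MAPE in percent, using the mean of |actual| and |predicted|."""
    total = 0.0
    for a, p in zip(actual, predicted):
        denom = (abs(a) + abs(p)) / 2
        # Assumed convention: a pair of zeros contributes no error.
        total += 0.0 if denom == 0 else abs(p - a) / denom
    return 100.0 * total / len(actual)

# Swapping actual and predicted leaves the score unchanged.
forward = smape([100.0], [110.0])
backward = smape([110.0], [100.0])
```

Plain MAPE would give 10% for the first call and about 9.09% for the second, which is the asymmetry this variant removes.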

    What are evaluation metrics in machine learning?

    Functions used to assess and monitor the quality of a model

    What property should a cost function satisfy in machine learning?

    Differentiability with respect to parameters

    What is the role of evaluation metrics in assessing model accuracy?

    To evaluate the performance of a model during training and testing

    Why do evaluation metrics not have to comply with restrictive mathematical properties?

    They are calculated after the estimator has already been fitted using a separate cost function

    What is the purpose of using evaluation metrics and plots dedicated to probabilities in classification tasks?

    To choose probability cut-off points in an informed and responsible way

    What is the Receiver Operating Characteristic Curve (ROC) used for?

    To plot the true positive rate (TPR) against the false positive rate (FPR) for every probability cut-off point
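
The construction of the curve can be sketched by sweeping the cut-off over every distinct score and recording one (FPR, TPR) pair each time. The function name `roc_points` and the sample labels and scores are illustrative assumptions; the sketch also assumes both classes are present.

```python
def roc_points(y_true, scores):
    """Return (FPR, TPR) pairs, one per distinct score used as a cut-off."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    # Start with a cut-off above every score: nothing is predicted positive.
    points = [(0.0, 0.0)]
    for cutoff in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= cutoff and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= cutoff and y == 0)
        points.append((fp / negatives, tp / positives))
    return points

points = roc_points([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
# points == [(0.0, 0.0), (0.0, 0.5), (0.5, 0.5), (0.5, 1.0), (1.0, 1.0)]
```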

    What is the difference between AUC ROC and AUC PR?

    AUC ROC summarizes the trade-off between the true positive rate and the false positive rate, while AUC PR summarizes the trade-off between precision and recall

    What is log loss (also called cross-entropy) used for?

    To evaluate the predicted probabilities of a classification model
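
For the binary case, log loss averages the negative log-probability assigned to the true class. This is a minimal sketch; the clipping constant `eps` is an assumed safeguard against `log(0)`.

```python
import math

def log_loss(y_true, probs, eps=1e-15):
    """Binary log loss; probs are predicted probabilities of class 1."""
    total = 0.0
    for y, p in zip(y_true, probs):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)

confident = log_loss([1, 0], [0.9, 0.1])
hesitant = log_loss([1, 0], [0.6, 0.4])
```

Both sets of predictions give the same classes at a 0.5 cut-off, but the more confident correct probabilities receive the lower loss.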

    What is the purpose of using a stratified approach in a train-validation split?

    To ensure that relative class frequencies are approximately preserved
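
One way to preserve class frequencies is to split each class separately and then merge the pieces. The sketch below is an assumption-laden illustration in plain Python (the function name, the 25% validation share and the seed are all hypothetical), not a specific library's implementation.

```python
import random
from collections import defaultdict

def stratified_split(labels, validation_fraction=0.25, seed=0):
    """Return (train_indices, validation_indices), split class by class."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)
    rng = random.Random(seed)
    train, validation = [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_val = round(len(indices) * validation_fraction)
        validation.extend(indices[:n_val])
        train.extend(indices[n_val:])
    return sorted(train), sorted(validation)

# Illustrative imbalanced labels: 80% class 0, 20% class 1.
labels = [0] * 80 + [1] * 20
train_idx, val_idx = stratified_split(labels)
```

With these labels, class 1 makes up 20% of both the training part and the validation part, matching its share in the full data.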

    What is the purpose of creating several models independently on the train and validation data?

    To select one best model on the testing sample

    What is cross-validation (CV)?

    A technique for evaluating a machine learning model and testing its performance

    Why is cross-validation considered more robust than a single train-validation split?

    Because it uses different portions of the data to train and validate the model in different iterations
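
The rotation of training and validation portions can be sketched with a small K-fold index generator. It assumes contiguous, unshuffled folds; the function name `k_fold_indices` is illustrative.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, validation_indices) for k contiguous folds."""
    # Spread the remainder over the first folds so sizes differ by at most 1.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0)
                  for i in range(k)]
    start = 0
    for size in fold_sizes:
        validation = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, validation
        start += size

folds = list(k_fold_indices(10, 3))
```

Every sample lands in a validation fold exactly once, so the averaged score depends less on one lucky or unlucky split.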

    Which metric can be used as a single summary of the quality of a model's ROC curve?

    Area Under the Curve ROC (AUC ROC)

    What is the range of values that AUC ROC can take?

    0 to 1, where 0.5 corresponds to random ranking and 1 to perfect ranking

    Which type of classification tasks is ROC curve not well suited for?

    Imbalanced classification tasks

    What is the precision-recall curve used for?

    To combine precision and recall in a single visualization

    What is the interpretation of AUC ROC?

    The probability that a uniformly drawn random positive has a higher score than a uniformly drawn random negative
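
This interpretation can be checked directly by comparing every positive with every negative. The sketch below is quadratic in the sample size and meant for illustration only; counting ties as half a win is an assumed (though common) convention.

```python
def auc_by_pairs(y_true, scores):
    """Fraction of (positive, negative) pairs where the positive scores higher."""
    positives = [s for y, s in zip(y_true, scores) if y == 1]
    negatives = [s for y, s in zip(y_true, scores) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # assumed convention: ties count as half
    return wins / (len(positives) * len(negatives))

auc = auc_by_pairs([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
# Three of the four positive-negative pairs are ordered correctly: auc == 0.75
```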

    What is the purpose of AUC PR in highly imbalanced problems?

    To get one representative number for the whole model

    What is the difference between bias and variance of a model?

    Bias is the difference between the model's expected prediction and the correct value, and variance is the variability of the model's prediction for a given data point

    What is the bias/variance trade-off?

    The simpler the model, the higher the bias, and the more complex the model, the higher the variance

    What is the Continuous Ranked Probability Score (CRPS)?

    A metric that generalizes the mean absolute error (MAE) to the case of probabilistic forecasts
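
The link to MAE is easiest to see for an ensemble forecast. The sketch below assumes the forecast is given as a finite set of members and uses the standard ensemble estimator: mean distance to the observation minus half the mean distance between members. The function name is illustrative.

```python
def crps_ensemble(members, observed):
    """CRPS estimate for an ensemble forecast of a single observed value."""
    m = len(members)
    distance_to_obs = sum(abs(x - observed) for x in members) / m
    distance_within = sum(abs(a - b) for a in members for b in members) / (m * m)
    return distance_to_obs - 0.5 * distance_within

# A one-member ensemble is a point forecast, so CRPS equals the absolute error.
point_score = crps_ensemble([3.0], 5.0)
spread_score = crps_ensemble([4.0, 6.0], 5.0)
```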

    What is the Matthews Correlation Coefficient?

    A metric that measures the correlation between predicted classes and ground truth in binary classification
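
Computed from the four confusion-matrix counts, the coefficient can be sketched as follows. The function name `mcc` is illustrative, and returning 0 when a row or column of the matrix is empty is an assumed convention.

```python
import math

def mcc(tp, fp, fn, tn):
    """Matthews Correlation Coefficient from binary confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # Assumed convention: return 0 when any marginal total is zero.
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom

perfect = mcc(tp=5, fp=0, fn=0, tn=5)
inverted = mcc(tp=0, fp=5, fn=5, tn=0)
mixed = mcc(tp=2, fp=1, fn=1, tn=4)
```

The value runs from -1 (every prediction wrong) through 0 (no better than chance) to 1 (every prediction right).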

    What is the False Positive Rate?

    The proportion of actual negative observations that are incorrectly classified as positive

    What is the F beta score?

    A combination of precision and recall in one metric

    What is the True Positive Rate?

    The proportion of actual positive observations that are correctly classified as positive

    What is the Positive Predictive Value?

    The proportion of observations predicted as positive that are actually positive
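
The false positive rate, true positive rate, positive predictive value and F-beta score all derive from the same four confusion-matrix counts. The sketch below assumes binary labels coded 0/1 and that no denominator is zero; the function name and sample data are illustrative.

```python
def classification_rates(y_true, y_pred, beta=1.0):
    """Return TPR, FPR, PPV and F-beta for binary 0/1 labels."""
    tp = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 1)
    fp = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 1)
    fn = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 0)
    tn = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 0)
    tpr = tp / (tp + fn)  # recall: share of actual positives found
    fpr = fp / (fp + tn)  # share of actual negatives flagged as positive
    ppv = tp / (tp + fp)  # precision: share of predicted positives that are right
    # beta > 1 weights recall more heavily; beta < 1 weights precision more.
    f_beta = (1 + beta ** 2) * ppv * tpr / (beta ** 2 * ppv + tpr)
    return {"tpr": tpr, "fpr": fpr, "ppv": ppv, "f_beta": f_beta}

rates = classification_rates([1, 1, 1, 0, 0, 0, 0, 0],
                             [1, 1, 0, 1, 0, 0, 0, 0])
```

Here two of three positives are found (TPR 2/3), one of five negatives is flagged (FPR 0.2), and two of three positive predictions are right (PPV 2/3).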

    What is the purpose of using validation/cross validation in machine learning?

    To assess model quality in a quasi-objective way and to tune hyperparameters safely

    Which of the following is NOT a type of cross-validation discussed in the text?

    Leave-p-out

    What is a hyperparameter in machine learning?

    A parameter that controls the learning process and is set before training rather than estimated from the data

    Which type of cross-validation is most commonly used for cross-sectional problems?

    K-fold

    Why are there multiple types of cross-validation?

    To cater to the specifics of the data, business problem, dataset size, imbalance, and computing resources

    What is a learning curve?

    A plot of model learning performance over experience or time

    What is the purpose of a validation learning curve?

    To give an idea of how well the model is generalizing

    What is the bias/variance trade-off?

    The trade-off between overfitting and underfitting in a model

    What is the purpose of dividing a data set into training, validation, and testing sets?

    To avoid overfitting the model to the training data

    What is an imbalanced dataset?

    A dataset with unequal distribution of classes
