Disease Prediction Analysis Quiz
5 Questions

Questions and Answers

Match the following terms with their definitions:

True Positive (TP) = Prediction is true, and it is true in reality
True Negative (TN) = Prediction is false, and it is false in reality
False Positive (FP) = Prediction is true, but it is false in reality
False Negative (FN) = Prediction is false, but it is true in reality

Match the following classification metrics with their formulas:

Accuracy = $(TP + TN) / (TP + TN + FP + FN)$
Precision = $TP / (TP + FP)$
Recall = $TP / (TP + FN)$
F1 Score = $2 \times (Precision \times Recall) / (Precision + Recall)$

Match the following outcomes with the respective counts from the email classification example:

True Positives = 50 emails correctly identified as spam
True Negatives = 40 emails correctly identified as not spam
False Positives = 10 emails incorrectly identified as spam
False Negatives = 5 emails incorrectly identified as not spam

Match the terms related to classification with their descriptions:

Confusion Matrix = A table used to describe the performance of a classification model
Precision = The ratio of true positive predictions to total positive predictions
Recall = The ratio of true positive predictions to total actual positives
Accuracy = The overall correctness of the model in predicting both classes

Match the following metrics to their intended evaluation:

Precision = Measures the quality of positive predictions
Recall = Measures the ability to find all positive instances
Accuracy = Provides the overall performance of the classification
F1 Score = Harmonic mean of precision and recall

Study Notes

Disease Prediction and Metrics

  • Prediction outcomes indicate disease presence: "Yes" means the patient has the disease; "No" means they do not.
  • Total predictions: 165; 110 predicted as "Yes", 55 as "No".
  • Actual cases: 105 patients have the disease, and 60 do not.
  • Four key terminologies in prediction outcomes:
    • True Positive (TP): Correctly predicted as having the disease.
    • True Negative (TN): Correctly predicted as not having the disease.
    • False Positive (FP): Incorrectly predicted as having the disease.
    • False Negative (FN): Incorrectly predicted as not having the disease.
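
A quick arithmetic check makes these terms concrete. The notes give only the marginal totals, not the four individual cell counts, so the values in this minimal Python sketch are one assumed assignment that is consistent with those totals:

```python
# Assumed cell counts, chosen to be consistent with the totals in the
# notes (165 predictions; 110 "Yes", 55 "No"; 105 actual "Yes", 60 "No").
# The notes do not give the individual cells, so these are illustrative.
TP, TN, FP, FN = 100, 50, 10, 5

assert TP + FP == 110               # predicted "Yes"
assert TN + FN == 55                # predicted "No"
assert TP + FN == 105               # actually have the disease
assert TN + FP == 60                # actually do not
assert TP + TN + FP + FN == 165     # total predictions

accuracy = (TP + TN) / (TP + TN + FP + FN)
print(f"accuracy = {accuracy:.3f}")  # 0.909
```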

Precision and Recall

  • Precision measures the accuracy of positive predictions: TP / (TP + FP).
  • Recall (or Sensitivity) measures the ability to identify actual positives: TP / (TP + FN).
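
A minimal sketch of both formulas in Python, reusing the assumed disease-prediction cells from the snippet above:

```python
def precision(tp, fp):
    """Fraction of positive predictions that were correct: TP / (TP + FP)."""
    return tp / (tp + fp)

def recall(tp, fn):
    """Fraction of actual positives that were found: TP / (TP + FN)."""
    return tp / (tp + fn)

# Assumed disease-prediction cells from the previous sketch.
print(precision(100, 10))  # 0.909...
print(recall(100, 5))      # 0.952...
```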

Classification Example in Spam Detection

  • Example results for spam detection:
    • True Positives (TP): 50 emails correctly identified as spam.
    • True Negatives (TN): 40 emails correctly identified as not spam.
    • False Positives (FP): 10 emails incorrectly identified as spam.
    • False Negatives (FN): 5 emails incorrectly identified as not spam.
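
Plugging these counts into the formulas above yields all four metrics; a short Python check:

```python
TP, TN, FP, FN = 50, 40, 10, 5  # counts from the spam example

accuracy = (TP + TN) / (TP + TN + FP + FN)
precision = TP / (TP + FP)
recall = TP / (TP + FN)
f1 = 2 * precision * recall / (precision + recall)

print(f"accuracy={accuracy:.3f} precision={precision:.3f} "
      f"recall={recall:.3f} f1={f1:.3f}")
# accuracy=0.857 precision=0.833 recall=0.909 f1=0.870
```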

Accuracy of Predictions

  • Accuracy provides an overall success rate: (TP + TN) / Total Predictions.
  • High accuracy can be misleading with imbalanced classes: if 90% of examples are class A and 10% are class B, a model that always predicts A scores 90% accuracy while never detecting a single B (see the sketch below).
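
A minimal illustration with hard-coded toy labels (assumed for this example):

```python
# 90 examples of class "A", 10 of class "B"; the model always predicts "A".
y_true = ["A"] * 90 + ["B"] * 10
y_pred = ["A"] * 100

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
recall_b = sum(t == p == "B" for t, p in zip(y_true, y_pred)) / 10

print(accuracy)  # 0.9 -- looks strong
print(recall_b)  # 0.0 -- but class B is never detected
```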

Confusion Matrix

  • A confusion matrix summarizes prediction performance for binary classifiers, illustrating actual vs. predicted outcomes.
  • Useful for identifying the classification errors of a binary model.
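
A sketch of building one with scikit-learn's confusion_matrix; the toy labels are assumed for illustration (rows are actual classes, columns are predicted):

```python
from sklearn.metrics import confusion_matrix

# Toy binary labels: 1 = positive class, 0 = negative class (assumed).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

# With labels=[0, 1] the layout is:
# [[TN, FP],
#  [FN, TP]]
cm = confusion_matrix(y_true, y_pred, labels=[0, 1])
tn, fp, fn, tp = cm.ravel()
print(cm)
print(f"TN={tn} FP={fp} FN={fn} TP={tp}")  # TN=3 FP=1 FN=1 TP=3
```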

Bias-Variance Tradeoff

  • Bias refers to error from oversimplified models, which can lead to underfitting.
  • Variance refers to error from overly complex models sensitive to training data, leading to overfitting.
  • Aim to balance bias and variance to minimize overall error.
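
One way to see the tradeoff: fit polynomials of increasing degree to noisy synthetic data (assumed here) and compare training and test error; a minimal NumPy sketch:

```python
import numpy as np

rng = np.random.default_rng(0)
true_fn = lambda x: np.sin(2 * np.pi * x)

x_train = np.sort(rng.uniform(0, 1, 20))
y_train = true_fn(x_train) + rng.normal(0, 0.2, x_train.size)
x_test = np.linspace(0, 1, 200)
y_test = true_fn(x_test)

for degree in (1, 4, 15):
    coeffs = np.polyfit(x_train, y_train, degree)
    train_mse = np.mean((np.polyval(coeffs, x_train) - y_train) ** 2)
    test_mse = np.mean((np.polyval(coeffs, x_test) - y_test) ** 2)
    print(f"degree={degree:2d}  train MSE={train_mse:.3f}  test MSE={test_mse:.3f}")

# Typical pattern: degree 1 underfits (high bias, both errors high);
# degree 15 overfits (high variance, low train error but higher test error).
```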

Hyperparameter Tuning

  • Hyperparameters must be defined prior to model training and significantly impact performance.
  • Methods for tuning include grid search, random search, and Bayesian optimization.
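
A minimal grid-search sketch with scikit-learn's GridSearchCV; the dataset is synthetic and the parameter grid is illustrative, not prescriptive:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVC

X, y = make_classification(n_samples=200, random_state=0)

# Hyperparameters are fixed before training; grid search simply
# cross-validates every combination and keeps the best one.
param_grid = {"C": [0.1, 1, 10], "kernel": ["linear", "rbf"]}
search = GridSearchCV(SVC(), param_grid, cv=5, scoring="f1")
search.fit(X, y)

print(search.best_params_, round(search.best_score_, 3))
```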

Model Comparison Criteria

  • Compare model performance using metrics such as accuracy and F1 score.
  • A simpler model is preferred when it performs comparably to a more complex one.
  • Consider computational efficiency regarding training and inference times.
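
A sketch of such a comparison, scoring a simple and a more complex model on the same cross-validated metric and timing each; the dataset and models are assumed for illustration:

```python
import time

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, random_state=0)

# If the F1 scores are close, the simpler, faster model wins.
for name, model in [("logistic", LogisticRegression(max_iter=1000)),
                    ("random forest", RandomForestClassifier(random_state=0))]:
    start = time.perf_counter()
    scores = cross_val_score(model, X, y, cv=5, scoring="f1")
    elapsed = time.perf_counter() - start
    print(f"{name:13s} mean F1={scores.mean():.3f}  time={elapsed:.2f}s")
```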

Ensemble Methods

  • Ensemble techniques enhance model robustness by combining multiple models:
    • Bagging: Trains many models of the same type on bootstrap samples of the data and aggregates their predictions, e.g., Random Forest.
    • Boosting: Trains models sequentially, with each model correcting the errors of the ones before it, e.g., AdaBoost.
    • Stacking: Combines different model types through a meta-learner to leverage their unique strengths.
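
A sketch comparing the three styles using scikit-learn's stock implementations on assumed synthetic data:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import (AdaBoostClassifier, RandomForestClassifier,
                              StackingClassifier)
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, random_state=0)

models = {
    "bagging (Random Forest)": RandomForestClassifier(random_state=0),
    "boosting (AdaBoost)": AdaBoostClassifier(random_state=0),
    "stacking (tree + logistic)": StackingClassifier(
        estimators=[("tree", DecisionTreeClassifier(random_state=0)),
                    ("logreg", LogisticRegression(max_iter=1000))],
        final_estimator=LogisticRegression()),
}

for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=5)
    print(f"{name:27s} mean accuracy={scores.mean():.3f}")
```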

Final Model Evaluation

  • The chosen model is assessed on a held-out test set for an unbiased performance estimate.
  • This final evaluation confirms the model's ability to generalize and the effectiveness of the selection process.
  • For binary classification (e.g., customer churn prediction), split data 70% for training, 30% for testing, and prioritize F1 score for evaluation due to class imbalance.
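
A minimal end-to-end sketch of that protocol; the churn-like imbalanced dataset is synthetic and the model choice is arbitrary:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split

# Assumed churn-like data: roughly 90% "stay" vs 10% "churn".
X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)

# 70% training, 30% testing, as described in the notes;
# stratify keeps the class ratio the same in both splits.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

model = RandomForestClassifier(random_state=0).fit(X_train, y_train)

# F1 is preferred over accuracy here because of the class imbalance.
print("test F1:", round(f1_score(y_test, model.predict(X_test)), 3))
```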

Description

Test your understanding of disease prediction concepts with this quiz. Evaluate how predictions are made regarding patient disease presence and explore the interpretation of prediction results. Gain insights into accuracy and case analysis.
