Business Question
Data Pre-processing
1. Label Data
2. Feature Selection
Model Selection
1. Logistic Regression
2. Linear Regression
3. Decision Tree
4. Random Forest
5. SVM:
a. Linear
b. Kernel
6. K-NN
7. Naïve Bayes Classifier
8. Neural Networks Classifier
Model Validation
Evaluation metrics
1. Accuracy
2. Precision
3. Recall
4. F1 Score
5. Confusion matrix
Validation with imbalanced dataset
1. Oversampling
2. Undersampling
Validation Methods
1. K-fold Cross-validation
2. Random Split Data
3. Bootstrap Methods
Model Optimization
1. Tune model
2. Modify model