Machine Learning Prediction Problems Workflow

Business Question

Data Pre-processing

1. Label Data
2. Feature Selection

Model Selection

1. Logistic Regression
2. Linear Regression
3. Decision Tree
4. Random Forest
5. SVM:
	a. Linear
	b. Kernel
6. K-NN
7. Naïve Bayes Classifier
8. Neural Networks Classifier

Model Validation

Evaluation metrics

1. Accuracy
2. Precision	
3. Recall
4. F1 Score
5. Confusion matrix

Validation with imbalanced dataset

1. Oversampling
2. Undersampling

Validation Methods

1. K-fold Cross-validation
2. Random Split Data
3. Bootstrap Methods

Model Optimization

1. Tune model
2. Modify model