flowchart TD A(Machine learning) --> B(unsupervised learning) A --> C(supervised learning)
Definition
flowchart TD A(Machine learning) --> B(unsupervised learning) A --> C(supervised learning)
Definition
Outline
Common steps:
Outline
Common steps:
Definition
example of a classification algorithm
example of a classification algorithm
Algorithm
Why
train, validation & test sets
cross validation & repeated cross validation
Leave-one-out cross-validation
Correct (miss)classification rate
Missclassification Rate
Missclassification Rate = 1 - Correct Classification Rate
Confusion matrix
Confusion matrix allows us to compare between actual and predicted values. It is a N x N matrix, where N is the number of classes.
Predicted Positive | Predicted Negative | |
---|---|---|
Actual Positive | True Positive (TP) | False Negative (FN) |
Actual Negative | False Positive (FP) | True Negative (TN) |
Accuracy: measures the proportion of correctly classified samples over the total number of samples. \[ACC = \frac{TP+TN}{TP+TN+FP+FN}\]
Sensitivity: measures the proportion of true positives over all actual positive samples, i.e. how well the classifier is able to detect positive samples. It is also known as true positive rate and recall. \[TPR = \frac{TP}{TP + FN}\]
Specificity: measures the proportion of true negatives over all actual negative samples, i.e. how well the classifier is able to avoid false negatives. It is also known as true negative rate and selectivity. \[TNR = \frac{TN}{TN+FP}\]
Precision: measures the proportion of true positives over all positive predictions made by the classifier, i.e. how well the classifier is able to avoid false positives. It is also known as positive predictive value. \[PPV = \frac{TP}{TP + FP}\]
KNN model for classification