What are the axes of a ROC curve?
TP and FP
- TP and TN
- FP and FN
- TP and FN
An ROC curve plots the number of FPs against the number of TPs.
You are developing software for the police to show mugshots of suspects to the witness of a crime. Which of the following is the best approach to take?
For this type of application, we want to show anyone who stands any chance of being the perpetrator; it doesn't particularly matter if we have a high false positive rate as long as the true positive rate is high.
Which corner of a ROC curve indicates the best performance?
- lower right
- lower left
upper left
- upper right
We want the smallest number of false positives for the largest number of true positives, so the best performance is the upper left corner of the plot.
When using McNemar's test, what do we do if we want to see whether algorithms' performances differ?
If algorithms' performances differ, it doesn't matter which is better than the other, so we use two-tailed tables.
You are developing a automatic passport system for use by immigration, where pictures of people are compared to those in their passports. Which of the following is the best approach to take?
For this type of application, we need to keep the number of false positives as low as possible; otherwise, we would admit lots of people who don't look like the picture on their passports.
What is a false positive?
A false positive arises when an algorithm reports success but has actually found an incorrect result.
What assumption underlies a null hypothesis test?
The null hypothesis test assumes that there is no difference in performance between algorithms, then examines whether the statistics support that assumption or provide evidence that is is wrong.
What is a false negative?
A false negative arises when an algorithm reports failure (or crashes) when it should have succeeded.
What is 'ground truth'?
- values obtained by an algorithm that are known to be true
data known to be correct
- the true values obtained by an algorithm
- images of the ground
Ground truth are data (usually images) for which the correct answer is known; they are used for training and testing algorithms.
When evaluating vision systems, it is normal to:
We know that algorithms work better on the data they were trained on than on unseen data; hence, we use different training and test sets.