What is 'ground truth'?
- values obtained by an algorithm that are known to be true
- images of the ground
data known to be correct
- the true values obtained by an algorithm
Ground truth are data (usually images) for which the correct answer is known; they are used for training and testing algorithms.
Which test is most appropriate for comparing algorithms' performances?
- Canny's test
McNemar's test
- Gauss's test
- Laplace's test
McNemar's test is the most appopriate test for comparing algorithms: it is a chi-squared test with one degree of freedom for paired data.
Which corner of a ROC curve indicates the best performance?
- lower right
upper left
- upper right
- lower left
We want the smallest number of false positives for the largest number of true positives, so the best performance is the upper left corner of the plot.
When using McNemar's test, what do we do if we want to see whether algorithms' performances differ?
If algorithms' performances differ, it doesn't matter which is better than the other, so we use two-tailed tables.
You are developing a automatic passport system for use by immigration, where pictures of people are compared to those in their passports. Which of the following is the best approach to take?
For this type of application, we need to keep the number of false positives as low as possible; otherwise, we would admit lots of people who don't look like the picture on their passports.
When using McNemar's test, what do we do if we want to see whether one algorithm's performance is better than another's?
If we want to know that one algorithm's performance is better than another, we must use one-tailed tables.
What assumption underlies a null hypothesis test?
The null hypothesis test assumes that there is no difference in performance between algorithms, then examines whether the statistics support that assumption or provide evidence that is is wrong.
When evaluating vision systems, it is normal to:
We know that algorithms work better on the data they were trained on than on unseen data; hence, we use different training and test sets.
What is a false negative?
A false negative arises when an algorithm reports failure (or crashes) when it should have succeeded.
What is a false positive?
A false positive arises when an algorithm reports success but has actually found an incorrect result.