Adrian F. Clark
When evaluating vision systems, it is normal to:
have different training and test sets
You are developing a automatic passport system for use by immigration, where pictures of people are compared to those in their passports. Which of the following is the best approach to take?
minimize the number of false positives
What is `ground truth'?
data known to be correct
What are the axes of a ROC curve?
TP and FP
What assumption underlies a null hypothesis test?
that there is no performance difference between algorithms
What is a false positive?
A true result from an algorithm that is incorrect
Which test is most appropriate for comparing algorithms' performances?
What is a false negative?
A false result from an algorithm that should have succeeded
You are developing software for the Police to show mugshots of suspects to the witness of a crime. Which of the following is the best approach to take?
maximize the number of true positives, even if the false positive rate is high
What do we do if we want to see if one algorithm's performance is better than another's ?
Look up the Z-score in one-tailed tables