Adrian F. Clark
When evaluating vision systems, it is normal to:
have different training and test sets
You are developing a automatic passport system for use by immigration, where pictures of people are compared to those in their passports. Which of the following is the best approach to take?
minimize the number of false positives
Which test is most appropriate for comparing algorithms' performances?
What do we do if we want to see if one algorithm's performance is better than another's ?
Look up the Z-score in one-tailed tables
What assumption underlies a null hypothesis test?
that there is no performance difference between algorithms
What is `ground truth'?
data known to be correct
You are developing software for the Police to show mugshots of suspects to the witness of a crime. Which of the following is the best approach to take?
maximize the number of true positives, even if the false positive rate is high
Which corner of a ROC curve indicates the best performance?
What are the axes of a ROC curve?
TP and FP
What do we do if we want to see if algorithms' performances differ?
Look up the Z-score in two-tailed tables