A system for identifying broken custard cream biscuits looks for rectangular regions that are aligned with the edges of images. Why might this be a bad idea?
If the biscuit is not aligned with the edges, its axis-aligned bounding box will be much larger than an oriented bounding box. The biscuit then fills only part of the box, so an intact biscuit might be incorrectly identified as broken (see figure 5.10 in the lecture notes).
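The effect can be illustrated with a minimal sketch. The biscuit dimensions and the helper names `rotated_rectangle` and `aabb_area` are assumptions made for illustration only; they do not come from the lecture notes.

```python
import math

def rotated_rectangle(width, height, angle_deg):
    """Corners of a width x height rectangle rotated about its centre."""
    a = math.radians(angle_deg)
    c, s = math.cos(a), math.sin(a)
    corners = [(-width / 2, -height / 2), (width / 2, -height / 2),
               (width / 2, height / 2), (-width / 2, height / 2)]
    return [(x * c - y * s, x * s + y * c) for x, y in corners]

def aabb_area(points):
    """Area of the axis-aligned bounding box enclosing the points."""
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (max(xs) - min(xs)) * (max(ys) - min(ys))

# Hypothetical biscuit size in pixels.
biscuit_w, biscuit_h = 60.0, 40.0
biscuit_area = biscuit_w * biscuit_h

for angle in (0, 15, 30, 45):
    box = aabb_area(rotated_rectangle(biscuit_w, biscuit_h, angle))
    # Fraction of the axis-aligned box that an intact biscuit fills.
    print(angle, round(biscuit_area / box, 3))
```

For an intact biscuit the fill fraction is 1 when it is aligned with the image edges and falls well below 1 as it rotates, which is the same symptom a broken biscuit would produce.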
An image of a room contains a computer with a display. What feature or features would help you detect the display?
The best approach would be to combine rectangularity (to detect the rectangular feature) and aspect ratio, as computer monitors typically have a 4:3 or 16:9 aspect ratio. SIFT would be poor and corners do not carry enough information in this context.
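As a rough sketch of how the two features might be combined, the function below accepts a region only if it fills most of its bounding box and its aspect ratio is close to 4:3 or 16:9. The function name, the thresholds and the tolerance are all assumptions for illustration and would need tuning on real images.

```python
def looks_like_display(region_area, box_width, box_height,
                       min_rectangularity=0.9, ratio_tolerance=0.1):
    """Combine rectangularity and aspect ratio (thresholds are hypothetical)."""
    # Rectangularity: fraction of the bounding box covered by the region.
    rectangularity = region_area / (box_width * box_height)
    # Use long side / short side so portrait displays are treated alike.
    aspect = max(box_width, box_height) / min(box_width, box_height)
    matches_ratio = any(abs(aspect - target) / target <= ratio_tolerance
                        for target in (4 / 3, 16 / 9))
    return rectangularity >= min_rectangularity and matches_ratio

print(looks_like_display(1900 * 1060, 1920, 1080))  # nearly full 16:9 box
print(looks_like_display(9500, 100, 100))           # square region
```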
Which of the following is a sensible region descriptor?
A region descriptor attempts to encapsulate some characteristic of a region in a number, so anything that does not do this is at best ineffective and at worst just pointless! The location of a region in an image is not a good way of describing its properties, nor is the colour of the pixels outside it. The value of the centre pixel might be a descriptor but in practice it is pretty useless.
Why is a recursive region labelling algorithm poor in practice?
Recursive implementations of any algorithm save state on the program's stack. Recursive region-labelling algorithms make one recursive call for each pixel in a region, so if a region contains many pixels, stack overflow is likely.
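One common way around the problem is to keep the pending pixels in an explicit stack on the heap rather than on the call stack. The sketch below assumes a binary image stored as a list of lists and 4-connectivity; the function name `label_regions` is illustrative.

```python
def label_regions(image):
    """Label 4-connected regions of non-zero pixels without recursion."""
    rows, cols = len(image), len(image[0])
    labels = [[0] * cols for _ in range(rows)]
    next_label = 0
    for r in range(rows):
        for c in range(cols):
            if image[r][c] == 0 or labels[r][c] != 0:
                continue  # background, or already labelled
            next_label += 1
            labels[r][c] = next_label
            stack = [(r, c)]  # explicit stack replaces the recursive calls
            while stack:
                y, x = stack.pop()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and image[ny][nx] != 0 and labels[ny][nx] == 0):
                        labels[ny][nx] = next_label
                        stack.append((ny, nx))
    return labels, next_label

# A single 200 x 200 region has 40,000 pixels, far more than the default
# CPython recursion limit (typically 1000 frames) would allow.
big = [[1] * 200 for _ in range(200)]
print(label_regions(big)[1])
```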
Why is simple thresholding not especially effective at locating light features in an image?
The main problem with feature detection using thresholding is that changes in illumination can cause (say) a white object to appear grey, so a fixed threshold no longer separates it from the background.
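A toy example shows the failure. The pixel values, the threshold of 200 and the halving of brightness are all made up for illustration.

```python
def threshold(pixels, t):
    """Mark pixels brighter than t as feature (1), the rest as background (0)."""
    return [1 if p > t else 0 for p in pixels]

# Hypothetical scan line: dark background with three bright feature pixels.
scene = [20, 30, 240, 250, 25, 245]
# The same scene under half the illumination.
dim_scene = [p // 2 for p in scene]

print(threshold(scene, 200))      # bright pixels are found
print(threshold(dim_scene, 200))  # the same pixels are now missed
```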
In a real-time implementation of the Sobel operator, it is important to keep the number of multiplications as small as possible. What is the smallest number of multiplications that are required to convolve each image region with one of the Sobel masks?
Multiplication of any number by zero is obviously zero, and multiplication by unity just yields the number. Multiplication by two can be done (for integer operands) by a bitwise left-shift, so convolution with the Sobel masks does not involve any multiplications at all.
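A minimal sketch of the idea, assuming the usual horizontal-gradient Sobel mask with rows (-1, 0, 1), (-2, 0, 2), (-1, 0, 1) and an integer image stored as a list of lists. The function names are illustrative, and the multiplying version is included only as a check.

```python
def sobel_x(image, r, c):
    """Horizontal Sobel response at (r, c) using only add, subtract and shift."""
    top = image[r - 1][c + 1] - image[r - 1][c - 1]
    mid = image[r][c + 1] - image[r][c - 1]
    bot = image[r + 1][c + 1] - image[r + 1][c - 1]
    # The centre column of the mask is zero, so it is skipped entirely;
    # the weight of 2 on the middle row becomes a left shift by one bit.
    return top + (mid << 1) + bot

def sobel_x_reference(image, r, c):
    """The same response computed with explicit multiplications."""
    mask = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
    return sum(mask[i][j] * image[r - 1 + i][c - 1 + j]
               for i in range(3) for j in range(3))

edge = [[0, 0, 10],
        [0, 0, 10],
        [0, 0, 10]]
print(sobel_x(edge, 1, 1), sobel_x_reference(edge, 1, 1))
```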
Which is the easiest way to identify broken digestive biscuits on a production line?
Digestive biscuits are normally circular, which means their circularity (the ratio of the square of the circumference to the area) is $4\pi$. Every other shape has a circularity greater than this, so a broken biscuit gives a value above $4\pi$.
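The figure of $4\pi$ can be checked numerically. The sketch below uses ideal geometric perimeters and areas for a whole biscuit and for one snapped cleanly in half; the radius is an arbitrary assumption, and perimeters measured from pixels would be noisier, so a practical test would need some tolerance above $4\pi$.

```python
import math

def circularity(perimeter, area):
    """Square of the perimeter divided by the area."""
    return perimeter ** 2 / area

r = 30.0  # hypothetical biscuit radius in pixels

# Whole biscuit: a circle.
whole = circularity(2 * math.pi * r, math.pi * r ** 2)
# Biscuit broken in half: a semicircle (arc plus the straight break).
half = circularity(math.pi * r + 2 * r, math.pi * r ** 2 / 2)

print(round(whole, 3), round(4 * math.pi, 3))
print(round(half, 3))
```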
If a histogram has two peaks, where is the best place to put a threshold to separate foreground from background?
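In general, the best place to put a threshold is at the bottom of the dip between the two peaks. This is essentially where Otsu's method places it: it chooses the threshold that maximises the variance between the two classes, which falls in the dip when the peaks are well separated.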
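For reference, a minimal sketch of Otsu's method on a grey-level histogram follows. It picks the threshold that maximises the between-class variance, which for two well-separated peaks falls in the dip between them. The function name and the 16-bin histogram are illustrative assumptions.

```python
def otsu_threshold(hist):
    """Return t maximising between-class variance; class 0 is levels <= t."""
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    w0 = 0        # number of pixels at or below the candidate threshold
    sum0 = 0      # sum of their grey levels
    best_t, best_var = 0, -1.0
    for t, count in enumerate(hist):
        w0 += count
        sum0 += t * count
        w1 = total - w0
        if w0 == 0:
            continue  # no pixels in the lower class yet
        if w1 == 0:
            break     # no pixels left for the upper class
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        between = w0 * w1 * (mu0 - mu1) ** 2
        if between > best_var:
            best_t, best_var = t, between
    return best_t

# Hypothetical bimodal histogram with peaks at levels 3 and 12.
hist = [0] * 16
hist[2], hist[3], hist[4] = 10, 20, 10
hist[11], hist[12], hist[13] = 10, 20, 10
print(otsu_threshold(hist))
```

In this example every threshold in the empty gap between the peaks gives the same variance, and the sketch returns the first of them.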
What are grey-level co-occurrence matrices?
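GLCMs are scattergrams (2D histograms) computed for two regions of a single image separated by a particular shift, so each entry counts how often one grey level occurs at that shift from another. High values near the leading diagonal indicate similarity.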
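A minimal sketch of the computation, assuming an image stored as a list of lists of integer grey levels in the range 0 to `levels - 1`. The function name `glcm` and the small test image are illustrative.

```python
def glcm(image, dy, dx, levels):
    """Count pairs (image[r][c], image[r + dy][c + dx]) over the image."""
    matrix = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dy, c + dx
            # Only count pairs where the shifted pixel lies inside the image.
            if 0 <= r2 < rows and 0 <= c2 < cols:
                matrix[image[r][c]][image[r2][c2]] += 1
    return matrix

image = [[0, 0, 1, 1],
         [0, 0, 1, 1],
         [0, 2, 2, 2],
         [2, 2, 3, 3]]

# Shift of one pixel to the right.
for row in glcm(image, 0, 1, 4):
    print(row)
```

Most of the counts lie on the diagonal here because neighbouring pixels in this small image usually share a grey level.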
In the broken biscuit identifier explored in lectures, what is its most serious problem?
- it doesn't identify overlapping biscuits
- non-uniform lighting
- inadequate testing
- it doesn't identify biscuits that aren't aligned with the image edges
The most significant shortcoming is in thresholding the biscuits from the background because of non-uniform lighting, even though your lecturer went to some effort to make it as uniform as possible.