• Training images must be representative of
the instances
of objects to be recognized.
• The object must be well-framed.
•
• Positions and sizes must be controlled.
•
• Dimensionality
reduction is needed.
• It is still not powerful enough to handle general scenes
without prior segmentation into relevant objects.-- my comment
• Hybrid systems (features plus appearance) seem worth pursuing.