Automatic Selection of Optimal Views in Multi-View Object Recognition


Multi-view representation techniques for 3-D free-form objects have not yet successfully dealt with the following fundamental issues:

We propose a method for automatic selection of optimal views of a free-form 3-D object. In order to represent an object efficiently, we eliminate similar views and select a relatively small number of views using an optimisation algorithm. This number varies from 5 to 25 depending on the complexity of the object and the measure of expected accuracy.

Initially, a video camera is used to obtain a large number of views of the objects from all possible viewpoints. When an object is pictured from a large number of viewpoints, it is likely that some of the resulting images are similar and convey no additional information. As a result, an algorithm is required to identify the optimal number of images needed to represent an object. Using an arbitrary shape descriptor and its associated matching method, one can measure the similarity between two different images grabbed from a single object. The algorithm which selects the most suitable views according to an arbitrary representation is as follows.

At the end of this process, the set C will contain the full set of characteristic views of the input object determined automatically. This algorithm was tested on a set of 15 free-form 3-D objects. One view of each object of our database is shown in the figure below.

Initially, about 50 views of each object were grabbed. That initial set was then substantially reduced using the optimal view selection algorithm described above. The following figure shows a number of query results using arbitrary views of various objects in our database as input queries. The shape descriptor used in this experiment was the Curvature Scale Space representation, which has been selected for MPEG-7 standardization.

The value of t affects the number of views selected for each object. Generally speaking, a larger value of t results in a smaller number of views. This relationship has been shown in the following figure for the objects in our database:

Furthermore, generally speaking, a better success rate can be observed when a larger number of views are selected for each object. This relationship has been shown in the following figure for the objects in our database. Note that N indicates the number of observed outputs.

This work was supported by EPSRC Research Grant # GR/L76754/01. For further details about the optimal view selection algorithm, see the following publications:


F.Mokhtarian@ee.surrey.ac.uk
June 2001