This page describes results obtained by tracking of the outer lip contour in image sequences from the The Extended M2VTS database (XM2VTS). The approach is described in the PhD thesis of Ulises Ramos [Postscript] (approximate size: 11Mb).
In the speech shot of the XM2VTS database, every subject was asked to read three sentences at normal pace, to pause briefly at the end of each sentence and to read through the three sentences twice. The first recording in each session is referred to as shot 1 whilst the second is referred to as shot 2. The sessions took place in approximately monthly intervals. The three sentences that remained the same in all four sessions were
Below are some lip tracking results obtained on the XM2VTS database. Click on the sequence ID to view a full tracked sequence in MPEG format [approximately 1~2MB]. Click on the image to see the initial position of contour control points obtained using colour analysis.
| 030[mpeg] | 047[mpeg] | 075[mpeg] | 140[mpeg] | 249[mpeg] | 330[mpeg] |
|---|---|---|---|---|---|
|
|
|
|
|
|