Image Processing: Speech-Driven Animation of a Talking Head

The talking-head sequence shown here is generated synthetically from a speech signal. The audio signal of a speaking person is recorded, and LPC (linear predictive coding) coefficients and an energy measure are computed from it. These parameters serve as input to a feedforward neural network that directly estimates facial animation parameters according to MPEG-4 syntax. To generate the video sequence, a 3-D head model is rendered using the estimated facial animation parameters. More details can be found in the paper.
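The audio-to-parameter part of this pipeline can be illustrated with a minimal sketch. It is not the implementation used for the sequence: the LPC order, the frame length, the use of mean squared amplitude as the energy measure, the network size, the number of outputs, and all function names (`lpc`, `features`, `make_layer`, `forward`, `estimate_faps`) are assumptions chosen for illustration, and the network weights are random rather than trained.

```python
import math
import random


def autocorrelation(frame, max_lag):
    """Autocorrelation of one audio frame for lags 0..max_lag."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def lpc(frame, order):
    """LPC coefficients a[1..order] via the Levinson-Durbin recursion,
    so that s[n] is approximated by sum_k a[k] * s[n - k]."""
    r = autocorrelation(frame, order)
    a = [0.0] * (order + 1)
    err = r[0]
    for i in range(1, order + 1):
        if err <= 1e-12:  # silent or perfectly predictable frame: stop early
            break
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        new_a = a[:]
        new_a[i] = k
        for j in range(1, i):
            new_a[j] = a[j] - k * a[i - j]
        a = new_a
        err *= 1.0 - k * k
    return a[1:]


def features(frame, order=10):
    """Feature vector per frame: LPC coefficients plus one energy value.
    Mean squared amplitude is an assumed choice of energy measure."""
    energy = sum(s * s for s in frame) / len(frame)
    return lpc(frame, order) + [energy]


def make_layer(n_in, n_out, rng):
    """Randomly initialised (untrained) weights and zero biases."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def forward(x, layers):
    """Feedforward pass: tanh hidden layers, linear output layer."""
    for idx, (weights, biases) in enumerate(layers):
        x = [sum(w * v for w, v in zip(row, x)) + b
             for row, b in zip(weights, biases)]
        if idx < len(layers) - 1:
            x = [math.tanh(v) for v in x]
    return x


def estimate_faps(frame, layers, order=10):
    """Map one audio frame to a vector of animation parameter values."""
    return forward(features(frame, order), layers)


# Synthetic stand-in for one recorded speech frame (30 ms at 8 kHz, assumed).
rng = random.Random(0)
frame = [math.sin(2 * math.pi * 200 * n / 8000) + 0.05 * rng.gauss(0, 1)
         for n in range(240)]

# 11 inputs (10 LPC coefficients + energy), 16 hidden units, 6 outputs.
# The output count is a hypothetical subset, not the full MPEG-4 parameter set.
layers = [make_layer(11, 16, rng), make_layer(16, 6, rng)]
faps = estimate_faps(frame, layers)
```

In a real system the network would be trained on pairs of audio features and measured facial animation parameters, and the per-frame outputs would then drive the 3-D head model renderer.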

One frame of the sequence.


Prof. Dr. Peter Eisert

Tel. +49 30 31002-614
Fax +49 30 3927-200