Article

Dynamic, expressive speech animation from a single mesh

Authors:

Zoran PopovićAuthors Info & Claims

SCA '07: Proceedings of the 2007 ACM SIGGRAPH/Eurographics symposium on Computer animation

Pages 53 - 62

Published: 03 August 2007 Publication History

Abstract

In this work we present a method for human face animation which allows us to generate animations for a novel person given just a single mesh of their face. These animations can be of arbitrary text and may include emotional expressions. We build a multilinear model from data which encapsulates the variation in dynamic face motions over changes in identity, expression, and over different texts. We then describe a synthesis method consisting of a phoneme planning and a blending stage which uses this model as a base and attempts to preserve both face shape and dynamics given a novel text and an emotion at each point in time.

References

[1]

{BBVP03} Blanz V., Basso C., Vetter T., Poggio T.: Reanimating faces in images and video. In EUROGRAPHICS 2003 (EUROGRAPHICS-03): the European Association for Computer Graphics, 24th Annual Conference (Granada, Spain, 2003), Brunet P., Fellner D. W., (Eds.), vol. 22 of Computer Graphics Forum, The Eurographics Association, Blackwell, pp. 641--650.

[2]

{BCS97} Bregler C., Covell M., Slaney M.: Video rewrite: driving visual speech with audio. In SIGGRAPH '97: Proceedings of the 24th annual conference on Computer graphics and interactive techniques (New York, NY, USA, 1997), ACM Press/Addison-Wesley Publishing Co., pp. 353--360.

Digital Library

[3]

{Bra99} Brand M.: Voice puppetry. In SIGGRAPH '99: Proceedings of the 26th annual conference on Computer graphics and interactive techniques (New York, NY, USA, 1999), ACM Press/Addison-Wesley Publishing Co., pp. 21--28.

Digital Library

[4]

{BV99} Blanz V., Vetter T.: A morphable model for the synthesis of 3d faces. In SIGGRAPH '99: Proceedings of the 26th annual conference on Computer graphics and interactive techniques (New York, NY, USA, 1999), ACM Press/Addison-Wesley Publishing Co., pp. 187--194.

Digital Library

[5]

{Cam} Cambridge University Engineering Department: Hidden markov model toolkit. http://htk.eng.cam.ac.uk/.

[6]

{CB05} Chuang E., Bregler C.: Mood swings: expressive speech animation. ACM Trans. Graph. 24, 2 (2005), 331--347.

Digital Library

[7]

{CE05} Chang Y.-J., Ezzat T.: Transferable videorealistic speech animation. In SCA '05: Proceedings of the 2005 ACM SIGGRAPH/Eurographics symposium on Computer animation (New York, NY, USA, 2005), ACM Press, pp. 143--151.

Digital Library

[8]

{CTFP05} Cao Y., Tien W. C., Faloutsos P., Pighin F.: Expressive speech-driven facial animation. ACM Trans. Graph. 24, 4 (2005), 1283--1302.

Digital Library

[9]

{DLN05} Deng Z., Lewis J., Neumann U.: Synthesizing speech animation by learning compact speech co-articulation models. In Computer Graphics International (2005), pp. 19--25.

Digital Library

[10]

{EF78} Ekman P., Friesen W.: The facial action coding system: A technique for the measurement of facial movement, 1978.

[11]

{EGP02} Ezzat T., Geiger G., Poggio T.: Trainable videorealistic speech animation. In SIGGRAPH '02: Proceedings of the 29th annual conference on Computer graphics and interactive techniques (New York, NY, USA, 2002), ACM Press, pp. 388--398.

Digital Library

[12]

{JP98} Jones M. J., Poggio T.: Multidimensional morphable models: A framework for representing and matching object classes. Int. J. Comput. Vision 29, 2 (1998), 107--131.

Digital Library

[13]

{KB06} Kim T.-Y., Bulut M.: Expressive facial animation synthesis by learning speech coarticulation and expression spaces. IEEE Transactions on Visualization and Computer Graphics 12, 6 (2006), 1523--1534. Member-Zhigang Deng and Member-Ulrich Neumann and Member-J. P. Lewis and Senior Member-Shrikanth Narayanan.

Digital Library

[14]

{KP05} King M.-S. A., Parent M.-R. E.: Creating speech-synchronized animation. IEEE Transactions on Visualization and Computer Graphics 11, 3 (2005), 341--352.

Digital Library

[15]

{Lat97} Lathauwer L. D.: Signal Processing based on Multilinear Algebra. PhD thesis, Faculteit der Toegepaste Wetenschappen. Katholieke Universiteit Leuven, 1997.

[16]

{LTW95} Lee Y., Terzopoulos D., Walters K.: Realistic modeling for facial animation. In SIGGRAPH '95: Proceedings of the 22nd annual conference on Computer graphics and interactive techniques (New York, NY, USA, 1995), ACM Press, pp. 55--62.

Digital Library

[17]

{LZPW03} Levin A., Zomet A., Peleg S., Weiss Y.: Seamless image stitching in the gradient domain, 2003.

[18]

{MKPG05} Mueller P., Kalberer G. A., Proesmans M., Gool L. V.: Realistic speech animation based on observed 3d face dynamics. IEE Proc. Vision, Image and Signal Processing 152 (August 2005), 491--500.

[19]

{MZD05} Matusik W., Zwicker M., Durand F.: Texture design using a simplicial complex of morphable textures. In SIGGRAPH '05: ACM SIGGRAPH 2005 Papers (New York, NY, USA, 2005), ACM Press, pp. 787--794.

Digital Library

[20]

{Par82} Parke F. I.: Parameterized models for facial animation. j-IEEE-CGA 2, 9 (nov 1982), 61--64, 66--68.

Digital Library

[21]

{PL06} Pighin F., Lewis J. P.: Facial motion retargeting. In SIGGRAPH '06: ACM SIGGRAPH 2006 Courses (New York, NY, USA, 2006), ACM Press, p. 2.

Digital Library

[22]

{SSRMF06} Sifakis E., Selle A., Robinson-Mosher A., Fedkiw R.: Simulating speech with a physics-based facial muscle model. In SCA '06: Proceedings of the 2006 ACM SIGGRAPH/Eurographics symposium on Computer animation (Aire-la-Ville, Switzerland, Switzerland, 2006), Eurographics Association, pp. 261--270.

Digital Library

[23]

{VBPP05} Vlasic D., Brand M., Pfister H., Popović J. P.: Face transfer with multilinear models. ACM Trans. Graph. 24, 3 (2005), 426--433.

Digital Library

[24]

{VT02} Vasilescu M. A. O., Terzopoulos D.: Multilinear analysis of image ensembles: Tensorfaces. In ECCV '02: Proceedings of the 7th European Conference on Computer Vision-Part I (London, UK, 2002), Springer-Verlag, pp. 447--460.

Digital Library

[25]

{VT04} Vasilescu M. A. O., Terzopoulos D.: Ten-sortextures: multilinear image-based rendering. In SIGGRAPH '04: ACM SIGGRAPH 2004 Papers (New York, NY, USA, 2004), ACM Press, pp. 336--342.

Digital Library

[26]

{Wat87} Waters K.: A muscle model for animation three-dimensional facial expression. In SIGGRAPH '87: Proceedings of the 14th annual conference on Computer graphics and interactive techniques (New York, NY, USA, 1987), ACM Press, pp. 17--24.

Digital Library

[27]

{Wil90} Williams L.: Performance-driven facial animation. In SIGGRAPH '90: Proceedings of the 17th annual conference on Computer graphics and interactive techniques (New York, NY, USA, 1990), ACM Press, pp. 235--242.

Digital Library

[28]

{WWS*05} Wang H., Wu Q., Shi L., Yu Y., Ahuja N.: Out-of-core tensor approximation of multidimensional matrices of visual data. ACM Trans. Graph. 24, 3 (2005), 527--535.

Digital Library

[29]

{xCXH03} xiang Chai J., Xiao J., Hodgins J.: Vision-based control of 3d facial animation. In SCA '03: Proceedings of the 2003 ACM SIGGRAPH/Eurographics symposium on Computer animation (Aire-la-Ville, Switzerland, Switzerland, 2003), Eurographics Association, pp. 193--206.

Digital Library

[30]

{ZSCS04} Zhang L., Snavely N., Curless B., Seitz S. M.: Spacetime faces: high resolution capture for modeling and animation. ACM Trans. Graph. 23, 3 (2004), 548--558.

Digital Library

Cited By

Olszewski KLim JSaito SLi H(2016)High-fidelity facial and speech animation for VR HMDsACM Transactions on Graphics10.1145/2980179.298025235:6(1-14)Online publication date: 5-Dec-2016
https://dl.acm.org/doi/10.1145/2980179.2980252
Liu YXu FChai JTong XWang LHuo Q(2015)Video-audio driven real-time facial animationACM Transactions on Graphics10.1145/2816795.281812234:6(1-10)Online publication date: 2-Nov-2015
https://dl.acm.org/doi/10.1145/2816795.2818122
Taylor SMahler MTheobald BMatthews IBoulic RKomura T(2012)Dynamic units of visual speechProceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation10.5555/2422356.2422395(275-284)Online publication date: 29-Jul-2012
https://dl.acm.org/doi/10.5555/2422356.2422395
Show More Cited By

Index Terms

Dynamic, expressive speech animation from a single mesh
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Image and video acquisition
        3D imaging
  2. Computer graphics
    1. Animation

Recommendations

Mood swings: expressive speech animation

Motion capture-based facial animation has recently gained popularity in many applications, such as movies, video games, and human-computer interface designs. With the use of sophisticated facial motions from a human performer, animated characters are ...
Expressive Facial Animation Synthesis by Learning Speech Coarticulation and Expression Spaces

Synthesizing expressive facial animation is a very challenging topic within the graphics community. In this paper, we present an expressive facial animation synthesis system enabled by automated learning from facial motion capture data. Accurate 3D ...
Modeling Expressive Wrinkles of Face For Animation
ICIG '07: Proceedings of the Fourth International Conference on Image and Graphics

Vivid facial expressions contribute greatly to the visual realism of 3D face models. However, it is often tedious and expensive to model facial details which are authentic and real-time. In this paper, a simple and functional approach for modeling ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SCA '07: Proceedings of the 2007 ACM SIGGRAPH/Eurographics symposium on Computer animation

August 2007

287 pages

ISBN:9781595936240

Conference Chairs:
Michael Gleicher
University of Wisconsin - Madison, USA)
,
Daniel Thalmann
EPFL, Switzerland

Sponsors

SIGGRAPH: ACM Special Interest Group on Computer Graphics and Interactive Techniques
EUROGRAPHICS: The European Association for Computer Graphics

Publisher

Eurographics Association

Goslar, Germany

Publication History

Published: 03 August 2007

Check for updates

Qualifiers

Article

Conference

SCA07

Sponsor:

SIGGRAPH
EUROGRAPHICS

SCA07: The ACM SIGGRAPH / Eurographics Symposium on Computer Animation

August 2 - 4, 2007

California, San Diego

Acceptance Rates

SCA '07 Paper Acceptance Rate 28 of 81 submissions, 35%;

Overall Acceptance Rate 183 of 487 submissions, 38%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

6
Total Citations
View Citations
715
Total Downloads

Downloads (Last 12 months)2
Downloads (Last 6 weeks)0

Reflects downloads up to 14 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Olszewski KLim JSaito SLi H(2016)High-fidelity facial and speech animation for VR HMDsACM Transactions on Graphics10.1145/2980179.298025235:6(1-14)Online publication date: 5-Dec-2016
https://dl.acm.org/doi/10.1145/2980179.2980252
Liu YXu FChai JTong XWang LHuo Q(2015)Video-audio driven real-time facial animationACM Transactions on Graphics10.1145/2816795.281812234:6(1-10)Online publication date: 2-Nov-2015
https://dl.acm.org/doi/10.1145/2816795.2818122
Taylor SMahler MTheobald BMatthews IBoulic RKomura T(2012)Dynamic units of visual speechProceedings of the ACM SIGGRAPH/Eurographics Symposium on Computer Animation10.5555/2422356.2422395(275-284)Online publication date: 29-Jul-2012
https://dl.acm.org/doi/10.5555/2422356.2422395
Taylor SMahler MTheobald BMatthews I(2012)Dynamic units of visual speechProceedings of the 11th ACM SIGGRAPH / Eurographics conference on Computer Animation10.5555/2421731.2421770(275-284)Online publication date: 29-Jul-2012
https://dl.acm.org/doi/10.5555/2421731.2421770
Guan SChen YHuang FChen BZhang ZLi Z(2012)Lip-synced character speech animation with dominated animeme modelsSIGGRAPH Asia 2012 Technical Briefs10.1145/2407746.2407772(1-4)Online publication date: 28-Nov-2012
https://dl.acm.org/doi/10.1145/2407746.2407772
Deng ZMa XFiume ETessendorf JGross MJames D(2008)Perceptually guided expressive facial animationProceedings of the 2008 ACM SIGGRAPH/Eurographics Symposium on Computer Animation10.5555/1632592.1632603(67-76)Online publication date: 7-Jul-2008
https://dl.acm.org/doi/10.5555/1632592.1632603

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten