Pomni

Visemes and Expression: How a Face Comes Alive

For an avatar's face to look alive, it's not enough to simply show it — it has to move in time with the words. That's the job of visemes and facial expression.

What visemes are

Visemes are mouth shapes that correspond to the sounds of speech. Putting them in the right order lets the avatar "speak" the words with its lips.

Syncing with the voice

Timings are extracted from the spoken audio, and the lip movement matches the voice exactly. A mismatch instantly gives away a fake, so synchronization matters.

Emotions

Beyond the lips, the face conveys the emotion of the answer — warm, sad, thoughtful. This makes the interaction feel alive rather than mechanical.

Why it matters

The brain is sensitive to the slightest mismatch between lips and sound — it immediately reads as "not real." Precise synchronization and lively expression remove that effect. That's why the image comes across as warm rather than eerie.

  • Visemes — mouth shapes for the sounds of speech.
  • Timings from the voice → synchronized lips.
  • Expression conveys the emotion of the answer.
  • Synchronization removes the "fake" effect.

Frequently asked questions

What are visemes in plain words?
They're lip positions for different sounds; arranged by the timings of speech, they let the avatar "speak" the words.
Why is syncing lips and voice important?
A desync immediately reads as fake; an exact match makes the image natural.

Save the story while it is with you

Create a memorial page in a few minutes — gently, beautifully and with respect for your loved ones. Free forever for the text version.

Create a memorial
Pomni editors

We help families gently preserve the memory of their loved ones. The materials are written with respect for the subject of loss and are regularly updated. About · Support resources

Read also