Visemes and Expression: How a Face Comes Alive

For an avatar's face to look alive, it's not enough to simply show it — it has to move in time with the words. That's the job of visemes and facial expression.

What visemes are

Visemes are mouth shapes that correspond to the sounds of speech. Putting them in the right order lets the avatar "speak" the words with its lips.
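As a rough illustration of the idea above, here is a minimal Python sketch that turns a sequence of speech sounds into an ordered list of mouth shapes. The table `PHONEME_TO_VISEME`, the shape names, and the function `phonemes_to_visemes` are all hypothetical and chosen only for this example; real avatar engines define their own, larger viseme sets.

```python
# Hypothetical phoneme-to-viseme table, for illustration only.
# Several sounds share one mouth shape, which is why there are
# fewer visemes than speech sounds.
PHONEME_TO_VISEME = {
    "p": "lips_closed", "b": "lips_closed", "m": "lips_closed",
    "f": "lip_to_teeth", "v": "lip_to_teeth",
    "a": "open_wide",
    "o": "rounded", "u": "rounded",
    "e": "spread", "i": "spread",
}


def phonemes_to_visemes(phonemes):
    """Map sounds to mouth shapes in order, merging consecutive repeats."""
    visemes = []
    for phoneme in phonemes:
        # Unknown sounds fall back to a neutral mouth shape.
        shape = PHONEME_TO_VISEME.get(phoneme, "neutral")
        if not visemes or visemes[-1] != shape:
            visemes.append(shape)
    return visemes


print(phonemes_to_visemes(["m", "a", "m", "a"]))
# ['lips_closed', 'open_wide', 'lips_closed', 'open_wide']
```

Because "p", "b" and "m" look the same on the lips, they map to a single shape here; the order of shapes is what makes the avatar appear to say the word.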

Syncing with the voice

The timing of each sound is extracted from the spoken audio, and each viseme is shown at the moment its sound is heard, so the lips move in step with the voice. Even a small mismatch makes the face look fake, which is why synchronization matters.
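One way to picture the synchronization step is a timeline of start times paired with mouth shapes, looked up at the current playback position. The sketch below assumes such a timeline already exists; the `TIMELINE` values, the shape names, and the function `viseme_at` are hypothetical, and how the timings are actually extracted from the audio is not shown.

```python
from bisect import bisect_right

# Hypothetical timeline: (start time in seconds, viseme name),
# sorted by start time. In practice these timings would come
# from analysing the spoken audio.
TIMELINE = [
    (0.00, "neutral"),
    (0.10, "lips_closed"),
    (0.18, "open_wide"),
    (0.35, "lips_closed"),
    (0.42, "open_wide"),
    (0.60, "neutral"),
]


def viseme_at(timeline, t):
    """Return the viseme that should be on screen at playback time t."""
    starts = [start for start, _ in timeline]
    # Index of the last entry whose start time is <= t.
    index = bisect_right(starts, t) - 1
    if index < 0:
        return "neutral"  # before the first entry
    return timeline[index][1]


print(viseme_at(TIMELINE, 0.20))  # open_wide
```

Calling `viseme_at` with the audio player's current time on every frame keeps the mouth shape tied to the sound rather than to the frame rate.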

Emotions

Beyond the lips, the face conveys the emotion of the answer — warm, sad, thoughtful. This makes the interaction feel alive rather than mechanical.
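A simple way to think about expression is as a second layer of facial weights added on top of the mouth shape. The following sketch is only an assumption about how such layering could work: the preset names mirror the emotions mentioned above, while the control names, the weights, and the function `face_pose` are invented for illustration.

```python
# Hypothetical expression presets: facial control name -> weight in [0, 1].
EXPRESSIONS = {
    "warm": {"smile": 0.6, "brow_raise": 0.2},
    "sad": {"brow_inner_up": 0.7, "mouth_corner_down": 0.5},
    "thoughtful": {"brow_furrow": 0.4, "eyes_narrow": 0.3},
}


def face_pose(viseme_weights, emotion, intensity=1.0):
    """Layer an emotion preset over the current mouth-shape weights."""
    pose = dict(viseme_weights)  # copy so the input is not modified
    for control, weight in EXPRESSIONS.get(emotion, {}).items():
        combined = pose.get(control, 0.0) + weight * intensity
        pose[control] = min(1.0, combined)  # keep weights within [0, 1]
    return pose


pose = face_pose({"jaw_open": 0.5}, "warm", intensity=0.5)
print(sorted(pose))  # ['brow_raise', 'jaw_open', 'smile']
```

Keeping the emotion in a separate layer means the lips can stay in sync with the voice while the rest of the face changes mood.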

Why it matters

The brain is sensitive to the slightest mismatch between lips and sound — it immediately reads as "not real." Precise synchronization and lively expression remove that effect. That's why the image comes across as warm rather than eerie.

Visemes — mouth shapes for the sounds of speech.
Timings from the voice → synchronized lips.
Expression conveys the emotion of the answer.
Synchronization removes the "fake" effect.

Frequently asked questions

What are visemes in plain words?

They're lip positions for different sounds; arranged by the timings of speech, they let the avatar "speak" the words.

Why is syncing lips and voice important?

Lips that are out of sync with the voice immediately read as fake; an exact match makes the face look natural.


