Talking Avatar

Create a lip-sync video from one image and one audio file

Upload a clear avatar image, add audio, and generate a simple talking video that is easy to preview and download.

Avatar image

JPG, PNG, or WEBP up to 10 MB.

Audio

Upload a file or record a take in your browser. Audio must be 15 seconds or shorter.

History

Your recent lip-sync videos appear here.

Sign in to start
AI lip sync

Create talking avatar videos from one image, one audio file, and a prompt

Upload a clear avatar image, add a short voice clip, and write a prompt that describes the delivery you want. The prompt should guide expression, gesture, emotion, and overall camera feel so the final talking video matches the tone of your message.

Read More Guides

Popular lip-sync use cases

Presenter-style explainers and walkthroughs

Turn a scripted voiceover and one avatar image into short explainers for products, tutorials, onboarding, and internal training.

Social promos and creator updates

Create quick speaking clips for announcements, launches, ad tests, and community updates without filming a new take.

Character and spokesperson messaging

Use a branded character, host image, or portrait to deliver repeatable messages across campaigns and content series.

What the lip-sync workflow includes

One avatar image upload and one audio upload in a simple workflow
Prompt input to guide gesture, facial expression, emotion, and speaking style
Fast task history with previews, status tracking, and direct downloads after sign-in
Built for short, consumer-facing talking videos that are easy to create and reuse

FAQ

Why is the prompt important for lip sync?

The prompt helps define how the avatar should perform. Use it to describe tone, emotion, delivery speed, facial expression, and visible gesture so the result feels closer to your intended message.

What kind of avatar image works best?

Use a clear image where the face is easy to read. Front-facing portraits, clean framing, and stable lighting usually give stronger results.

What kind of audio works best?

Clean spoken audio works best. Keep the clip short, avoid overlapping voices, and reduce background noise whenever possible.