Introducing FantasyTalking, an artificial intelligence model for building talking spokesperson characters + Video

Chinese artificial intelligence researchers have unveiled an innovative model called FantasyTalking that can produce realistic, controllable videos of talking faces from a single static portrait image. The model is built on an advanced Video Diffusion Transformer architecture and uses audio-visual synchronization techniques to coordinate lip movements, facial expressions and body movements accurately with the input audio.

According to the project's GitHub page, the model uses a two-stage strategy to synchronize audio and video.

How FantasyTalking produces a talking spokesperson

In the first stage, clip-level training aligns the overall scene motion, including the face, surrounding objects and the background, with the input audio. In the second stage, lip movements are refined at the frame level using dedicated lip masks so that they match the audio precisely.
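The second stage can be pictured as a reconstruction loss that counts errors inside the lip region more heavily than errors elsewhere in the frame. The sketch below is an illustration only, not the project's training code: the function names, the `lip_weight` value and the use of a plain squared error on pixel values are assumptions made here for clarity.

```python
from typing import List

Frame = List[List[float]]  # one grayscale frame as rows of pixel values


def masked_frame_loss(pred: Frame, target: Frame, lip_mask: Frame,
                      lip_weight: float = 5.0) -> float:
    """Weighted mean squared error for a single frame.

    Pixels where lip_mask is 1.0 count lip_weight times as much as
    pixels where it is 0.0. lip_weight is a hypothetical knob chosen
    for this sketch, not a value taken from the project.
    """
    weighted_error = 0.0
    total_weight = 0.0
    for pred_row, target_row, mask_row in zip(pred, target, lip_mask):
        for p, t, m in zip(pred_row, target_row, mask_row):
            w = 1.0 + (lip_weight - 1.0) * m  # 1.0 outside lips, lip_weight inside
            weighted_error += w * (p - t) ** 2
            total_weight += w
    return weighted_error / total_weight


def clip_loss(pred_clip: List[Frame], target_clip: List[Frame],
              mask_clip: List[Frame], lip_weight: float = 5.0) -> float:
    """Average the per-frame masked loss over every frame in a clip."""
    losses = [
        masked_frame_loss(p, t, m, lip_weight)
        for p, t, m in zip(pred_clip, target_clip, mask_clip)
    ]
    return sum(losses) / len(losses)
```

With `lip_weight` set to 1.0 the function reduces to an ordinary mean squared error, so the mask only changes the result when the lip region is deliberately emphasized.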

One of the major challenges in computer graphics and computer vision has been generating animatable avatars from a static image. Most previous methods relied on intermediate 3D representations such as 3DMM or FLAME to maintain realism and audio synchronization, but these were ineffective at reproducing subtle facial movements and natural animation.

In the video below you can compare some of the results produced by this model with those of other models:

FantasyTalking also uses a dedicated module to control motion intensity, which allows the amount of facial and body animation to be adjusted. This feature makes it possible to produce videos that go beyond lip movement alone. Unlike many other models, the system uses a face-focused mechanism to preserve facial identity, which yields more natural and consistent results.

Other capabilities of this model include generating characters in different framings (close-up, half-body, full-body or angled) and support for different visual styles (realistic or cartoon), and even for animals.

Compared to closed-source, state-of-the-art methods such as OmniHuman-1, the FantasyTalking model offers higher quality in terms of realism, identity preservation, motion coherence and audio-visual alignment.

RCO NEWS
