博彩评级网-博彩网_百家乐投资_全讯网新2

position: EnglishChannel  > News> Upload a Photo, Get a Video

Upload a Photo, Get a Video

Source: Science and Technology Daily | 2025-06-11 11:29:11 | Author: LI LInxu

The rapid developments in AI have unlocked new possibilities for digital representation. With the help of AI models, you can now achieve a remarkable feat: bringing characters to life with just an image and an audio clip.

Jointly developed by Tencent Hunyuan and Tencent Music, the newly released HunyuanVideo-Avatar, a multimodal diffusion transformer-based model, is capable of simultaneously generating dynamic, emotion-controllable, and multi-character dialogue videos. This capability supports head-and-shoulder, half-body, and full-body views, encompassing multiple styles, species, and even dual-character scenes.

To put it simply, you just upload a photo and a voice clip, and the model figures out the context, emotion and lip movements to create a realistic animated video.

For instance, if you upload an image of a woman sitting on a beach with a guitar, along with a piece of lyrical music,  the model understands the scene as "a woman playing the guitar and singing a lyrical song by the sea," and subsequently generates a video of the woman performing the song.

The model provides video creators with highly consistent and dynamic video generation capabilities. Its versatility can unlock a myriad of applications in fields like entertainment, media, e-commerce, advertising and education.

It has already been applied in multiple scenarios within Tencent Music, such as AI companions for music listening, long-form audio podcasts, and music videos (MVs).

For example, on the app QQ Music, when users listen to songs by "AI Leehom" (a fully AI-driven singer created by Tencent Music and Team Leehom), a lively and adorable AI Leehom image synchronizes its singing in real-time on the player.

On WeSing, a popular karaoke singing app, users can upload their images to generate personalized MVs of themselves singing.

In subject consistency and audio-video synchronization, the HunyuanVideo-Avatar shows top-tier industry performance. For video dynamics and natural body movements, it exceeds open-source solutions and rivals closed-source ones.

Currently, the model supports audio uploads of up to 14 seconds for video generation, with more capabilities to be released and open-sourced in the future.

Editor:李林旭

Top News

Energy Cooperation Gets New Direction

?Chinese President Xi Jinping sent a congratulatory message to the 7th China-Russia Energy Business Forum in Beijing on November 25, sparking enthusiastic responses from various sectors in both countries.

WEEKLY REVIEW (Dec.3-10)

Liang Wenfeng, founder and CEO of the Chinese AI firm DeepSeek, and "deep diver" Chinese geoscientist Du Mengran are on the annual "Nature's 10" list, which highlights 10 people at the heart of some of the biggest science stories of 2025.

抱歉,您使用的瀏覽器版本過低或開啟了瀏覽器兼容模式,這會影響您正常瀏覽本網頁

您可以進行以下操作:

1.將瀏覽器切換回極速模式

2.點擊下面圖標升級或更換您的瀏覽器

3.暫不升級,繼續瀏覽

繼續瀏覽
大发888官方 46| 乐宝百家乐官网的玩法技巧和规则 | 永利百家乐赌场娱乐网规则| 赌百家乐2号破解| 博e百| 百家乐真人投注网站| 大发888娱乐厂场| 喜达百家乐官网现金网| 破解百家乐游戏机| tt线上娱乐城| 风水学中的24向是什么| 大发888官方6222.com| 仕達屋百家乐官网的玩法技巧和规则 | 百家乐大眼仔用法| 百家乐官网拍是什么| 百家乐高人玩法| 巩留县| 网上百家乐赢钱公式| 百家乐官网注册开户| 百家乐六手变化混合赢家打法| 百家乐官网香港六合彩| 百家乐咨询网址| 网上百家乐官网如何作假| 百家乐官方游戏| 游戏机百家乐官网的技术| 大发在线德州扑克| 百家乐AG| 百家乐官网和| 百家乐官网增值公式| 百家乐专业术语| 风水24山分房图| 破解百家乐官网真人游戏| 尊龙国际网址| 威尼斯人娱乐城 老品牌| 百家乐官网板路| 芦山县| 稳赢至尊| 大发888娱乐手机版| 开百家乐骗人吗| 百家乐永利娱乐城| 百家乐官网专打方法|