Home Technology peripherals AI MoDa community launches AI video generation tool Live Portait, which can make photos speak with one click

MoDa community launches AI video generation tool Live Portait, which can make photos speak with one click

Aug 19, 2023 pm 05:21 PM

Magic Community has launched an AI video generation tool called Live Portrait, which can make the characters in the photo speak with one-click operation

Alibaba Cloud has launched a digital human video generation tool called Live Portrait. Users only need to upload a photo and a text or voice to generate a talking digital human video. This tool can be used in many scenarios such as live video broadcasts, chat robots, and corporate marketing. Currently, this tool is open for experience in the Magic Community Creation Space

魔搭社区上线AI视频生成工具Live Portait,可一键让照片开口说话

With the popularity of self-conversation large models and AI painting models, the research community is gradually pushing the research on generative AI into more modalities, among which AI video generation technology has attracted much attention. This technology can convert information such as text or audio into facial movement information to generate animated photos with character images, effectively lowering the threshold for video shooting and production

Alibaba Cloud’s latest Live Portait tool combines the motion module and the generation module. This tool uses Alibaba Cloud's self-developed mouth shape prediction algorithm, which greatly improves the accuracy of mouth shape generation and is significantly improved compared to traditional methods. In the training stage, explicit control of posture is added, so that the generated video can show any action without the need for a baseboard video, thus greatly improving the realism of digital human speech. In addition, through active eye control technology, Live Portait can add natural movement to the eyeballs, making the generated results closer to real-life effects. According to reports, Live Portait related technologies have been included in top international AI conferences such as CVPR and ICCV

魔搭社区上线AI视频生成工具Live Portait,可一键让照片开口说话

According to information from the Magic Community, Live Portait provides two methods for users to choose from after uploading photos, namely text-driven and audio-driven. In text-driven mode, users can choose from 28 different voices, including Mandarin, English, Cantonese, and children's voices. In addition, Live Portait also provides lightweight model selection to help users generate videos faster

Zhang Bang, head of the tool’s algorithm, said: “Live Portait integrates a number of innovative technologies independently developed by the team, including the ability to generate realistic facial animations using a single picture, breaking through the limitations of traditional adversarial generation networks. With the continuous evolution of technology, image-generated videos have broad application prospects and are expected to become an important tool for enterprises to improve production efficiency and reduce costs."

It is understood that the team’s research directions include digital humans, 3D model AI generation, high-fidelity rendering and natural human-computer interaction. It has published more than 50 papers at top international conferences

The above is the detailed content of MoDa community launches AI video generation tool Live Portait, which can make photos speak with one click. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Top 5 GenAI Launches of February 2025: GPT-4.5, Grok-3 & More! Top 5 GenAI Launches of February 2025: GPT-4.5, Grok-3 & More! Mar 22, 2025 am 10:58 AM

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAI’s Grok 3 and Anthropic’s Claude 3.7 Sonnet, to OpenAI’s G

How to Use YOLO v12 for Object Detection? How to Use YOLO v12 for Object Detection? Mar 22, 2025 am 11:07 AM

YOLO (You Only Look Once) has been a leading real-time object detection framework, with each iteration improving upon the previous versions. The latest version YOLO v12 introduces advancements that significantly enhance accuracy

Best AI Art Generators (Free & Paid) for Creative Projects Best AI Art Generators (Free & Paid) for Creative Projects Apr 02, 2025 pm 06:10 PM

The article reviews top AI art generators, discussing their features, suitability for creative projects, and value. It highlights Midjourney as the best value for professionals and recommends DALL-E 2 for high-quality, customizable art.

Is ChatGPT 4 O available? Is ChatGPT 4 O available? Mar 28, 2025 pm 05:29 PM

ChatGPT 4 is currently available and widely used, demonstrating significant improvements in understanding context and generating coherent responses compared to its predecessors like ChatGPT 3.5. Future developments may include more personalized interactions and real-time data processing capabilities, further enhancing its potential for various applications.

Best AI Chatbots Compared (ChatGPT, Gemini, Claude & More) Best AI Chatbots Compared (ChatGPT, Gemini, Claude & More) Apr 02, 2025 pm 06:09 PM

The article compares top AI chatbots like ChatGPT, Gemini, and Claude, focusing on their unique features, customization options, and performance in natural language processing and reliability.

Getting Started With Meta Llama 3.2 - Analytics Vidhya Getting Started With Meta Llama 3.2 - Analytics Vidhya Apr 11, 2025 pm 12:04 PM

Meta's Llama 3.2: A Leap Forward in Multimodal and Mobile AI Meta recently unveiled Llama 3.2, a significant advancement in AI featuring powerful vision capabilities and lightweight text models optimized for mobile devices. Building on the success o

Top AI Writing Assistants to Boost Your Content Creation Top AI Writing Assistants to Boost Your Content Creation Apr 02, 2025 pm 06:11 PM

The article discusses top AI writing assistants like Grammarly, Jasper, Copy.ai, Writesonic, and Rytr, focusing on their unique features for content creation. It argues that Jasper excels in SEO optimization, while AI tools help maintain tone consist

Guide to Uber's H3 for Spatial Indexing Guide to Uber's H3 for Spatial Indexing Mar 22, 2025 am 10:54 AM

In today’s data-driven world, efficient geospatial indexing is crucial for applications ranging from ride-sharing and logistics to environmental monitoring and disaster response. Uber’s H3, a powerful open-source spat

See all articles