Google is the first to release video generation AIGC, netizens: you can customize movies

Apr 12, 2023 pm 08:10 PM

Advances in generative models and multimodal vision-language models have paved the way for large-scale text-to-image models with unprecedented realism and diversity. These models enable new creative processes, but they are limited to synthesizing new images rather than editing existing ones. To bridge this gap, intuitive text-based editing methods have been developed that can edit both generated and real images while preserving some of their original properties. As with images, many text-to-video models have been proposed recently, but few methods use these models for video editing.

In text-guided video editing, the user provides an input video along with a text prompt describing the desired properties of the output video, as shown in Figure 1 below. There are three goals: 1) alignment: the edited video should conform to the input text prompt; 2) fidelity: the edited video should preserve the content of the original video; 3) quality: the edited video should be of high quality.

Video editing is more challenging than image editing: it requires synthesizing new motion rather than just modifying visual appearance, and it must maintain temporal consistency. Applying image-level editing methods such as SDEdit and Prompt-to-Prompt frame by frame is therefore not enough to achieve good results.

In a paper recently published on arXiv, researchers from Google Research and other institutions propose a new method, Dreamix, which is inspired by UniTune and applies a text-conditioned video diffusion model (VDM) to video editing.

  • Paper address: https://arxiv.org/pdf/2302.01329.pdf
  • Project homepage: https://dreamix-video-editing.github.io/

The core of the method is to make a text-conditioned VDM maintain high fidelity to the input video through two main ideas. First, instead of using pure noise as the model initialization, it uses a degraded version of the original video, produced by downscaling and adding noise, that retains only low spatiotemporal information. Second, it further improves fidelity to the original video by fine-tuning the generative model on that video.

Fine-tuning ensures that the model understands the high-resolution properties of the original video. However, naively fine-tuning on the input video yields relatively low motion editability, because the model learns to prefer the original motion over following the text prompt. The researchers propose a novel mixed fine-tuning method in which the VDM is also fine-tuned on the set of individual frames of the input video, with their temporal order discarded. Mixed fine-tuning significantly improves the quality of motion edits.

The researchers further use their video editing model to propose a new image animation framework, as shown in Figure 2 below. The framework covers several tasks, such as animating objects and backgrounds in an image and creating dynamic camera motion. They first create a coarse video through simple image processing operations such as frame duplication or geometric image transformations, then edit it with the Dreamix video editor. In addition, the researchers use their fine-tuning method for subject-driven video generation, a video version of DreamBooth.

In the experiments, the researchers conducted an extensive qualitative study and a human evaluation that demonstrate the capabilities of their method, as shown in the animations below.

Commenting on this Google study, one reader suggested that 3D motion and editing tools might be a popular topic for the next wave of papers.

Another said that you will soon be able to make your own movie on a budget: all you need is a green screen and this technology.

Overview of Method

The paper proposes a new method for video editing, detailed as follows.

Text-guided video editing by inverting corruptions

They use cascaded VDMs (video diffusion models): the input video is first corrupted to some extent by downsampling and then adding noise. The cascaded diffusion models are then used for the sampling process, conditioned on the time t, to upscale the video to the final spatiotemporal resolution.

To corrupt the input video, it is first downsampled to the resolution of the base model (16 frames of 24 × 40), and then Gaussian noise of a set variance is added to corrupt it further.

Next, the cascaded VDMs map the corrupted low-resolution video to a high-resolution video that is aligned with the text. The core idea is that, given a noisy video of very low temporal and spatial resolution, there are many perfectly plausible high-resolution videos that correspond to it. The base model starts from the corrupted video, which has the same noise level as the diffusion process at time s. The VDM then reverses the diffusion process down to time 0. Finally, the video is upscaled by the super-resolution models.
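
The corruption step described above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the paper's implementation: the function name `corrupt_video`, the average-pooling downsampler, the pooling factors and the noise level `sigma` are all hypothetical, and a real diffusion forward process would typically also rescale the signal before adding noise.

```python
import numpy as np

def corrupt_video(video, t_factor, s_factor, sigma, rng):
    """Downsample a (T, H, W, C) video by average pooling, then add Gaussian noise.

    All parameters are illustrative assumptions, not values from the paper.
    """
    T, H, W, C = video.shape
    # Crop so each dimension divides evenly by its pooling factor.
    T2, H2, W2 = T // t_factor, H // s_factor, W // s_factor
    v = video[: T2 * t_factor, : H2 * s_factor, : W2 * s_factor]
    # Average-pool over time and space by reshaping into blocks.
    v = v.reshape(T2, t_factor, H2, s_factor, W2, s_factor, C).mean(axis=(1, 3, 5))
    # Additive i.i.d. Gaussian noise with standard deviation sigma.
    return v + sigma * rng.standard_normal(v.shape)

rng = np.random.default_rng(0)
video = rng.random((32, 96, 160, 3))  # hypothetical input clip
low = corrupt_video(video, t_factor=2, s_factor=4, sigma=0.5, rng=rng)
print(low.shape)  # (16, 24, 40, 3), the base-model resolution quoted above
```

A larger `sigma` discards more of the original detail, which in this sketch plays the role of trading fidelity for editability.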

Mixed video-image fine-tuning

Fine-tuning the video diffusion model on the input video alone limits how much object motion can change. Instead, the study uses a mixed objective: in addition to the original objective (bottom left), the model is also fine-tuned on the unordered set of frames. This is done with "masked temporal attention", which prevents the temporal attention and convolution layers from being fine-tuned (bottom right). This makes it possible to add motion to static videos.
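
To make the idea of masked temporal attention concrete, here is a minimal NumPy sketch. It assumes the mask restricts every frame to attend only to itself, so the temporal layer reduces to an identity mapping over an unordered frame set; the function `temporal_attention`, the single-head formulation and the feature sizes are hypothetical and are not taken from the paper.

```python
import numpy as np

def temporal_attention(q, k, v, mask_frames=False):
    """Single-head attention over the frame axis; q, k, v have shape (T, D)."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    if mask_frames:
        # Assumed masking: each frame may attend only to itself,
        # so every off-diagonal score is set to -inf.
        scores = np.where(np.eye(len(q), dtype=bool), scores, -np.inf)
    # Numerically stable softmax over the key axis.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

rng = np.random.default_rng(0)
frames = rng.standard_normal((16, 8))   # 16 frame features, hypothetical width 8
shuffled = frames[rng.permutation(16)]  # unordered set of frames
out = temporal_attention(shuffled, shuffled, shuffled, mask_frames=True)
print(np.allclose(out, shuffled))  # True
```

With the mask on, the output for each frame is independent of the other frames and of their order, which is why an unordered frame set can be used as training input in this sketch.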

Inference

Through application-dependent pre-processing (left in the figure below), the method supports multiple applications by converting the input content into a uniform video format. For image-to-video, the input image is duplicated and transformed, synthesizing a coarse video with some camera motion. For subject-driven video generation, the input is omitted, and fine-tuning alone maintains fidelity. The coarse video is then edited with the Dreamix Video Editor (right): as described earlier, the video is first corrupted by downsampling and adding noise, and the fine-tuned text-guided video diffusion model is then applied to upscale it to the final temporal and spatial resolution.
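
The image-to-video pre-processing can be illustrated with a small NumPy sketch that duplicates an image and applies a progressive center zoom to imitate camera motion. The zoom is a simple stand-in chosen for this example; the function `coarse_zoom_video`, its parameters and the nearest-neighbour resize are assumptions, not the transformation used by the researchers.

```python
import numpy as np

def coarse_zoom_video(image, num_frames=16, max_zoom=1.5):
    """Duplicate an (H, W, C) image into frames with a progressive center zoom."""
    H, W, _ = image.shape
    frames = []
    for i in range(num_frames):
        # Zoom grows linearly from 1.0 (first frame) to max_zoom (last frame).
        zoom = 1.0 + (max_zoom - 1.0) * i / max(num_frames - 1, 1)
        ch, cw = int(round(H / zoom)), int(round(W / zoom))
        top, left = (H - ch) // 2, (W - cw) // 2
        # Nearest-neighbour resize of the center crop back to (H, W).
        rows = top + np.arange(H) * ch // H
        cols = left + np.arange(W) * cw // W
        frames.append(image[rows][:, cols])
    return np.stack(frames)

image = np.random.default_rng(0).random((24, 40, 3))  # hypothetical input image
video = coarse_zoom_video(image)
print(video.shape)  # (16, 24, 40, 3)
```

The resulting clip is deliberately crude; in the pipeline described above it would only serve as the input that the video editor then corrupts and regenerates.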

Experimental results

Video editing: in the example below, Dreamix changes the action to dancing and the subject's appearance from a monkey to a bear, while the basic attributes of the subject in the video remain unchanged:

Dreamix can also generate smooth visual modifications that are consistent with the temporal information of the input video, such as the skateboarding deer shown below:

Image to video: when the input is a single image, Dreamix can use its video prior to add new moving objects. In the example below, a unicorn appears in a foggy forest while the view zooms in.

Penguins appear next to the hut:

Subject-driven video generation: Dreamix can also take a collection of images showing the same subject and generate a new video with that subject as a moving object. The example below shows a caterpillar wriggling on a leaf:

In addition to the qualitative analysis, the study also ran a baseline comparison, comparing Dreamix with two baseline methods: Imagen-Video and Plug-and-Play (PnP). The following table shows the rating results:

Figure 8 shows a video edited by Dreamix and by the two baselines. The text-to-video model produces low-fidelity edits because it is not conditioned on the original video. PnP preserves the scene but lacks consistency from frame to frame. Dreamix performs well on all three goals.

Please refer to the original paper for more technical details.
