Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?-AI-php.cn

Home

Technology peripherals

Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Sep 26, 2023 pm 01:57 PM

ai rating model chemistry

The "Battle of One Hundred Models" has recently added another participant. Following the launch of the large language model "Ruyi" of Wenshengwen last month, Kuaishou recently launched a self-developed large model in the field of "Wenshengtu". Figureable” (Kolors). As a short video platform, Kuaishou’s “Ketu” will naturally be used in its own App. Relying on the Ketu large model, Kuaishou has also begun to test the “AI play review” function in the short video comment area, trying to unlock the AIGC short video New ways to play.

Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?

It is reported that Kuaishou’s “AI Play Review” is the first time in the industry to apply AIGC capabilities in the comment area of the core business scenario of a large-scale app. This function is designed to enhance users’ interactive experience in the comment area. Users can input creative text to Easily generate a large number of images in different styles to enrich comment interaction. Users only need to enter a text comment of 6 words or more in the comment area of the short video, and click the "AI" logo in the lower right corner of the comment box to generate a comment picture with one click. They can also click "Change View" to switch to more styles. .

According to the Kuaishou AI team, through the "AI Play Review" function, users can express their opinions and emotions more accurately and more interestingly, and have more convenient and interesting interactions in the comment area, without having to look for suitable pictures. Or emoticon package, but can directly generate a picture. It is understood that AI game reviews can generate pictures ranging from common styles such as cyberpunk, pixels, and realistic animation, to pictures with strong personal styles such as Makoto Shinkai, Hayao Miyazaki, and Katsuhiro Otomo.

Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?

By analyzing the content input by the user, drawing semantic pictures has become a standard function of Stable Diffusio, midjourney, and various large AI models with Vincentian diagram functions in the domestic market. In other words, Kuaishou's AI review is essentially an AI painting tool. The technology behind it is mainly based on NLP natural semantic processing, and accurately identifying what the user wants to express is a key element

The effect of AI game review depends on the prompt word (Prompt). According to netizens’ experience, if the text comments contain more descriptive content about people, scenery, space, actions, etc., the generated pictures will be more consistent with the actual situation. On the contrary, if there are vague descriptions in the comments that lack specific referents, such as "666" or "Oh my god! Sister is so awesome!", the results generated by the AI will not be viewable. Therefore, this reality directly leads to the fact that AI game reviews may not be loved by most users

Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?

The question is, what is the comment area of the short video platform like at this stage? In fact, this is a scene full of witticisms, jokes, witty remarks and other emotional content. Due to the characteristics of short videos, including magical brainwashing background music, intensely stimulating pictures and uncertain reward mechanisms, users give up thinking and become immersed in it. Therefore, comments in the comment area are usually just a simple sentence, which users will use to clearly express their likes, dislikes or opinions

The result of this reality is that the content output by users in the short video comment area is basically emotional and lacks qualitative content. Just imagine, if it is just a pile of adjectives, AI will face the confusion of lacking a subject, which means that the final content generated by AI may be very different from what the user wants to express. I believe that friends who have used tools such as Stable Diffusio and midjourney know that if Prompt is mainly adjectives, the result of the lack of nouns is that the AI will let itself go.

Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?

Even the most advanced GPT-4 is actually flawed in experiencing human emotions. In fact, AI's emotional perception ability is still a problem facing all AI researchers at this stage. At present, many large AI models are oriented to either serious productivity scenarios or conversations with humans, and almost no AI involves emotional expression. So in this way, it is actually difficult for Kuaishou’s AI game reviewers to do their job well. It might be good not to hinder users’ comments.

So in this case, why does Kuaishou launch AI game review? Of course, the purpose is to make the large model of Vincent's picture "pictureable" and have a realistic scene. The Kuaishou App itself is almost Kuaishou’s only consumer-oriented product, so “AIGC short video” has become almost the only card they can play. In fact, we can see from here that Kuaishou, as a new giant emerging in the mobile Internet era, is still inferior to traditional giants such as BAT in terms of background.

Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?

Unlike BAT, which has almost built itself into an Internet water, coal and electricity company, Kuaishou, a group of new giants that grew up in the mobile Internet era, currently almost all show the characteristics of a single business line of "strong trunks and weak branches", such as Kuaishou’s core business is basically based on the Kuaishou App, and almost all other businesses have not yet been launched. Before this round of AI concepts broke out, Baidu, which was once considered lonely by the outside world, in addition to a search engine, also made an input method, so Baidu's native AI applications can be carried on Baidu input method.

Looking back at Kuaishou, apart from the Kuaishou App, where else can the "tutu" large model be used? If Kuaishou wants to make an app solely for large AI models, Kuaishou may lose the opportunity. The current situation is that there is actually no generational difference in performance between the major AI models in the domestic market. The actual use experience of each model is basically the same, and the user's choice is often as long as it is useful. Even for users who want to experience the charm of large AI models, many have downloaded Baidu Wenxinyiyan, which has a first-mover advantage.

Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?

In fact, station B may have set a better example for combining AIGC with video. Previously this summer, Station B launched the "AI Video Assistant" account. Users only need to @AI Video Assistant in the comment area of the corresponding video, and the latter can automatically generate a text summary of the video. For the long videos of Station B, the summary and organization of the AI video assistant can help users complete information extraction in a short time, so it will naturally be welcomed by many users.

As a product with more prominent entertainment attributes, if Kuaishou App wants to better integrate with AIGC, it must naturally meet users’ entertainment needs. For example, intelligently generating emoticons based on comments may be far more suitable for the atmosphere of the platform than creating pictures of people in the comment area.

The above is the detailed content of Kuaishou internal beta AI play review: What is the collision effect between large models and short videos?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks ago By DDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks ago By DDD

Where to find the Crane Control Keycard in Atomfall

3 weeks ago By DDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

1 months ago By DDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks ago By DDD

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7611

CakePHP Tutorial

1387

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

136

Related knowledge

Top 5 GenAI Launches of February 2025: GPT-4.5, Grok-3 & More! Mar 22, 2025 am 10:58 AM

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAI’s Grok 3 and Anthropic’s Claude 3.7 Sonnet, to OpenAI’s G

How to Use YOLO v12 for Object Detection? Mar 22, 2025 am 11:07 AM

YOLO (You Only Look Once) has been a leading real-time object detection framework, with each iteration improving upon the previous versions. The latest version YOLO v12 introduces advancements that significantly enhance accuracy

Best AI Art Generators (Free & Paid) for Creative Projects Apr 02, 2025 pm 06:10 PM

The article reviews top AI art generators, discussing their features, suitability for creative projects, and value. It highlights Midjourney as the best value for professionals and recommends DALL-E 2 for high-quality, customizable art.

Is ChatGPT 4 O available? Mar 28, 2025 pm 05:29 PM

ChatGPT 4 is currently available and widely used, demonstrating significant improvements in understanding context and generating coherent responses compared to its predecessors like ChatGPT 3.5. Future developments may include more personalized interactions and real-time data processing capabilities, further enhancing its potential for various applications.

Best AI Chatbots Compared (ChatGPT, Gemini, Claude & More) Apr 02, 2025 pm 06:09 PM

The article compares top AI chatbots like ChatGPT, Gemini, and Claude, focusing on their unique features, customization options, and performance in natural language processing and reliability.

How to Use Mistral OCR for Your Next RAG Model Mar 21, 2025 am 11:11 AM

Mistral OCR: Revolutionizing Retrieval-Augmented Generation with Multimodal Document Understanding Retrieval-Augmented Generation (RAG) systems have significantly advanced AI capabilities, enabling access to vast data stores for more informed respons

Top AI Writing Assistants to Boost Your Content Creation Apr 02, 2025 pm 06:11 PM

The article discusses top AI writing assistants like Grammarly, Jasper, Copy.ai, Writesonic, and Rytr, focusing on their unique features for content creation. It argues that Jasper excels in SEO optimization, while AI tools help maintain tone consist

Getting Started With Meta Llama 3.2 - Analytics Vidhya Apr 11, 2025 pm 12:04 PM

Meta's Llama 3.2: A Leap Forward in Multimodal and Mobile AI Meta recently unveiled Llama 3.2, a significant advancement in AI featuring powerful vision capabilities and lightweight text models optimized for mobile devices. Building on the success o

See all articles