
Kuaishou proposes a billion-level multi-modal short video encyclopedia system - Kuaipedia

May 20, 2023 pm 05:10 PM

Introduction

More and more short video users hope not only to spend their fragmented time on leisure and entertainment, but also to gain knowledge on short video platforms. In 2021, playback of pan-knowledge content on Kuaishou increased by 58.11% year-on-year, and the platform hosted more than 33 million pan-knowledge live broadcasts over the year [1]. To better understand and organize pan-knowledge videos, Kuaishou MMU teamed up with Harbin Institute of Technology and others to propose the industry's first multi-modal short video encyclopedia, "Kuaipedia". It uses multi-modal and knowledge graph technology to mine large-scale, high-quality knowledge videos from massive short videos and structure them into a systematic short video encyclopedia knowledge base. The aim is to give users a better knowledge acquisition experience while encouraging creators to produce high-quality knowledge content and building a healthy knowledge-sharing ecosystem.


Paper link: https://www.php.cn/link/b0da9d8dd88178e3bb138e08742eb2e2

Project homepage: https://www.php.cn/link/1a725948eb0c738707b5c026a65ba618

The team mined hundreds of millions of knowledge videos from Kuaishou's massive short video collection, structured them, and built a video encyclopedia system with tens of millions of entries and knowledge points. Kuaipedia helps the academic community advance AI understanding of world knowledge through multi-modal information, and it has considerable room for application in industry.

Introduction


The encyclopedia dates back to ancient Greece and Rome and was also an outstanding achievement of the French Enlightenment in the 17th and 18th centuries. A knowledge encyclopedia usually refers to a reference book or compendium that briefly introduces all human knowledge or a specific field or subject. With the rapid development of the Internet, online encyclopedias such as Wikipedia and Baidu Encyclopedia have become a new carrier of knowledge. However, these encyclopedias usually rely on pictures, text and tables, which makes it difficult to express knowledge that requires vivid demonstration, such as tutorial (How-to) knowledge. Figure 1 shows the dilemma of using pictures and text to convey the knowledge "Shiba Inu" - "how to draw". With short videos, this kind of knowledge can be explained and learned very well.

See the video: https://www.php.cn/link/70e9dbe24ba303f2d25ac34d3ae945c5

Figure 1: The dilemma of conveying how-to knowledge with pictures and text. The pictures and text are frame screenshots taken from a short video.

As the content industry and media formats continue to evolve, short videos have increasingly become a primary medium for spreading knowledge, and they have natural advantages in conveying skills and expertise in particular. Although some public online encyclopedias include video content, it usually takes the form of brief introductions (such as Encyclopedia of Instant Understanding), so short videos are not used to their full extent and their expressive power in knowledge encyclopedias has been severely underestimated. For example, when people talk about "Shiba Inu", they care not only about the "introduction" but also about "how to choose", "how to comb the hair", "how to correct food guarding", and so on. We therefore believe that organizing knowledge-based short videos into a structured short video encyclopedia is an effective way to understand world knowledge and to help people spread knowledge more efficiently.

With reference to the national standard [2], knowledge is grouped into popular science knowledge and tutorial knowledge, the latter covering the skill (How) category, and high-quality knowledge videos are discovered on this basis among Kuaishou's massive videos. In addition, we present the subject of the knowledge extracted from a short video in the form of an entry (such as Shiba Inu) and extract the specific knowledge points explained in the video (such as Shiba Inu - selection, Shiba Inu - food guarding correction, etc.), ultimately forming a short video encyclopedia knowledge system, as shown in Figure 2.


Figure 2: Kuaipedia - overview of the multi-modal short video encyclopedia
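The structure described above (entries, knowledge points, and the videos attached to them) can be pictured as a small tree. The following is a minimal Python sketch of such a data model, not the schema used by the authors: the class names, the `video_id` field, and the sample videos are all assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class KnowledgeVideo:
    video_id: str   # hypothetical identifier, not a real Kuaishou ID
    title: str


@dataclass
class KnowledgePoint:
    name: str       # e.g. "selection"
    videos: list[KnowledgeVideo] = field(default_factory=list)


@dataclass
class Entry:
    name: str       # e.g. "Shiba Inu"
    points: dict[str, KnowledgePoint] = field(default_factory=dict)

    def add_video(self, point_name: str, video: KnowledgeVideo) -> None:
        """Attach a video to a knowledge point, creating the point if needed."""
        point = self.points.setdefault(point_name, KnowledgePoint(point_name))
        point.videos.append(video)


# Illustrative data only: these videos are invented for the example.
shiba = Entry("Shiba Inu")
shiba.add_video("selection", KnowledgeVideo("v001", "How to choose a Shiba Inu puppy"))
shiba.add_video("food guarding correction", KnowledgeVideo("v002", "Stop your Shiba guarding food"))
shiba.add_video("selection", KnowledgeVideo("v003", "Shiba Inu buying tips"))

for point in shiba.points.values():
    print(f"{shiba.name} - {point.name}: {[v.video_id for v in point.videos]}")
```

Keying knowledge points by name under each entry mirrors the "entry - knowledge point" naming used in the text (Shiba Inu - selection), and keeps every video reachable through exactly one entry and one knowledge point.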

Kuaipedia makes the following contributions:

Definition of "Kuaipedia": We pioneer a new kind of multi-modal knowledge encyclopedia, composed of entries, knowledge points, knowledge-based short videos and the relationships between them. This is the industry's first structured multi-modal short video encyclopedia.

Methods for building a large-scale short video encyclopedia: We propose combining knowledge video recognition, entry and knowledge point mining, and multi-modal knowledge linking to build a large-scale short video encyclopedia. We also introduce the task of "multi-modal knowledge linking" as an extension of traditional entity linking.

Applications with great potential: Academically, Kuaipedia uses a brand-new way of organizing short videos by knowledge point, which can raise the ceiling on machine understanding of world knowledge that currently relies only on text-and-image knowledge graphs (KG). It has great potential for downstream KG tasks such as entity linking and entity classification, and for downstream content understanding tasks in NLP, CV, and other fields. In industry, a form such as Kuaipedia can help short video platforms operate and organize content efficiently and improve how efficiently users understand and consume knowledge.

Technical Overview

To build the short video encyclopedia structure described above, the core technology comprises the following three main steps, as shown in Figure 3.

Knowledge video recognition: A multi-modal video pre-training model is used to understand and identify knowledge videos among massive videos;

Entry and knowledge point mining: The entry system is built "top-down" by integrating multi-source knowledge bases, and the relationships between entries and knowledge points are then built "bottom-up" by mining user search queries, forming an entry-knowledge point tree;

Multi-modal knowledge linking: This extends the traditional "entity linking" task into a "multi-modal knowledge linking" task, which uses multi-modal content understanding to link a video to a specific knowledge point (such as food guarding correction) of an entry (such as Shiba Inu).


Figure 3: Kuaipedia construction pipeline
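To make the three steps more concrete, here is a toy Python sketch of how they might fit together. It is not the authors' implementation. Every function name and data value is hypothetical, a keyword check stands in for the multi-modal pre-training model, and word overlap on text stands in for multi-modal content understanding.

```python
from __future__ import annotations

from collections import defaultdict

# Hypothetical inputs: none of these values come from the paper.
ENTRIES = ["Shiba Inu", "Sourdough"]   # stands in for the "top-down" entry system
QUERIES = [
    "shiba inu how to choose",
    "shiba inu food guarding correction",
    "sourdough starter feeding",
    "shiba inu how to choose",
]
KNOWLEDGE_CUES = ("how to", "tutorial", "tips", "guide")  # placeholder for a real classifier


def is_knowledge_video(text: str) -> bool:
    """Step 1 stand-in: a keyword check replaces the multi-modal model."""
    lowered = text.lower()
    return any(cue in lowered for cue in KNOWLEDGE_CUES)


def mine_knowledge_points(entries: list[str], queries: list[str]) -> dict[str, list[str]]:
    """Step 2 stand-in: a query that starts with an entry name contributes
    its remainder as a candidate knowledge point ("bottom-up")."""
    tree: dict[str, set[str]] = defaultdict(set)
    for query in queries:
        q = query.lower().strip()
        for entry in entries:
            prefix = entry.lower() + " "
            if q.startswith(prefix):
                tree[entry].add(q[len(prefix):])
    return {entry: sorted(points) for entry, points in tree.items()}


def link_video(text: str, tree: dict[str, list[str]]) -> tuple[str, str] | None:
    """Step 3 stand-in: pick the (entry, knowledge point) pair whose words
    overlap most with the video text. Text only, so not truly multi-modal."""
    words = set(text.lower().split())
    best, best_score = None, 0
    for entry, points in tree.items():
        if not set(entry.lower().split()) <= words:   # the entry must be mentioned
            continue
        for point in points:
            score = len(words & set(point.split()))
            if score > best_score:
                best, best_score = (entry, point), score
    return best


tree = mine_knowledge_points(ENTRIES, QUERIES)
video_text = "How to choose a healthy Shiba Inu puppy"
if is_knowledge_video(video_text):
    print(link_video(video_text, tree))
```

In this sketch the duplicate query collapses into one knowledge point because candidates are kept in a set, and a video is linked only when the entry name appears in its text. A real system would need to score visual and audio signals as well as text.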

Extensive and detailed manual evaluation shows that the knowledge points and videos mined by Kuaipedia have high accuracy and quality. For more detailed algorithms and experimental data, please refer to the paper or our GitHub homepage (see the links at the beginning of the article).

Applications

First, multi-modal short video encyclopedia systems such as Kuaipedia have great potential in academia to advance AI technology for understanding world knowledge. On the one hand, Kuaipedia breaks through the limitations of pictures, text and tables, and describes an entity or concept through richer knowledge points and short videos. This approach can promote the development of multi-modal knowledge graph technology. On the other hand, these knowledge points and short videos help AI better understand world knowledge, especially How-to knowledge that is difficult to express in pictures and text. This kind of multi-modal knowledge can enhance AI's understanding of the world and is very helpful for downstream applications in KG, NLP, CV and other fields. On the CCKS entity linking task, we showed that simply introducing Kuaipedia's multi-modal knowledge can effectively improve BERT's performance in entity linking and entity classification.

In addition, Kuaipedia has broad room for application in industry. As the short video ecosystem expands toward pan-knowledge content, the existing form constrains how that knowledge is disseminated. Kuaipedia can improve the platform's operation and distribution efficiency through structured content and better meet users' demand for knowledge. We first tried applying this technology in the health category. The Kuaishou Health team had previously mined a batch of high-quality PUGC content purely by hand, organized by disease type. However, pain points such as an incomplete disease knowledge system and a small volume of authoritative knowledge videos made it difficult to efficiently build a complete, large-scale, structured disease video system. With Kuaipedia's technology, a batch of high-quality knowledge points and knowledge videos with Kuaishou characteristics was mined automatically, which enriched the disease content and was dozens of times more efficient than purely manual construction. This content has now launched on the Featured page of the Kuaishou App: tapping the bottom bar of a disease-related video in the featured video stream opens the "Kuaishou Health" half-screen page, where users can browse the knowledge points and related knowledge videos under the entry to which the video belongs, as shown in Figure 4.


Figure 4: Kuaipedia deployed in the health scenario

In addition to health, Kuaipedia also covers knowledge content in many fields such as education, food, agriculture, rural areas and farmers, parenting, law, technology, and finance, and has great application potential.

Conclusion

In view of the prospects for pan-knowledge content in the short video industry, we proposed the "Kuaipedia" multi-modal short video encyclopedia system. Starting from massive short video content, we mined hundreds of millions of high-quality knowledge videos through multi-modal knowledge graph construction technology and structured the knowledge content to build the industry's first large-scale, systematic short video encyclopedia knowledge base, which has great potential in both academia and industry.

About the Authors

First author: Pan Haojie

Member of the Kuaishou MMU Knowledge Graph Center and leader of the Kuaipedia project. He received his bachelor's degree from Zhejiang University and his master's degree from Hong Kong University of Science and Technology, and was previously responsible for large-scale NLP algorithms and frameworks at Alibaba Cloud PAI. He has published more than 10 papers in top conferences and journals such as ACL, EMNLP, KDD, and AIJ, and holds a number of domestic and US patents; see Zhihu for details. He joined Kuaishou in 2021.

Corresponding author: Fu Ruiji

Head of the Kuaishou MMU Knowledge Graph Center. He received his bachelor's, master's and doctoral degrees from Harbin Institute of Technology and was a postdoctoral fellow at the University of Science and Technology of China. He previously served as deputy director of the iFlytek AI Research Institute and won the first prize of the Wu Wenjun Artificial Intelligence Technology Progress Award. He has published many academic papers in international conferences and journals such as ACL, EMNLP, COLING, IJCAI, and TASLP, and has applied for (or been granted) more than 40 national invention patents. He joined Kuaishou in 2021.

Cooperating teacher: Liu Ming

Professor and doctoral supervisor, Department of Computing, Harbin Institute of Technology. He has presided over many funded projects, including a National Key R&D Program project, the National Natural Science Foundation, the China Postdoctoral Science Foundation Special Grant, the China Postdoctoral Science Foundation General Grant (First Class), and the Heilongjiang Provincial General Fund. He has won the first prize of the Heilongjiang Province Science and Technology Award, a Harbin City Science and Technology Achievement award, and the first prize of the 6th National Youth Artificial Intelligence Innovation and Entrepreneurship Conference. In recent years, he has published more than 20 CCF A/B papers as first author or corresponding author, participated in the editing of one textbook, and translated one book. He has served as knowledge graph area chair of NLPCC 2020, CCKS 2020, and COLING 2022, publication chair of CCKS 2019, evaluation chair of CCKS 2021, and workshop chair of CCKS 2022.

References

[1] Kuaishou, 2022 Kuaishou Pan-Knowledge Content Ecosystem Report.

[2] National Standards Committee: Knowledge Management Framework, GB/T 23703.
