


Five-minute technical talk | AIGC introduction and application selection evaluation
Part 01 Introduction to AIGC
AIGC (AI- Generated Content (artificial intelligence production content) refers to the production method that uses AI technology to automatically or assist in generating various forms of content such as text, code, images, voices, videos, and 3D objects. AIGC represents a new trend in the development of AI technology, from perceiving and understanding the world to generating and creating the world, and from analytical capabilities to creative capabilities. AIGC has also brought about changes in content creation, improving the quality, efficiency and diversity of content.
1.1 Text generation
Text generation refers to using AI technology automatically generates text content that conforms to grammar and logic based on given input (such as keywords, pictures, voices, etc.), which is an important aspect of AIGC.
The application scenarios of text generation are very rich, including news writing, novel creation, marketing copywriting, customer service Q&A, chat robots, educational coaching, knowledge graphs, and summary generation wait.
➤ Wen Xinyiyan: A large AI model launched by Baidu that supports multi-modal output and can perform literary creation, business copywriting creation, and mathematical logic calculations , Chinese understanding, multi-modal generation, etc.
➤ ChatGPT: A chat application based on the GPT series model launched by OpenAI. Currently, the GPT-4 model has been launched. ChatGPT based on the GPT-4 model can analyze images. And interact with text and pictures.
1.2 Code generation
Includes code completion, Code refactoring, code optimization, code annotation, etc. can cover a variety of programming languages and fields. Based on OpenAI's GPT-4 model, it is even possible to generate the corresponding website code based on a hand-drawn product prototype draft.
➤ Github Copilot: An AI-assisted programming tool developed based on the OpenAI Codex model. It supports dozens of programming languages and can perform real-time processing based on code or comments. It can provide code suggestions and entire functions in the editor, and can also achieve a pair programming experience through chat interaction.
➤ Cursor: an independent IDE software that integrates OpenAI’s GPT model. Similar to Github Copilot, Cursor can write code, edit code and chat through AI.
1.3 Image generation
Image generation refers to using Artificial intelligence technology, based on given input (such as natural language, images, videos, etc.), automatically generates images that conform to semantics and aesthetics, is an important aspect of AIGC. Image generation has a wide range of application scenarios, including artistic creation, entertainment media, education and training, e-commerce marketing, medical diagnosis, etc.
➤ Wenxin Yige: An AI art and creative assistance platform launched by Baidu. Paintings can be automatically generated based on text description and style selection.
➤ DALL-E2: A generative model based on an adaptive multi-modal encoder launched by OpenAI in (such as text, pictures, etc.) to automatically generate high-quality images.
➤ Midjourney: An AI painting tool released in March 2022. It can generate pictures based on natural language, select the artistic styles of different painters, and also Identify specific lenses or photography terms. Paintings generated by this tool have won first prize in art competitions.
1.4 Video generation
Video generation is mainly divided into two types: video editing and independent video generation. Video editing can be used for video super-scoring, repair and editing. Autonomous video generation can be used for image-to-video conversion, or for generating matching videos given descriptive text. The following are some related applications:
➤ Deepfake: This is an AI video generation platform based on GAN technology, which can realize face changing, voice conversion, and expression imitation and other functions. Users only need to upload a picture or a video as a reference, and the video will be automatically generated.
➤ Make-A-Video: An AI system launched by Meta Company that can convert text into video. It can create one-of-a-kind videos filled with vibrant colors, people, and scenery based on just a few words or lines of text.
1.5 3D modeling
AIGC-based 3D Modeling technology refers to the use of artificial intelligence technology to automatically generate 3D models that comply with semantics and aesthetics based on given input (such as natural language, images, etc.). This area is currently in an early stage of exploration. The following are some related applications or models:
➤ AICommand: an open source AI command plug-in based on Unity that can generate 3D scenes through text descriptions and Text is adjusted and optimized for 3D scenes. (https://github.com/keijiro/AICommand)
➤ ICON: An open source AI model that generates 3D character modeling based on character pictures (https: //github.com/YuliangXiu/ICON). You can experience and download the generated 3D model online: https://huggingface.co/spaces/Yuliang/ICON
Part 02 AIGC Application and Model Evaluation
After ChatGPT was launched by OpenAI at the end of 2022, the cumulative number of users exceeded 100 million in just two months. It quickly became popular all over the world. As a result, the AI iPhone moment has arrived, and major IT manufacturers quickly followed up. The following is an introduction to some relevant applications or models as of April 2023.
- Wen Xin’s words: See above.
- ChatGPT: See above.
- #Bard: A lightweight version of the NLP model launched by Google based on LaMDA.
- New Bing: An intelligent search engine based on the GPT4 model launched by Microsoft. It can interact with users in natural language and combine with real-time search results to provide Information, entertainment, creation and other functions.
- ChatGLM: A large conversational language model launched by Tsinghua University based on GLM architecture, open source and supporting Chinese and English bilinguals. Low-cost minimal model construction can be carried out based on CPU, and the model can also be developed and fine-tuned secondaryly.
- Poe: A free AI chatbot application developed by Quora. The application integrates 6 mainstream AI chatbots including: ChatGPT and GPT-4. .
will be evaluated and compared from the following aspects (except Poe):
- Natural language processing
- Logical reasoning
- Code generation
- Multi-modal support
PS:
- The ChatGPT participating in the evaluation is based on the GPT-3.5 model.
- The ChatGLM participating in the evaluation is only the minimized model: chatglm-6b-int4-qe. For practical applications, the chatglm-6b model that requires GPU memory should be built, and the quality of answers will be greatly improved.
##2.1 Natural Language Processing
Evaluation Content:
➪Multiple rounds of dialogue: Let’s create a children’s story together. The rule is that I say something first and you say something next, alternating. It ends when I say "I'm done with the story." Do you understand?
➪Language understanding: My boss said 1 1=3. Everything my boss said is right, so 1 1=3, right?
➪Language Translation: Translate this passage into English: One flower blooming alone is not spring, but a hundred flowers blooming together fill the garden.
➪Emotional analysis: Analyze the emotional color of this passage: I like this new movie very much. It made me laugh many times and moved me. Cried.
- ##ChatGPT
Bard
- ChatGLM
The scores are as follows:
##2.2 Logical reasoning
Assessment content:➪ In a There are five books on the shelf: red book, green book, blue book, orange book and yellow book. The green book is to the left of the yellow book, the yellow book is the third from the left, the red book is the second from the left, and the blue book is on the far right. What is the order of these books?
➪ There are three points A, B, and C on a 100-meter-long straight line. The position of A is uncertain. The distance between A and B is 5 meters. The distance between A and C is 10 meters, what is the possible distance between B and C?
##➪ If 2
ChatGPT#文心一言
#Bard
- #NewBing
- ChatGLM
##The scores are as follows:
#2.3 Coding AbilityEvaluation content:
Code generation: writing a python function , accepts an integer as input and determines whether it is a palindrome.
- Code explanation: Explain this line of python code: my_list = [x for x in my_list if x % 2 == 0]
- Bug detection: Where is the BUG in this line of code: my_list = [x for x in my_list if x % 2 = 0]
ChatGPT
文心一言Bard
- #NewBing
- #ChatGLM
#The scores are as follows:
##2.4 Multi-modal supportMultimodal support refers to the ability to handle multiple data types, such as text, images, audio and video, etc. For example: through text input, pictures, audio and video are automatically generated based on text requirements; through picture or audio and video input, content summary text is output, etc.
##ChatGPT- ChatGPT based on the GPT-3.5 model does not support multi-modal input and output capabilities, while ChatGPT based on the GPT-4 model can analyze pictures and analyze feedback text.
##文心一言
- Wen Xin Yi Yan can currently generate images and voices based on text descriptions. The video generation capability was demonstrated at the press conference, but during actual use, the video could not be generated.
#Bard- Google Bard does not support multi-modal capabilities.
##NewBing
- NewBing's creativity mode supports generating pictures through text descriptions.
- ChatGLM
##Tsinghua’s ChatGLM does not support multiple Modal capabilities.
The scores are as follows:
#Part 03Evaluation summary and selection evaluation
Combined with the above comparison scores, a comprehensive evaluation will be considered from the two stages of Demo and production (commercial use).
The overall evaluation score is as follows:
##The selection evaluation is as follows:##Part 04
Summary ➢
Demo stage: Wen Xinyiyan is the first AI choice, NewBing and ChatGPT are the alternative AI, and ChatGLM As an exploration direction of self-developed AIGC (requires GPU resources). ➢
In the production and commercial stage, multiple lines are available:
# Domestic mainland regions are seeking to introduce AI in the form of B-side cooperation with Wen Xinyiyan;- Domestic Hong Kong, Macao and Taiwan regions can consider introducing OpenAI’s official GPT-4 API for AI introduction;
- Based on the Tsinghua ChatGLM model, build And fine-tune the development of independent AI.
- Part 05
Conclusion WebGPUfor
The above is the detailed content of Five-minute technical talk | AIGC introduction and application selection evaluation. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



Complete Guide to Checking HDFS Configuration in CentOS Systems This article will guide you how to effectively check the configuration and running status of HDFS on CentOS systems. The following steps will help you fully understand the setup and operation of HDFS. Verify Hadoop environment variable: First, make sure the Hadoop environment variable is set correctly. In the terminal, execute the following command to verify that Hadoop is installed and configured correctly: hadoopversion Check HDFS configuration file: The core configuration file of HDFS is located in the /etc/hadoop/conf/ directory, where core-site.xml and hdfs-site.xml are crucial. use

The CentOS shutdown command is shutdown, and the syntax is shutdown [Options] Time [Information]. Options include: -h Stop the system immediately; -P Turn off the power after shutdown; -r restart; -t Waiting time. Times can be specified as immediate (now), minutes ( minutes), or a specific time (hh:mm). Added information can be displayed in system messages.

Backup and Recovery Policy of GitLab under CentOS System In order to ensure data security and recoverability, GitLab on CentOS provides a variety of backup methods. This article will introduce several common backup methods, configuration parameters and recovery processes in detail to help you establish a complete GitLab backup and recovery strategy. 1. Manual backup Use the gitlab-rakegitlab:backup:create command to execute manual backup. This command backs up key information such as GitLab repository, database, users, user groups, keys, and permissions. The default backup file is stored in the /var/opt/gitlab/backups directory. You can modify /etc/gitlab

Installing MySQL on CentOS involves the following steps: Adding the appropriate MySQL yum source. Execute the yum install mysql-server command to install the MySQL server. Use the mysql_secure_installation command to make security settings, such as setting the root user password. Customize the MySQL configuration file as needed. Tune MySQL parameters and optimize databases for performance.

PyTorch distributed training on CentOS system requires the following steps: PyTorch installation: The premise is that Python and pip are installed in CentOS system. Depending on your CUDA version, get the appropriate installation command from the PyTorch official website. For CPU-only training, you can use the following command: pipinstalltorchtorchvisiontorchaudio If you need GPU support, make sure that the corresponding version of CUDA and cuDNN are installed and use the corresponding PyTorch version for installation. Distributed environment configuration: Distributed training usually requires multiple machines or single-machine multiple GPUs. Place

Docker uses Linux kernel features to provide an efficient and isolated application running environment. Its working principle is as follows: 1. The mirror is used as a read-only template, which contains everything you need to run the application; 2. The Union File System (UnionFS) stacks multiple file systems, only storing the differences, saving space and speeding up; 3. The daemon manages the mirrors and containers, and the client uses them for interaction; 4. Namespaces and cgroups implement container isolation and resource limitations; 5. Multiple network modes support container interconnection. Only by understanding these core concepts can you better utilize Docker.

A complete guide to viewing GitLab logs under CentOS system This article will guide you how to view various GitLab logs in CentOS system, including main logs, exception logs, and other related logs. Please note that the log file path may vary depending on the GitLab version and installation method. If the following path does not exist, please check the GitLab installation directory and configuration files. 1. View the main GitLab log Use the following command to view the main log file of the GitLabRails application: Command: sudocat/var/log/gitlab/gitlab-rails/production.log This command will display product

Enable PyTorch GPU acceleration on CentOS system requires the installation of CUDA, cuDNN and GPU versions of PyTorch. The following steps will guide you through the process: CUDA and cuDNN installation determine CUDA version compatibility: Use the nvidia-smi command to view the CUDA version supported by your NVIDIA graphics card. For example, your MX450 graphics card may support CUDA11.1 or higher. Download and install CUDAToolkit: Visit the official website of NVIDIACUDAToolkit and download and install the corresponding version according to the highest CUDA version supported by your graphics card. Install cuDNN library:
