Behind the birth of Tongyi Tingwu, the first shot of AI large model response application-AI-php.cn

Home

Technology peripherals

Behind the birth of Tongyi Tingwu, the first shot of AI large model response application

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Jun 05, 2023 pm 01:50 PM

application ai large model behind

Behind the birth of Tongyi Tingwu, the first shot of AI large model response application Picture from Canva Drawable

At the beginning of 2023, the popularity of ChatGPT quickly aroused the industry’s attention to generative AI applications, and the competition for large AI models has intensified.

As an important player involved, Alibaba Cloud first launched the Tongyi Qianwen large model at the Alibaba Cloud Summit held on April 11. Immediately after June 1, Alibaba Cloud announced the progress of the Tongyi large-scale model. The new AI product "Tongyi Listening" focusing on audio and video content was officially launched, becoming the first large-scale model application product in China open to public testing. This means that Alibaba Cloud's large-scale model Language model has taken another big step towards front-end applications. At the same time, the release of its AI-based large model application is equivalent to dropping a blockbuster on the current disputed "large model melee."

The Thousand Model War is about to break out

In recent months, as major Silicon Valley companies such as Microsoft, Google, and Amazon have announced large models and AIGC progress, and started the AI competition through a series of applications such as AI search engines and AI office software, domestic Internet cloud companies have also The whole army attacked one after another. Not only big manufacturers, but also many start-ups, VC/PE institutions, and giants from various industries have poured into the AI large model track, trying to get a piece of the action. According to incomplete statistics, there are currently no fewer than 50 large companies that have announced plans to launch large AI models, and there are countless other participants.

From a structural perspective, Internet technology companies represented by "BATH" and others have firmly ranked in the first echelon of the industry by virtue of their comprehensive strengths such as scenarios, computing power, and full-stack technical capabilities; while they are in the second tier. Important industry companies, such as SenseTime, China Telecom, JD.com, 360, etc., have relied on their influence in related fields to rank in the second echelon of the industry; closely followed by various industry giants, including the co-founder of Meituan Well-known entrepreneurs such as Wang Huiwen and former Sogou CEO Wang Xiaochuan have knowledge, institutional support and relevant backgrounds. However, because they are still in the initial stage, they are temporarily ranked in the third echelon in the melee of large AI models.

From a classification point of view, each enterprise has its own positioning and division of labor around general large models and special large models. According to industry insiders, current domestic large models can be mainly divided into two categories: one is a general large model that benchmarks GPT, focusing on companies at the basic level, such as Alibaba, Baidu and other companies; the other is based on open source large models. We train large vertical models on the platform, focusing on companies in vertical industries, such as large vertical models used in finance, medical care, transportation and other fields. Due to the high technical requirements and high cost of the former, many industries have begun to cooperate with general large model companies to train vertical large models suitable for their own industries based on their own conditions.

From the perspective of the industry chain, computing power manufacturers, cloud service manufacturers, and front-end application manufacturers are all involved. The system is complete, closely connected, and broad in scope. Currently, including computing power manufacturer NVIDIA, cloud service provider Alibaba Cloud, and front-end application manufacturers such as Kingsoft Office, UFIDA, and Yuncong Technology, they are all actively participating in the technology research and development and scenario implementation of AI large models. In short, the current melee over large AI models is heating up suddenly.

Tongyi Tingwu launches the first shot of large model application

From the perspective of the levels involved in AI large models, it can be mainly divided into four levels: application layer, model layer, framework layer and chip layer. At present, most companies on the market are still at the model layer, and go deeper. It involves research on the framework layer and chip layer, and front-end applications are currently not available in the industry. As the industry's first large-scale model application product in public beta, Tongyi Tingwu's demonstration effect is extremely obvious.

On the one hand, compared with the underlying technology, the value of technology close to the application front-end will be more imaginative. Looking at the history of all technological evolutions, it is not difficult to find that the core value of technology lies in the popularity of its application and the degree to which it solves user problems. Because of this, front-end applications that undertake this responsibility and mission often have greater room for imagination.

Take Tongyi Listening, launched by Alibaba Cloud on June 1, as an example. By connecting to Alibaba’s Tongyi large model, its role is no longer just a simple tool for audio and video transcription, but it has It has become an efficient AI assistant to help people in audio and video scenarios. It provides people with a variety of office services such as automatically taking notes, organizing interviews, and extracting PPT. It can also realize the ability to convert audio and video into graphic content, summarize chapter by chapter, and summarize full-text opinions. It has "human-like" efficient retrieval and classification. ability.

In addition, in subdivided scenarios, it also has many "treasure" functions. For example, it can open the Chrome plug-in. Foreign language learners and hearing-impaired people can use the bilingual floating subtitle bar to watch videos without subtitles anytime and anywhere without schedule conflicts. At the same time, Tingwu can also become a "meeting stand-in" for professionals. When the meeting is muted, the AI can record the meeting and organize the key points; the transcription results can be downloaded as subtitle files to facilitate video post-production by new media practitioners; Tingwu organizes The Q&A review allows reporters, analysts, lawyers, HR and other groups to organize interviews more efficiently. In short, in terms of its ability to solve specific scene problems, it has exceeded the capabilities of existing audio and video applications, and has also refreshed the upper limit of previous user experience. It will definitely cause quite a stir in the industry.

On the other hand, from Alibaba Cloud itself, the Tongyi Qianwen big model has just been launched, and it can quickly launch technical applications based on the AI big model, which shows that the Alibaba Cloud AI big model is indeed relatively mature. , has the ability to derive AI applications.

From the model layer to the application layer, from AI large models to the birth of large model applications, it seems simple but in fact it is not easy to implement. Generally speaking, vertical applications are implemented based on the technical base of the general large model. Leaving aside the general large model, these are usually impossible to implement, and this requires that the general large model on which it relies is mature enough, otherwise it will be very difficult to implement. It is difficult to launch applications that are relatively easy to use and exceed the capabilities of existing products. Take Tongyi Tingwu as an example, its technical strength is not shared by all players in the industry.

Full-system AI infrastructure becomes the key to victory

Judging from the hundreds of billions of parameter training requirements required by the large AI model itself, the difficulty and complexity of its advancement may far exceed what the outside world imagines. In the long run, only companies with full-stack AI large-model technical capabilities and infrastructure capabilities will be able to go further.

First of all, because the development speed of generative AI far exceeds external expectations, the progress of any single link has limited promotion effect on the overall large model training. According to OpenAI’s calculations, since 2012, the global demand for computing power for AI model training has doubled in 3-4 months, with an annual growth rate of up to 10 times. However, according to Moore's Law, chip computing performance can only double every 18-24 months, which means that chip performance is far from keeping up with the development requirements of large AI models. Specific to related fields, CPU-based computing systems are difficult to meet the high-bandwidth and low-latency network transmission requirements for large model training. Solving these problems cannot be effective in the short term by relying on a single "pile of computing power" , and may not be economical. We must rely on multi-level overall system support from algorithms, computing power, frameworks, etc. to better cope with this change.

Secondly, due to the large computing power required to develop general-purpose large models, high inference training costs, and high data volume requirements, the threshold itself is very high, and it does not have the ability to develop full-stack large models and implement scenarios. Companies with poor capabilities and ecological openness will find it difficult to maintain a fast pace of change and will easily be eliminated. According to analysis by industry insiders, to create a successful universal large model that can be exported for external commercialization, manufacturers need to have full-stack large model training and R&D capabilities, business scenario implementation experience, AI security governance measures, and ecological openness and other core advantages, and it is difficult for ordinary enterprises to fully possess these capabilities.

As the first cloud computing service provider in Asia and the third largest in the world, Alibaba Cloud has the strongest computing power support system in China. For example, Alibaba Cloud Feitian Cloud operating system can achieve a single cluster scale of 100,000 units and a computing capacity of 100 billion files. Its Feitian intelligent computing platform can achieve 90% parallel efficiency of 1,000 cards. The self-developed network architecture can handle 10,000 cards of scale. AI clusters provide congestion-free, high-performance cluster communication capabilities. Alibaba Cloud's own deep learning platform PAI can increase computing resource utilization by more than 3 times, AI training efficiency by 11 times, and inference efficiency by 6 times. In addition, Alibaba Cloud has also taken the lead in establishing the largest AI model service community "Magic" in China to reduce the cost of large model development and help AI inclusiveness; in terms of algorithms, Alibaba has advanced language and multi-modal capabilities, ultra-large models, and general unified models. In many technical dimensions, it is in the first echelon in China. This is the core reason why Ali Tongyi’s large model can quickly “break out of the circle”.

Thirdly, from the perspective of business possibilities, companies with full-system AI infrastructure capabilities will have greater business value after the arrival of MaaS (Model as a Service) and will have greater market competition. "room for maneuver". Take Alibaba Cloud as an example. In the later period, you can not only obtain platform service fees by providing general large model services; you can also rent out computing power and promote training platforms to earn rent. There are relatively more ways to monetize, and you can compete in the market. Flexibly adjust product pricing based on circumstances to address operational challenges.

The industry welcomes the era of AI HP

With the birth of AI large model applications, a new era characterized by deep AI inclusiveness is gradually beginning. AI is deeply embedded in industrial entities and will become an irreversible industry trend.

On the one hand, the high threshold of general large models and the wide range of differentiated needs in vertical fields determine that exclusive large models and industrial applications based on general large models will become the mainstream application direction in the future, promoting the acceleration of AI. Entering thousands of industries. As mentioned above, the high threshold for general-purpose large models determines that only a few companies at home and abroad can make general-purpose large models. And as AI models become larger, the AI industry is moving from a light industry composed of "handmade workshops" Intensive production requires high-performance, low-cost systematic infrastructure to complete industrial production.

Not only do many small and medium-sized enterprises not have this ability, but even for leading companies in various industries, optimizing large model training from 0 to 1 is not economical in itself. All walks of life need AI infrastructure that is low enough in cost. And for existing manufacturers, it is not necessary to continue to join this field and "reinvent the wheel." In contrast, the training cost of large vertical models is relatively low, and some companies with rich data scenarios in professional fields have better conditions for building large vertical models and better data quality, and the products they launch are more adaptable to the vertical industry. Therefore, GPT in various vertical industries may become mainstream large model applications in the future, promoting the rapid penetration of AI into the industry.

On the other hand, the short-term bottleneck in developing large AI models is computing power, and in the long term is data. Therefore, high-quality front-end applications can help accelerate enterprises to accumulate sufficient data assets and enhance their long-term competitiveness. Accelerate the process of inclusive industrial application. Currently, the rapid iteration and evolution of large models forces all participating players to continuously accumulate computing power resources and optimize configurations from various aspects such as chips and cloud services to ensure the computing power support required for large model training. However, in the longer term, the algorithm for training large AI models is still being continuously optimized and adjusted. With breakthroughs in the algorithm in the future, computing power may no longer be a bottleneck, and high-quality data resources will become a scarce resource. , will receive more attention.

As the industry's first application based on AI large models, the launch of Tongyi Tingwu will help Alibaba accelerate the accumulation of high-quality data resources, accelerate the process of industrial inclusiveness, and lay a good foundation for longer-term development.

The above is the detailed content of Behind the birth of Tongyi Tingwu, the first shot of AI large model response application. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks ago By DDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks ago By DDD

Where to find the Crane Control Keycard in Atomfall

3 weeks ago By DDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

1 months ago By DDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks ago By DDD

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7612

CakePHP Tutorial

1387

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

136

Related knowledge

How to Undo Delete from Home Screen in iPhone Apr 17, 2024 pm 07:37 PM

Deleted something important from your home screen and trying to get it back? You can put app icons back on the screen in a variety of ways. We have discussed all the methods you can follow and put the app icon back on the home screen. How to Undo Remove from Home Screen in iPhone As we mentioned before, there are several ways to restore this change on iPhone. Method 1 – Replace App Icon in App Library You can place an app icon on your home screen directly from the App Library. Step 1 – Swipe sideways to find all apps in the app library. Step 2 – Find the app icon you deleted earlier. Step 3 – Simply drag the app icon from the main library to the correct location on the home screen. This is the application diagram

The role and practical application of arrow symbols in PHP Mar 22, 2024 am 11:30 AM

The role and practical application of arrow symbols in PHP In PHP, the arrow symbol (->) is usually used to access the properties and methods of objects. Objects are one of the basic concepts of object-oriented programming (OOP) in PHP. In actual development, arrow symbols play an important role in operating objects. This article will introduce the role and practical application of arrow symbols, and provide specific code examples to help readers better understand. 1. The role of the arrow symbol to access the properties of an object. The arrow symbol can be used to access the properties of an object. When we instantiate a pair

How to build an AI-oriented data governance system? Apr 12, 2024 pm 02:31 PM

In recent years, with the emergence of new technology models, the polishing of the value of application scenarios in various industries and the improvement of product effects due to the accumulation of massive data, artificial intelligence applications have radiated from fields such as consumption and the Internet to traditional industries such as manufacturing, energy, and electricity. The maturity of artificial intelligence technology and application in enterprises in various industries in the main links of economic production activities such as design, procurement, production, management, and sales is constantly improving, accelerating the implementation and coverage of artificial intelligence in all links, and gradually integrating it with the main business , in order to improve industrial status or optimize operating efficiency, and further expand its own advantages. The large-scale implementation of innovative applications of artificial intelligence technology has promoted the vigorous development of the big data intelligence market, and also injected market vitality into the underlying data governance services. With big data, cloud computing and computing

From beginner to proficient: Explore various application scenarios of Linux tee command Mar 20, 2024 am 10:00 AM

The Linuxtee command is a very useful command line tool that can write output to a file or send output to another command without affecting existing output. In this article, we will explore in depth the various application scenarios of the Linuxtee command, from entry to proficiency. 1. Basic usage First, let’s take a look at the basic usage of the tee command. The syntax of tee command is as follows: tee[OPTION]...[FILE]...This command will read data from standard input and save the data to

Explore the advantages and application scenarios of Go language Mar 27, 2024 pm 03:48 PM

The Go language is an open source programming language developed by Google and first released in 2007. It is designed to be a simple, easy-to-learn, efficient, and highly concurrency language, and is favored by more and more developers. This article will explore the advantages of Go language, introduce some application scenarios suitable for Go language, and give specific code examples. Advantages: Strong concurrency: Go language has built-in support for lightweight threads-goroutine, which can easily implement concurrent programming. Goroutin can be started by using the go keyword

The wide application of Linux in the field of cloud computing Mar 20, 2024 pm 04:51 PM

The wide application of Linux in the field of cloud computing With the continuous development and popularization of cloud computing technology, Linux, as an open source operating system, plays an important role in the field of cloud computing. Due to its stability, security and flexibility, Linux systems are widely used in various cloud computing platforms and services, providing a solid foundation for the development of cloud computing technology. This article will introduce the wide range of applications of Linux in the field of cloud computing and give specific code examples. 1. Application virtualization technology of Linux in cloud computing platform Virtualization technology

Understanding MySQL timestamps: functions, features and application scenarios Mar 15, 2024 pm 04:36 PM

MySQL timestamp is a very important data type, which can store date, time or date plus time. In the actual development process, rational use of timestamps can improve the efficiency of database operations and facilitate time-related queries and calculations. This article will discuss the functions, features, and application scenarios of MySQL timestamps, and explain them with specific code examples. 1. Functions and characteristics of MySQL timestamps There are two types of timestamps in MySQL, one is TIMESTAMP

Apple tutorial on how to close running apps Mar 22, 2024 pm 10:00 PM

1. First we click on the little white dot. 2. Click the device. 3. Click More. 4. Click Application Switcher. 5. Just close the application background.

See all articles