


Baidu CTO Wang Haifeng: Large language models bring the dawn of general artificial intelligence
On August 16, 2023 (Beijing time), the National Engineering Research Center for Deep Learning Technology and Applications hosted the WAVE SUMMIT Deep Learning Developer Conference. At the conference, Wang Haifeng, Baidu's Chief Technology Officer and director of the center, delivered a keynote speech. He publicly stated for the first time that large language models already possess the core basic capabilities of artificial intelligence, namely understanding, generation, logic, and memory, bringing new hope to the development of general artificial intelligence.
8 million developers have used PaddlePaddle and created more than 800,000 models
The WAVE SUMMIT Deep Learning Developer Conference has been held since April 2019. At the first conference, Wang Haifeng pointed out that deep learning is broadly applicable and shares the standardization, automation, and modularization that characterize industrial production, and that it has therefore pushed artificial intelligence into the industrialization stage. Four years of progress in deep learning technology and applications have borne this view out. Deep learning is being applied ever more widely, the standardization, automation, and modularity of deep learning platforms are increasingly evident, and the rise of pre-trained large models has further expanded the depth and breadth of artificial intelligence applications. Artificial intelligence has thus entered the stage of industrial production.
In terms of standardization, frameworks and models have been jointly optimized to adapt uniformly to a variety of hardware and to simplify application patterns, greatly lowering the threshold for artificial intelligence applications. In terms of automation, the efficiency of the entire artificial intelligence research and development process has improved: every step, from training and adaptation to inference deployment, has been automated. In terms of modularity, a rich industrial-grade model library makes it easier to apply artificial intelligence quickly in a wide range of scenarios.
PaddlePaddle, Baidu's industrial-grade open-source deep learning platform, and the Wenxin large model reinforce each other, making the PaddlePaddle ecosystem more prosperous: it has attracted 8 million developers, serves 220,000 enterprises and institutions, and has produced 800,000 models built on PaddlePaddle. Wang Haifeng also explained the meaning behind "Galaxy Community", the Chinese name of AI Studio, the PaddlePaddle developer community: "Wenxin and PaddlePaddle combine to enter the galaxy together." With PaddlePaddle and Wenxin leading the way, Baidu will work with all developers to build the Galaxy Community and jointly explore the endless possibilities of general artificial intelligence.
Large-scale language models bring new hope for general artificial intelligence
Wang Haifeng believes that the core basic capabilities of general artificial intelligence are understanding, generation, logic, and memory, and that large language models have all four, bringing hope for the realization of general artificial intelligence.
Specifically, the typical abilities of artificial intelligence, such as creation, programming, problem solving, and planning, all build on the core basic capabilities of understanding, generation, logic, and memory, although they depend on each to a different degree. Problem solving, for example, draws on understanding, memory, logic, and generation together, from reading the question and working out the solution to finally writing the answer.
How are these abilities obtained? Taking Wenxin Yiyan (ERNIE Bot) as an example, a large pre-trained model is first trained through fusion learning on data at the trillion scale and knowledge at the hundred-billion scale. It is then further optimized using techniques such as supervised fine-tuning, reinforcement learning from human feedback, and prompting. In addition, the model has the technical advantages of knowledge enhancement, retrieval enhancement, and dialogue enhancement.
Basic general capabilities have been comprehensively improved through a series of technical innovations: multi-strategy optimization of data sources and data distribution, long-text modeling in the base model, multi-type and multi-stage supervised fine-tuning, multi-task adaptive supervised fine-tuning, and multi-level, multi-granularity reward models. On top of retrieval enhancement and knowledge enhancement, knowledge-point enhancement improves the model's ability to master and apply world knowledge. Logical capabilities are improved by building large-scale logical data, logical knowledge modeling, multi-granularity semantic knowledge combination, and symbolic neural networks. The security of large models is ensured by a comprehensive security system covering data, content, model, and system security.
Through PaddlePaddle's end-to-end adaptive hybrid parallel training technology and collaboratively optimized compression, inference, and service deployment, the training speed of the Wenxin large model has increased by 3 times and its inference speed by more than 30 times.
In applications, scenario adaptation and collaborative optimization have been carried out through data-driven methods, prompt construction, and plug-in enhancement. Wenxin Yiyan has launched five plug-ins: Baidu Search, Browsing Documents, E Yan Yi Tu, Shuo Tu Jie Hua, and Yijing Liuying. These plug-ins enable the model, respectively, to generate accurate real-time information, summarize and answer questions about long texts, produce data insights and charts, create and answer questions based on images, and generate video from text. The plug-in mechanism expands the functional boundaries of large models and better meets the needs of different scenarios. Wang Haifeng said that in the future, Baidu will work with developers to build a plug-in ecosystem and share the results of technological innovation.
Artificial intelligence represented by large language models is penetrating into thousands of industries, accelerating industrial upgrading and economic growth. In this process, technological innovation and application implementation form a virtuous cycle. Capabilities such as understanding, generation, logic, and memory continue to improve. The breadth and depth of industrial applications continue to expand. Large-scale language models bring new hope for general artificial intelligence.