Home Technology peripherals AI The first batch of large AI models are open to the public

The first batch of large AI models are open to the public

Sep 18, 2023 pm 05:41 PM
ai model Open to the public Programming at scale

In the early morning of yesterday, Baidu and Baichuan Intelligence successively announced that the large artificial intelligence models Wen Xinyiyan and Baichuan large models are open to the public. They are also the first large language models in my country to open their services to the public through registration.

The first batch of large AI models are open to the public

Information map Web page screenshot

At 0:00 yesterday, Wen Xin Yi Yan announced that it would be the first to be fully open to the whole society. Users can download the "Wen Xin Yi Yan" application in the app store or log in to the "Wen Xin Yi Yan official website" to experience it. It is reported that Baidu will also open a batch of newly reconstructed AI native applications, allowing users to fully experience the four core capabilities of generative AI: understanding, generation, logic, and memory

In the early hours of yesterday, Baichuan Intelligent announced that its large model had been registered and passed the "Interim Measures for Generative Artificial Intelligence Service Management". Baichuan Intelligence was founded by former Sogou CEO Wang Xiaochuan on April 10 this year. The core team is composed of top talents from well-known technology companies such as Sogou, Baidu, Huawei, Microsoft, ByteDance, and Tencent. Only 4 months after its establishment, Baichuan Intelligent has released three general-purpose large language models, including Baichuan-7B, the country's first open source large language model with 7 billion parameters that can be used for free commercial use, and Baichuan, a large language model with 53 billion parameters. -53B

In the first batch of registration lists of the "Interim Measures for Generative Artificial Intelligence Service Management", in addition to Baidu and Baichuan Intelligence, there are large models from enterprises and institutions such as ByteDance, SenseTime, Zidong Taichu, Zhipu Huazhang, etc. It is also included and can officially provide services to the public

The first batch of large AI models are now open to the public

Process Editor: u027

The above is the detailed content of The first batch of large AI models are open to the public. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

The demand for computing power has exploded under the wave of AI large models. SenseTime's 'large model + large computing power” empowers the development of multiple industries. The demand for computing power has exploded under the wave of AI large models. SenseTime's 'large model + large computing power” empowers the development of multiple industries. Jun 09, 2023 pm 07:35 PM

Recently, the "Lingang New Area Intelligent Computing Conference" with the theme of "AI leads the era, computing power drives the future" was held. At the meeting, the New Area Intelligent Computing Industry Alliance was formally established. SenseTime became a member of the alliance as a computing power provider. At the same time, SenseTime was awarded the title of "New Area Intelligent Computing Industry Chain Master" enterprise. As an active participant in the Lingang computing power ecosystem, SenseTime has built one of the largest intelligent computing platforms in Asia - SenseTime AIDC, which can output a total computing power of 5,000 Petaflops and support 20 ultra-large models with hundreds of billions of parameters. Train at the same time. SenseCore, a large-scale device based on AIDC and built forward-looking, is committed to creating high-efficiency, low-cost, and large-scale next-generation AI infrastructure and services to empower artificial intelligence.

Researcher: AI model inference consumes more power, and industry electricity consumption in 2027 will be comparable to that of the Netherlands Researcher: AI model inference consumes more power, and industry electricity consumption in 2027 will be comparable to that of the Netherlands Oct 14, 2023 am 08:25 AM

IT House reported on October 13 that "Joule", a sister journal of "Cell", published a paper this week called "The growing energy footprint of artificial intelligence (The growing energy footprint of artificial intelligence)". Through inquiries, we learned that this paper was published by Alex DeVries, the founder of the scientific research institution Digiconomist. He claimed that the reasoning performance of artificial intelligence in the future may consume a lot of electricity. It is estimated that by 2027, the electricity consumption of artificial intelligence may be equivalent to the electricity consumption of the Netherlands for a year. Alex DeVries said that the outside world has always believed that training an AI model is "the most important thing in AI".

China Unicom releases large image and text AI model that can generate images and video clips from text China Unicom releases large image and text AI model that can generate images and video clips from text Jun 29, 2023 am 09:26 AM

Driving China News on June 28, 2023, today during the Mobile World Congress in Shanghai, China Unicom released the graphic model "Honghu Graphic Model 1.0". China Unicom said that the Honghu graphic model is the first large model for operators' value-added services. China Business News reporter learned that Honghu’s graphic model currently has two versions of 800 million training parameters and 2 billion training parameters, which can realize functions such as text-based pictures, video editing, and pictures-based pictures. In addition, China Unicom Chairman Liu Liehong also said in today's keynote speech that generative AI is ushering in a singularity of development, and 50% of jobs will be profoundly affected by artificial intelligence in the next two years.

If they disagree, they will score points. Why are big domestic AI models addicted to 'swiping the rankings'? If they disagree, they will score points. Why are big domestic AI models addicted to 'swiping the rankings'? Dec 02, 2023 am 08:53 AM

I believe that friends who follow the mobile phone circle will not be unfamiliar with the phrase "get a score if you don't accept it". For example, theoretical performance testing software such as AnTuTu and GeekBench have attracted much attention from players because they can reflect the performance of mobile phones to a certain extent. Similarly, there are corresponding benchmarking software for PC processors and graphics cards to measure their performance. Since "everything can be benchmarked", the most popular large AI models have also begun to participate in benchmarking competitions, especially in the "Hundred Models" After the "war" began, there were breakthroughs almost every day. Each company claimed to be "the first in running scores." The large domestic AI models almost never fell behind in terms of performance scores, but they were never able to surpass GP in terms of user experience.

Four times faster, Bytedance's open source high-performance training inference engine LightSeq technology revealed Four times faster, Bytedance's open source high-performance training inference engine LightSeq technology revealed May 02, 2023 pm 05:52 PM

The Transformer model comes from the paper "Attentionisallyouneed" published by the Google team in 2017. This paper first proposed the concept of using Attention to replace the cyclic structure of the Seq2Seq model, which brought a great impact to the NLP field. And with the continuous advancement of research in recent years, Transformer-related technologies have gradually flowed from natural language processing to other fields. Up to now, the Transformer series models have become mainstream models in NLP, CV, ASR and other fields. Therefore, how to train and infer Transformer models faster has become an important research direction in the industry. Low-precision quantization techniques can

The Network Center of the Joint Institute of Physics, Chinese Academy of Sciences releases the AI ​​model MatChat The Network Center of the Joint Institute of Physics, Chinese Academy of Sciences releases the AI ​​model MatChat Nov 03, 2023 pm 08:13 PM

IT House reported on November 3 that the official website of the Institute of Physics of the Chinese Academy of Sciences published an article. Recently, the SF10 Group of the Institute of Physics of the Chinese Academy of Sciences/Beijing National Research Center for Condensed Matter Physics and the Computer Network Information Center of the Chinese Academy of Sciences collaborated to apply large AI models to materials science. In the field, tens of thousands of chemical synthesis pathway data are fed to the large language model LLAMA2-7b, thereby obtaining a MatChat model, which can be used to predict the synthesis pathways of inorganic materials. IT House noted that the model can perform logical reasoning based on the queried structure and output the corresponding preparation process and formula. It has been deployed online and is open to all materials researchers, bringing new inspiration and new ideas to materials research and innovation. This work is for large language models in the field of segmented science

Meta researchers make a new AI attempt: teaching robots to navigate physically without maps or training Meta researchers make a new AI attempt: teaching robots to navigate physically without maps or training Apr 09, 2023 pm 08:31 PM

The artificial intelligence department of Meta Platforms recently stated that they are teaching AI models how to learn to walk in the physical world with the support of a small amount of training data, and have made rapid progress. This research can significantly shorten the time for AI models to acquire visual navigation capabilities. Previously, achieving such goals required repeated "reinforcement learning" using large data sets. Meta AI researchers said that this exploration of AI visual navigation will have a significant impact on the virtual world. The basic idea of ​​the project is not complicated: to help AI navigate physical space just like humans do, simply through observation and exploration. The Meta AI department explained, “For example, if we want AR glasses to guide us to find keys, we must

Nvidia releases TensorRT-LLM open source software to improve AI model performance on high-end GPU chips Nvidia releases TensorRT-LLM open source software to improve AI model performance on high-end GPU chips Sep 14, 2023 pm 12:29 PM

Nvidia recently announced the launch of a new open source software suite called TensorRT-LLM, which expands the capabilities of large language model optimization on Nvidia GPUs and breaks the limits of artificial intelligence inference performance after deployment. Generative AI large language models have become popular due to their impressive capabilities. It expands the possibilities of artificial intelligence and is widely used in various industries. Users can obtain information by talking to chatbots, summarize large documents, write software code, and discover new ways to understand information, said Ian Buck, vice president of hyperscale and high-performance computing at Nvidia Corporation: "Large language model inference is becoming increasingly difficult. .The complexity of the model continues to increase, the model becomes more and more intelligent, and it becomes

See all articles