Zhipu AI launches the third-generation large base model ChatGLM3 to adapt to more domestic chips


WBOY

Oct 30, 2023 pm 06:05 PM

On October 27, 2023, Zhipu AI released its self-developed third-generation large base model, ChatGLM3, together with a related series of products, at the China Computer Conference (CNCC). The release marks another major step for Zhipu AI, following its 100-billion-scale base dialogue models ChatGLM and ChatGLM2.

ChatGLM3 was developed with an original multi-stage enhanced pre-training method, which the company says makes training more thorough. According to the evaluation results, ChatGLM3 ranked first among domestic models of the same size across 44 public Chinese and English benchmark datasets. Zhang Peng, CEO of Zhipu AI, presented the new products at the press conference and demonstrated their latest features live.

ChatGLM3: new technology upgrades, higher performance, lower cost

With richer training data and an improved training scheme, ChatGLM3 is more powerful than its predecessor. Compared with ChatGLM2, its MMLU score increased by 36%, CEval by 33%, GSM8K by 179%, and BBH by 126%.
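To make the percentage gains above easier to read, here is a minimal sketch that converts a relative increase into a score multiplier. The article does not give the underlying benchmark scores, so the function only works with the quoted percentages; the names `multiplier` and `reported_gains` are illustrative.

```python
# Relative increases over ChatGLM2 as quoted in the article (in percent).
reported_gains = {"MMLU": 36, "CEval": 33, "GSM8K": 179, "BBH": 126}

def multiplier(percent_increase: float) -> float:
    """Convert a relative increase in percent into a score multiplier.

    An increase of p percent means new_score = old_score * (1 + p / 100).
    """
    return 1 + percent_increase / 100

# Multiplier implied by each quoted gain, rounded for display.
multipliers = {name: round(multiplier(p), 2) for name, p in reported_gains.items()}

for name, m in multipliers.items():
    print(f"{name}: new score is {m}x the ChatGLM2 score")
```

Read this way, a 179% increase on GSM8K means the new score is 2.79 times the old one, not 1.79 times.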

ChatGLM3 also targets GPT-4V with iterative upgrades to several new capabilities. CogVLM provides multimodal understanding (image semantics recognition) and has achieved SOTA results on more than 10 international standard image-text benchmark datasets. The code enhancement module, Code Interpreter, generates and executes code according to user needs, automatically completing complex tasks such as data analysis and file processing. The web search enhancement, WebGLM, automatically finds relevant information on the Internet based on the question and provides links to the relevant literature or articles in its answer. ChatGLM3's semantic and logical capabilities have also been greatly enhanced.

ChatGLM3 also integrates the self-developed AgentTuning technology, which activates the model's agent capabilities. In intelligent planning and execution in particular, it improves on ChatGLM2 by 1000%. It also lets a domestic large model natively support complex scenarios such as tool calling, code execution, games, database operations, knowledge graph search and reasoning, and operating systems.
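For readers unfamiliar with tool calling, the general pattern can be sketched as follows: the model emits a structured request naming a tool and its arguments, and the host program runs the matching function. This is a generic illustration only. The JSON shape, the tool names, and the `dispatch` helper are assumptions made for the example, not ChatGLM3's actual interface.

```python
import json

# Hypothetical tools; names and signatures are illustrative only.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

def add(a: float, b: float) -> float:
    return a + b

# Registry mapping tool names to the Python callables that implement them.
TOOLS = {"get_weather": get_weather, "add": add}

def dispatch(tool_call_json: str):
    """Parse a JSON tool call and run the matching registered function."""
    call = json.loads(tool_call_json)
    name = call["name"]
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    return TOOLS[name](**call.get("arguments", {}))

# Stand-in for model output; a real model would generate this string.
model_output = '{"name": "add", "arguments": {"a": 2, "b": 3}}'
result = dispatch(model_output)
print(result)
```

In a full agent loop, the tool result would be passed back to the model so it can continue planning or compose a final answer.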

In addition, ChatGLM3 introduces the on-device models ChatGLM3-1.5B and ChatGLM3-3B, which can be deployed on mobile phones. They support a variety of phones and in-vehicle platforms, including vivo, Xiaomi, and Samsung, and can even run inference on the CPU chips of mobile platforms at speeds of up to 20 tokens/s. In terms of accuracy, the 1.5B and 3B models perform close to ChatGLM2-6B on public benchmarks.

Based on the latest efficient dynamic inference and memory optimization technology, ChatGLM3's current inference framework outperforms the best current open-source implementations under the same hardware and model conditions, including vLLM from UC Berkeley and the latest version of Hugging Face TGI. Inference speed is 2-3 times faster and inference cost is cut in half, to just 0.5 fen (0.005 yuan) per thousand tokens, the lowest cost according to the company.
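As a rough illustration of what the quoted price means in practice, the sketch below estimates the cost of a workload. It assumes the unit in the quoted price is the fen (one hundredth of a yuan) and that the rate of 0.5 per thousand tokens applies uniformly; the function name and the example workload are hypothetical.

```python
FEN_PER_YUAN = 100  # 1 yuan = 100 fen

def inference_cost_yuan(tokens: int, fen_per_thousand_tokens: float = 0.5) -> float:
    """Estimate inference cost in yuan for a given number of tokens.

    Assumes a flat per-thousand-token rate quoted in fen.
    """
    cost_fen = tokens / 1000 * fen_per_thousand_tokens
    return cost_fen / FEN_PER_YUAN

# Hypothetical workload of one million tokens.
million_token_cost = inference_cost_yuan(1_000_000)
print(f"1,000,000 tokens cost about {million_token_cost:.2f} yuan")
```

Under these assumptions, one million tokens cost about 5 yuan.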
