


Alibaba Cloud releases General Question Answering 2.0, which surpasses GPT-3.5 in performance and accelerates its pursuit of GPT-4
On October 31, Alibaba Cloud officially released Tongyi Qianwen 2.0, a large model with hundreds of billions of parameters. In 10 authoritative evaluations, the comprehensive performance of Tongyi Qianwen 2.0 exceeded GPT-3.5 and is currently Accelerate to catch up with GPT-4. On the same day, Tongyi Qianwen APP was officially launched in major mobile application markets, and everyone can directly experience the latest model capabilities through the APP.
In the past six months, Tongyi Qianwen 2.0 has made a huge leap in performance. Compared with version 1.0 released in April, Tongyi Qianwen 2.0has been significantly improvedin the abilities of understanding complex instructions, literary creation, general mathematics, knowledge memory, and resisting hallucinations. At present, the comprehensive performance of
Tongyi Qianwen has exceeded GPT-3.5, accelerating to catch up with GPT-4.Picture: Tongyi Qianwen 2.0 comprehensive performancehas exceeded GPT-3.5 and is accelerating to catch up GPT-4
in MMLU, C-Eval, GSM8K, HumanEval, MATH, etc. 10 On a
mainstream benchmark evaluation set, Tongyi Qianwen 2.0's overall score surpassed Meta's Llama-2-70B, compared with OpenAI's Chat-3.5, it was nine wins and one loss, and compared with GPT-4, it was With four wins and six losses, the gap with GPT-4 has further narrowed.The ability to understand Chinese and English is the basic skill of a large language model.
In terms of English tasks, Tongyi Qianwen 2.0 scored 82.5 on the MMLU benchmark, second only to GPT-4. By significantly increasing the number of parameters, Tongyi Qianwen 2.0 can better understand and process complex tasks. In terms of language structure and concepts; in terms of Chinese tasks, Tongyi Qianwen 2.0 achieved the highest score on the C-Eval benchmark with a clear advantage. This is because the model learned more Chinese corpus during training, further strengthening its Chinese understanding and expression capabilities.In areas such as mathematical reasoning and code understanding, Tongyi Qianwen 2.0 has made significant progress. In the reasoning benchmark test GSM8K, Tongyi Qianwen ranked second, demonstrating strong computing and logical reasoning capabilities; in the HumanEval test, Tongyi Qianwen's score closely followed GPT-4 and GPT-3.5, which mainly measures large-scale The ability of the model to understand and execute code fragments is the basis for large models to be used in scenarios such as programming assistance and automatic code repair.
##Picture: Tongyi Qianwen 2.0release
##Tongyi Qianwen is more mature and easier to use. Tongyi Qianwen 2.0 has made technical optimizations in terms of instruction compliance, tool use, refined creation, etc. can be better integrated into downstream application scenarios. Tongyi Large Model official website has launched multi-modal and plug-in functions, supporting segmented tasks such as image input and document parsing.
At the same time, eight major industry model groups based on Tongyi large model training were launched. They are Tongyi Lingma-Intelligent Coding Assistant, Tongyi Zhiwen-AI Reading Assistant, Tongyi Listening-Work and Study AI Assistant. ##、Tongyi Xiaomi-Intelligent Customer Service、 Tongyi Renxin-Personal Exclusive health assistant , Tongyi Farui-AI legal advisor. 8 major industry models are oriented to the most popular vertical scenarios, using domain data for specialized training. Users can directly experience model functions on the official website, and developers can integrate model capabilities into their own large model applications and services through web page embedding, API/SDK calls, etc. Picture: Tongyi large model family has been fully upgraded, 8 major industry modelsgroups are online
As of October, Alibaba Cloud has conducted in-depth cooperation with more than 60 industry leaders , to promote the implementation of Tongyi Qianwen in the fields of office, cultural tourism, electric power, government affairs, medical insurance, transportation, manufacturing, finance, software development and other fields.
Zhou Jingren revealed that Alibaba Cloud plans to open source the 72B version of Tongyi Qianwen in the near future. Previously, Alibaba Cloud has open sourced the 7B and 14B version models, and the cumulative number of
. Alibaba Cloud will continue to support developers from thousands of industries to innovate models and applications based on the Tongyi Qianwen open source model.
Picture: Tongyi Qianwen 72B will be open source soon
The above is the detailed content of Alibaba Cloud releases General Question Answering 2.0, which surpasses GPT-3.5 in performance and accelerates its pursuit of GPT-4. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



According to news from this website on August 5, Alibaba Cloud announced that the 2024 Yunqi Conference will be held in Yunqi Town, Hangzhou from September 19th to 21st. There will be a three-day main forum, 400 sub-forums and parallel topics, as well as nearly four Ten thousand square meters of exhibition area. Yunqi Conference is free and open to the public. From now on, the public can apply for free tickets through the official website of Yunqi Conference. An all-pass ticket of 5,000 yuan can be purchased. The ticket website is attached on this website: https://yunqi.aliyun.com/2024 /ticket-list According to reports, the Yunqi Conference originated in 2009 and was originally named the First China Website Development Forum. In 2011, it evolved into the Alibaba Cloud Developer Conference. In 2015, it was officially renamed the "Yunqi Conference" and has continued to successful move

Alibaba Cloud today announced an open source project called Qwen-14B, which includes a parametric model and a conversation model. This open source project allows free commercial use. This site states: Alibaba Cloud has previously open sourced a parameter model Qwen-7B worth 7 billion. The download volume in more than a month has exceeded 1 million times. According to the data provided by Alibaba Cloud, Qwen -14B surpasses models of the same size in multiple authoritative evaluations, and some indicators are even close to Llama2-70B. According to reports, Qwen-14B is a high-performance open source model that supports multiple languages. Its overall training data exceeds 3 trillion Tokens, has stronger reasoning, cognition, planning and memory capabilities, and supports a maximum context window of 8k

According to news on November 7, Tongyi Qianwen App, a subsidiary of Alibaba Cloud, has recently landed on the Apple AppStore, providing Apple users with a brand new application choice. The application's installation package size is 25.9MB and has been launched on multiple Android application markets before. Tongyi Qianwen is a powerful and ultra-large-scale pre-training model that can provide users with comprehensive assistance in multiple fields such as creative copywriting, office assistants, learning assistance, and interesting life. According to the application introduction, the functions of the application include: In the field of creative copywriting, users can generate Xiaohongshu copywriting, create scripts, and perform rewriting and polishing operations. The office assistant function can generate code, interpret code, and expand weekly reports, etc. The learning assistant has many functions such as Chinese-English translation, math problem solving, and classical Chinese translation.

Detailed explanation of Maven Alibaba Cloud image configuration Maven is a Java project management tool. By configuring Maven, you can easily download dependent libraries and build projects. The Alibaba Cloud image can speed up Maven's download speed and improve project construction efficiency. This article will introduce in detail how to configure Alibaba Cloud mirroring and provide specific code examples. What is Alibaba Cloud Image? Alibaba Cloud Mirror is the Maven mirror service provided by Alibaba Cloud. By using Alibaba Cloud Mirror, you can greatly speed up the downloading of Maven dependency libraries. Alibaba Cloud Mirror

Alibaba Cloud caching mechanisms include Alibaba Cloud Redis, Alibaba Cloud Memcache, distributed cache service DSC, Alibaba Cloud Table Store, CDN, etc. Detailed introduction: 1. Alibaba Cloud Redis: A distributed memory database provided by Alibaba Cloud that supports high-speed reading and writing and data persistence. By storing data in memory, it can provide low-latency data access and high concurrency processing capabilities; 2. Alibaba Cloud Memcache: the cache system provided by Alibaba Cloud, etc.

According to news from this website on November 8, Alibaba Cloud issued a statement today saying that a self-media article titled "Alibaba's "Master Tai" Zheng Junfang will resign as executive director and general manager of Alibaba Cloud". The content of this article is purely fabricated and seriously inaccurate. . Alibaba Cloud reserves the right to pursue legal liability against relevant self-media. Judging from the screenshots posted by Alibaba Cloud, this article comes from "Leopard Change". As of the time of publishing on this site, the article has not been deleted. The article stated that "Zheng Junfang may gradually retire in the future, step down as the chief risk officer and chief financial officer of Cloud Intelligence Group, and will no longer be in charge of specific business." Public information shows that Zheng Junfang is currently a partner of Alibaba, chief risk officer of Alibaba Group, director of Cloud Intelligence Group, and concurrently serves as the group's CCO and head of the group's customer experience business group. She took office as Ali in September this year

Today, Beijing Kingsoft Office Software Co., Ltd. ("Kingsoft Office" for short) and Alibaba Cloud have reached a strategic cooperation. Both parties will leverage their respective technical advantages and platform capabilities to develop cloud resources, AI large models, product ecological integration, joint solutions, etc. Carry out in-depth cooperation in multiple fields to achieve ecological coordinated development. Zhang Qingyuan, CEO of Kingsoft Office, and Wang Jian, academician of the Chinese Academy of Engineering and founder of Alibaba Cloud, witnessed the signing. Jiang Zhiqiang, Senior Vice President of Kingsoft Office, and Zhang Tao, Vice President of Global Commercial of Alibaba Cloud Intelligence Group, signed the cooperation agreement on behalf of both parties. Kingsoft Office is a leading office software service provider in China, providing office services to users in more than 220 countries and regions around the world. In order to promote technical cooperation and ecological synergy between the two parties, create better smart office applications and provide users with more

How to configure Alibaba Cloud Win server to support PHP running? With the rise of web applications, PHP is widely used as a popular server-side scripting language. Setting up and running a PHP environment on Alibaba Cloud's Windows server is one of the challenges faced by many developers and administrators. This article will introduce in detail how to configure the PHP environment on Alibaba Cloud's Windows server so that it can run smoothly. First, make sure you have purchased a Windows server on Alibaba Cloud and connected it
