Home Technology peripherals AI The 'Android moment' of domestic large AI models has arrived! Alibaba Cloud Tongyi Qianwen is free, open source, and available for commercial use

The 'Android moment' of domestic large AI models has arrived! Alibaba Cloud Tongyi Qianwen is free, open source, and available for commercial use

Aug 05, 2023 pm 05:45 PM
Ali Cloud large model Android moment

After its overseas Meta, Alibaba has become another technology giant that promotes the trend of artificial intelligence (AI) large model "Android moment"

According to reports from Beijing Business Daily, Alibaba Cloud will release the open source general question and answer model Qwen-7B and conversation model Qwen-7B-Chat on Thursday, August 3. Both models have 7 billion parameters. They have launched the first "Model as a Service" open platform in China, the Magic Community, and it can be used for free, and commercial use is also allowed

Users can quantify Qwen-7B and Qwen-7B-Chat through open source code, and deploy and run models on consumer-grade graphics cards. They can directly download the model from the Moda community, or access and call Qwen-7B and Qwen-7B-Chat through the Alibaba Cloud Lingji platform. Alibaba Cloud provides users with services including model training, inference, deployment and fine-tuning

On the Magic Tower community, there is a post dedicated to the installation method of the Tongyi Qianwen model, the best practices for creating space experience, model reasoning and model training, and also attaches screenshots of the model link and download situation

The Android moment of domestic large AI models has arrived! Alibaba Cloud Tongyi Qianwen is free, open source, and available for commercial use

According to public information, Qwen-7B is a base model that is pre-trained using deduplicated and filtered data of more than 2.2 trillion tokens. It supports multiple languages ​​such as Chinese and English, and has a context window length of 8k. The model contains high-quality Chinese, English, multi-language, code, mathematics and other data, covering the entire network text, encyclopedia, books, code, mathematics and vertical fields in various fields

According to the MMLU evaluation results, Qwen-7B performed well in English evaluation, surpassing other similar open source pre-training models and being competitive with larger-scale models. In terms of Chinese evaluation, Qwen-7B achieved the highest score on the C-Eval validation set and was competitive even with larger-scale models

The following is a comparison of the MMLU 5-shot accuracy results of Qwen-7B

The Android moment of domestic large AI models has arrived! Alibaba Cloud Tongyi Qianwen is free, open source, and available for commercial use

Alibaba Cloud has built an AI assistant Qwen-7B-Chat based on the base model through the alignment mechanism. It is a large language model of Chinese and English dialogue based on Transformer, which has successfully achieved alignment with human cognition. The model uses a variety of pre-training data, including online texts, professional books, codes, etc., covering a wide range of areas

The zero-shot accuracy of the Qwen-7B-Chat model on both the C-Eval validation set and the MMLU evaluation set exceeds that of other similar alignment models

The following is a comparison of the zero-shot accuracy results on the C-Eval test set

The Android moment of domestic large AI models has arrived! Alibaba Cloud Tongyi Qianwen is free, open source, and available for commercial use

Alibaba Cloud became the first large technology company in China to join the ranks of open source large models. In July this year, it jointly released with Meta a commercial version of the open source AI model Llama 2, which can replace OpenAI and Google's models. In addition, Zhipu AI and Tsinghua KEG Laboratory also announced China’s top open source large model

in July

The advantages of open source models are to increase user acceptance and provide more data for artificial intelligence processing. The larger the data volume of LLM, the more powerful its function. In addition, the open source model helps researchers and developers find and solve vulnerabilities, improving technology and security levels

At the Alibaba Cloud Summit in April 2023, Alibaba announced the opening of Tongyi Qianwen to enterprises, allowing enterprises to use Tongyi Qianwen’s capabilities to train their own large models

Zhou Jingren, Chief Technology Officer (CTO) of Alibaba Cloud Intelligence Group, said that in the future, enterprises can make full use of Alibaba Cloud's Tongyi Qianwen capabilities and combine their own industry knowledge and application scenarios to train customized enterprise large models. For example, each company can have its own intelligent customer service, intelligent shopping guide, intelligent voice assistant, copywriting assistant, AI designer and self-driving model and other functions

Zhang Yong, CEO of Alibaba Group and CEO of Alibaba Cloud Intelligence Group, said that all Alibaba products will be integrated with the Tongyi Qianwen large model

The Android moment of domestic large AI models has arrived! Alibaba Cloud Tongyi Qianwen is free, open source, and available for commercial use

Alibaba Cloud hopes to help more companies use large models to adapt to the needs of the AI ​​era, so that each company can have its own dedicated large model for its industry capabilities, and reconstruct it based on Tongyi Qianwen

The above is the detailed content of The 'Android moment' of domestic large AI models has arrived! Alibaba Cloud Tongyi Qianwen is free, open source, and available for commercial use. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Big model app Tencent Yuanbao is online! Hunyuan is upgraded to create an all-round AI assistant that can be carried anywhere Big model app Tencent Yuanbao is online! Hunyuan is upgraded to create an all-round AI assistant that can be carried anywhere Jun 09, 2024 pm 10:38 PM

On May 30, Tencent announced a comprehensive upgrade of its Hunyuan model. The App "Tencent Yuanbao" based on the Hunyuan model was officially launched and can be downloaded from Apple and Android app stores. Compared with the Hunyuan applet version in the previous testing stage, Tencent Yuanbao provides core capabilities such as AI search, AI summary, and AI writing for work efficiency scenarios; for daily life scenarios, Yuanbao's gameplay is also richer and provides multiple features. AI application, and new gameplay methods such as creating personal agents are added. "Tencent does not strive to be the first to make large models." Liu Yuhong, vice president of Tencent Cloud and head of Tencent Hunyuan large model, said: "In the past year, we continued to promote the capabilities of Tencent Hunyuan large model. In the rich and massive Polish technology in business scenarios while gaining insights into users’ real needs

Bytedance Beanbao large model released, Volcano Engine full-stack AI service helps enterprises intelligently transform Bytedance Beanbao large model released, Volcano Engine full-stack AI service helps enterprises intelligently transform Jun 05, 2024 pm 07:59 PM

Tan Dai, President of Volcano Engine, said that companies that want to implement large models well face three key challenges: model effectiveness, inference costs, and implementation difficulty: they must have good basic large models as support to solve complex problems, and they must also have low-cost inference. Services allow large models to be widely used, and more tools, platforms and applications are needed to help companies implement scenarios. ——Tan Dai, President of Huoshan Engine 01. The large bean bag model makes its debut and is heavily used. Polishing the model effect is the most critical challenge for the implementation of AI. Tan Dai pointed out that only through extensive use can a good model be polished. Currently, the Doubao model processes 120 billion tokens of text and generates 30 million images every day. In order to help enterprises implement large-scale model scenarios, the beanbao large-scale model independently developed by ByteDance will be launched through the volcano

Alibaba Cloud announced that the 2024 Yunqi Conference will be held in Hangzhou from September 19th to 21st. Free application for free tickets Alibaba Cloud announced that the 2024 Yunqi Conference will be held in Hangzhou from September 19th to 21st. Free application for free tickets Aug 07, 2024 pm 07:12 PM

According to news from this website on August 5, Alibaba Cloud announced that the 2024 Yunqi Conference will be held in Yunqi Town, Hangzhou from September 19th to 21st. There will be a three-day main forum, 400 sub-forums and parallel topics, as well as nearly four Ten thousand square meters of exhibition area. Yunqi Conference is free and open to the public. From now on, the public can apply for free tickets through the official website of Yunqi Conference. An all-pass ticket of 5,000 yuan can be purchased. The ticket website is attached on this website: https://yunqi.aliyun.com/2024 /ticket-list According to reports, the Yunqi Conference originated in 2009 and was originally named the First China Website Development Forum. In 2011, it evolved into the Alibaba Cloud Developer Conference. In 2015, it was officially renamed the "Yunqi Conference" and has continued to successful move

Uncovering the NVIDIA large model inference framework: TensorRT-LLM Uncovering the NVIDIA large model inference framework: TensorRT-LLM Feb 01, 2024 pm 05:24 PM

1. Product positioning of TensorRT-LLM TensorRT-LLM is a scalable inference solution developed by NVIDIA for large language models (LLM). It builds, compiles and executes calculation graphs based on the TensorRT deep learning compilation framework, and draws on the efficient Kernels implementation in FastTransformer. In addition, it utilizes NCCL for communication between devices. Developers can customize operators to meet specific needs based on technology development and demand differences, such as developing customized GEMM based on cutlass. TensorRT-LLM is NVIDIA's official inference solution, committed to providing high performance and continuously improving its practicality. TensorRT-LL

Advanced practice of industrial knowledge graph Advanced practice of industrial knowledge graph Jun 13, 2024 am 11:59 AM

1. Background Introduction First, let’s introduce the development history of Yunwen Technology. Yunwen Technology Company...2023 is the period when large models are prevalent. Many companies believe that the importance of graphs has been greatly reduced after large models, and the preset information systems studied previously are no longer important. However, with the promotion of RAG and the prevalence of data governance, we have found that more efficient data governance and high-quality data are important prerequisites for improving the effectiveness of privatized large models. Therefore, more and more companies are beginning to pay attention to knowledge construction related content. This also promotes the construction and processing of knowledge to a higher level, where there are many techniques and methods that can be explored. It can be seen that the emergence of a new technology does not necessarily defeat all old technologies. It is also possible that the new technology and the old technology will be integrated with each other.

Benchmark GPT-4! China Mobile's Jiutian large model passed dual registration Benchmark GPT-4! China Mobile's Jiutian large model passed dual registration Apr 04, 2024 am 09:31 AM

According to news on April 4, the Cyberspace Administration of China recently released a list of registered large models, and China Mobile’s “Jiutian Natural Language Interaction Large Model” was included in it, marking that China Mobile’s Jiutian AI large model can officially provide generative artificial intelligence services to the outside world. . China Mobile stated that this is the first large-scale model developed by a central enterprise to have passed both the national "Generative Artificial Intelligence Service Registration" and the "Domestic Deep Synthetic Service Algorithm Registration" dual registrations. According to reports, Jiutian’s natural language interaction large model has the characteristics of enhanced industry capabilities, security and credibility, and supports full-stack localization. It has formed various parameter versions such as 9 billion, 13.9 billion, 57 billion, and 100 billion, and can be flexibly deployed in Cloud, edge and end are different situations

New test benchmark released, the most powerful open source Llama 3 is embarrassed New test benchmark released, the most powerful open source Llama 3 is embarrassed Apr 23, 2024 pm 12:13 PM

If the test questions are too simple, both top students and poor students can get 90 points, and the gap cannot be widened... With the release of stronger models such as Claude3, Llama3 and even GPT-5 later, the industry is in urgent need of a more difficult and differentiated model Benchmarks. LMSYS, the organization behind the large model arena, launched the next generation benchmark, Arena-Hard, which attracted widespread attention. There is also the latest reference for the strength of the two fine-tuned versions of Llama3 instructions. Compared with MTBench, which had similar scores before, the Arena-Hard discrimination increased from 22.6% to 87.4%, which is stronger and weaker at a glance. Arena-Hard is built using real-time human data from the arena and has a consistency rate of 89.1% with human preferences.

GPT Store can't even open its doors. How dare this domestic platform take this path? ? GPT Store can't even open its doors. How dare this domestic platform take this path? ? Apr 19, 2024 pm 09:30 PM

Pay attention, this man has connected more than 1,000 large models, allowing you to plug in and switch seamlessly. Recently, a visual AI workflow has been launched: giving you an intuitive drag-and-drop interface, you can drag, pull, and drag to arrange your own workflow on an infinite canvas. As the saying goes, war costs speed, and Qubit heard that within 48 hours of this AIWorkflow going online, users had already configured personal workflows with more than 100 nodes. Without further ado, what I want to talk about today is Dify, an LLMOps company, and its CEO Zhang Luyu. Zhang Luyu is also the founder of Dify. Before joining the business, he had 11 years of experience in the Internet industry. I am engaged in product design, understand project management, and have some unique insights into SaaS. Later he

See all articles