Home Technology peripherals AI Supports 380,000 words input at a time! Tencent Hunyuan launches 256k long article model, open to enterprises and individual developers through Tencent Cloud​

Supports 380,000 words input at a time! Tencent Hunyuan launches 256k long article model, open to enterprises and individual developers through Tencent Cloud​

Jun 08, 2024 am 11:11 AM
Tencent Cloud Tencent Hunyuan large model

AILarge model technology is becoming a key force in promoting the development of high-quality productivity and plays an important role in the integration with thousands of industries. Tencent's Hunyuan large model has expanded the model to trillions of parameter scale by adopting the hybrid expert model (MoE) structure, adding "Brain Capacity not only improves prediction performance, but also drives down inference costs. As a general model, Tencent Hunyuan leads the industry in Chinese performance, especially in text generation, mathematical logic and multi-turn dialogue.

Recently, Tencent Hunyuan large model officially released the 256k long article model, and made it available to the majority of enterprises and individual developers through Tencent Cloud Open to support wider innovation and applications. Tencent Hunyuan256k model version has the ability to process ultra-long text exceeding 38 million characters. In dialogue application scenarios, this model can"memory"more dialogue content, effectively avoiding "Forgot" information and other issues. In addition, it has excellent contextual analysis capabilities to provide more precise and relevant feedback to conversation participants, helping them make more informed decisions.

In addition, this model version also shows strong performance in reading comprehension of long documents and large-scale data analysis. It can provide strong work support for professionals in finance, medical, education, travel and other industries, and significantly improve their work efficiency. The model has also been deeply optimized in terms of inference performance, ensuring that users can enjoy a smoother and more efficient experience in actual applications on platforms such as Tencent Cloud.


##Reduce "forgetfulness" and make large models smarter

In large model products, handling conversational requirements is a core function. However, due to the limitations of long text processing capabilities, traditional large models are prone to "losing direction" or "Memory Loss" As the length of the conversation increases, the amount of forgotten information also increases.

Tencent Hunyuan256k model is specially optimized for this challenge. It adopts an advanced"Expert Hybrid"(MoE) architecture, And it integrates innovative technologies such as RoPE-NTK and Flash Attention V2, while maintaining the ability to support general short texts (less than 4,000 characters), while achieving a breakthrough in the depth and breadth of long text processing.

Currently, Tencent Hunyuan’s large model has the ability to understand 256k of ultra-long context in a single process The number of characters exceeds 38. After rigorous "finding a needle in a haystack"After task testing, the model’s accuracy in long text processing has reached 99.99%, which is also in a leading position internationally.


Continuous and stable iteration, the efficiency of large model application is improved

Tencent Hunyuan Large Model is the first in the industry to adopt the hybrid expert model (MoE) structure, and has accumulated a large number of self-developed technologies in the process. In the previous version 32K, this model has significantly surpassed similar open source models on the market and demonstrated excellent performance in a variety of application scenarios.

After a new iteration, Tencent Hunyuan256k#GSB evaluation in the general field , compared to the previous version, the winning rate is 50.72%. At the same time, the training set of Tencent Hunyuan256k integrates high-quality annotated data such as long text data, translation data, and multi-document question and answer data in medical, financial and other fields, which makes the model In practical applications, especially in the medical and financial industries that require frequent analysis and processing of large amounts of long text data, it can provide more accurate and efficient work support.

For example, when a financial report issued by the central bank is input into the Tencent Hunyuan256k# model, the model can quickly refine and summarize The main points of the report were processed to a satisfactory level in terms of speed and accuracy.

Supports 380,000 words input at a time! Tencent Hunyuan launches 256k long article model, open to enterprises and individual developers through Tencent Cloud​


##Inference performance optimization, bringing stronger large models Comprehension

At the same time, Tencent Hunyuan256k has made in-depth optimization on inference performance. Model's QPM## compared to FP16 accuracy in INT8 accuracy mode # (query rate per second) achieved a significant improvement of 23.9%, while the first word time only increased by 5.7% . These improvements significantly enhance the model's responsiveness and overall efficiency in real-world applications.

Take the analysis of "The Romance of the Three Kingdoms" as an example. Tencent Hunyuan256k can quickly read and retrieve this hundreds of thousands of words. Classical novels can not only accurately identify the key characters and plots of events in the novels, but can even provide accurate information on detailed descriptions of weather, character clothing, etc.

Supports 380,000 words input at a time! Tencent Hunyuan launches 256k long article model, open to enterprises and individual developers through Tencent Cloud​


##AI

Large model as the basis of new quality productivity A key component that plays a vital role in promoting industrial upgrading and achieving high-quality development. The launch of Tencent Hunyuan256k model has injected new vitality into the entire industry and opened up wider application prospects.

Currently, Tencent Hunyuan

256k long article model has been opened to the majority of enterprises and individual developers through Tencent Cloud. Users can use hunyuan-standardVersion256kLong text model access. This enables more developers and users to easily access and use the powerful functions of Tencent’s Hunyuan model, thereby providing intelligent solutions for all walks of life and promoting the development of more innovative application scenarios accomplish.

The above is the detailed content of Supports 380,000 words input at a time! Tencent Hunyuan launches 256k long article model, open to enterprises and individual developers through Tencent Cloud​. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Recognition from the first prize of Science and Technology Progress Award: Tencent solved the problem of training large models with trillions of parameters Recognition from the first prize of Science and Technology Progress Award: Tencent solved the problem of training large models with trillions of parameters Mar 27, 2024 pm 09:41 PM

The list of recipients of the China Electronics Society’s 2023 Science and Technology Awards has been announced. This time, we discovered a familiar figure—Tencent’s Angel machine learning platform. In the current era of rapid development of large models, the Science and Technology Award is awarded to machine learning platform research and application projects, fully affirming the value and importance of model training platforms. The Science and Technology Award recognizes the research and application of machine learning platform projects, and fully recognizes the value and importance of model training platforms, especially in the context of the rapid development of large-scale models. With the rise of deep learning, major companies have begun to realize the importance of machine learning platforms in the development of artificial intelligence technology. Google, Microsoft, Nvidia and other companies have launched their own machine learning platforms to accelerate

How to make a WeChat link? Sharing how to create WeChat links How to make a WeChat link? Sharing how to create WeChat links Mar 09, 2024 pm 09:37 PM

WeChat, as a popular social software, not only provides people with the convenience of instant messaging, but also integrates a variety of functions to enrich users' social experience. Among them, the creation and sharing of WeChat links is an important part of WeChat functions. The production of WeChat links mainly relies on the WeChat public platform and its related functions, as well as third-party tools. The following are several common methods of making WeChat links. How to make a WeChat link? The first way to create WeChat links is to use the image and text editor of the WeChat public platform. 1. Log in to the WeChat public platform and enter the image and text editing interface. 2. Add text or images in the editor, and then use the link button to add the required link. This method is suitable for simple text or image links. The second method is to use HTML code

Tencent Hunyuan large model has been fully reduced in price! Hunyuan-lite is free from now on Tencent Hunyuan large model has been fully reduced in price! Hunyuan-lite is free from now on Jun 02, 2024 pm 08:07 PM

On May 22, Tencent Cloud announced a new large model upgrade plan. One of the main models, Hunyuan-lite model, the total API input and output length is planned to be upgraded from the current 4k to 256k, and the price is adjusted from 0.008 yuan/thousand tokens to fully free. The Hunyuan-standardAPI input price dropped from 0.01 yuan/thousand tokens to 0.0045 yuan/thousand tokens, a decrease of 55%, and the API output price dropped from 0.01 yuan/thousand tokens to 0.005 yuan/thousand tokens, a decrease of 50%. The newly launched Hunyuan-standard-256k has the ability to process ultra-long text of more than 380,000 characters, and the API input price has been reduced to 0.015 yuan/thousand toke.

Should I enable IPv6 on my home router? 'Must-see: Advantages of enabling IPV6 on your home router' Should I enable IPv6 on my home router? 'Must-see: Advantages of enabling IPV6 on your home router' Feb 07, 2024 am 09:03 AM

IPv4 is exhausted and IPv6 is urgently needed, but is this upgrade just a passive change? What does IPv6 mean to the general public? How much change can the comprehensive upgrade of IPv6 bring to our network? 01 Large-scale IPv6 transformation is about to be realized. Recently, the General Office of the Ministry of Industry and Information Technology and the General Office of the State Administration of Radio and Television issued a notice proposing requirements to promote the IPv6 transformation of Internet TV services. China Mobile, Alibaba Cloud, Tencent Cloud, Baidu Cloud, JD Cloud, Huawei Cloud and Wangsu Technology need to carry out IPv6 transformation of the content distribution network (CDN) related to Internet TV business. By the end of 2020, Internet TV service capabilities based on IPv6 protocol will reach 85% of IPv4

GPT Store can't even open its doors. How dare this domestic platform take this path? ? GPT Store can't even open its doors. How dare this domestic platform take this path? ? Apr 19, 2024 pm 09:30 PM

Pay attention, this man has connected more than 1,000 large models, allowing you to plug in and switch seamlessly. Recently, a visual AI workflow has been launched: giving you an intuitive drag-and-drop interface, you can drag, pull, and drag to arrange your own workflow on an infinite canvas. As the saying goes, war costs speed, and Qubit heard that within 48 hours of this AIWorkflow going online, users had already configured personal workflows with more than 100 nodes. Without further ado, what I want to talk about today is Dify, an LLMOps company, and its CEO Zhang Luyu. Zhang Luyu is also the founder of Dify. Before joining the business, he had 11 years of experience in the Internet industry. I am engaged in product design, understand project management, and have some unique insights into SaaS. Later he

Use vscode to remotely debug the Linux kernel Use vscode to remotely debug the Linux kernel Feb 05, 2024 pm 12:30 PM

Preface The previous article introduced the use of QEMU+GDB to debug the Linux kernel. However, sometimes it is not very convenient to directly use GDB to debug and view the code. Therefore, on such an important occasion, how can the artifact of vscode be missing? This article introduces how to use vscode to remotely debug the kernel. Environment for this article: Windows 10 vs Code Ubuntu 20.04. I personally use Tencent Cloud Server, so I save the process of installing a virtual machine. Start directly from vscode configuration. Install the vscode plug-in remote-ssh. Find the Remote-SSH plug-in in the plug-in library and install it. After the installation is complete, there will be an additional function on the right toolbar. Press F1 to call out the pair.

Tencent Hunyuan upgrades model matrix, launching 256k long text model on the cloud​ Tencent Hunyuan upgrades model matrix, launching 256k long text model on the cloud​ Jun 01, 2024 pm 01:46 PM

The implementation of large models is accelerating, and "industrial practicality" has become a development consensus. On May 17, 2024, the Tencent Cloud Generative AI Industry Application Summit was held in Beijing, announcing a series of progress in large model development and application products. Tencent's Hunyuan large model capabilities continue to upgrade. Multiple versions of models hunyuan-pro, hunyuan-standard, and hunyuan-lite are open to the public through Tencent Cloud to meet the model needs of enterprise customers and developers in different scenarios, and to implement the most cost-effective model solutions. . Tencent Cloud releases three major tools: knowledge engine for large models, image creation engine, and video creation engine, creating a native tool chain for the era of large models, simplifying data access, model fine-tuning, and application development processes through PaaS services to help enterprises

How to install PHP and integrate with Apache on Debian 12 How to install PHP and integrate with Apache on Debian 12 Feb 20, 2024 pm 02:30 PM

PHP is a popular programming language that is widely used to develop various website applications. Many well-known websites and open source programs are developed using PHP, such as WordPress, Magento and Laravel. This tutorial will introduce how to install PHP in Debian12 and the integration of PHP and Apache. Prerequisite: You need to have a server with Debian12 installed to facilitate the drill operation on it. Of course, it is also recommended that you purchase an Alibaba Cloud VPS or Tencent Cloud VPS virtual host. If you prefer foreign servers, I recommend you try VPS on Vultr. You will get a $50 experience when you sign up, which is very cost-effective. Of course there is a host, but for security reasons it is not recommended to use it.

See all articles