


Intel open source NPU acceleration library, Core Ultra processor AI PC can run lightweight large language models
News on March 4th showed that Intel recently released its NPU acceleration library on GitHub. This move enables AI PCs equipped with Core Ultra processors to run lightweight large-scale applications such as TinyLlama and Gemma-2b more smoothly. Language model.
The Core Ultra series integrates the NPU AI engine for the first time. This engine can handle some lightweight AI inference tasks and work together with the CPU and GPU to meet various needs. Requirements for AI applications.
It is understood that although the NPU acceleration library released this time is mainly prepared for developers, those who have certain programming experience Users can also try it. Tony Mongkolsmai, a software architect at Intel, demonstrated how to run an AI chatbot based on the 1.1 billion parameter TinyLlama large model on an MSI Monarch 14 AI Evo laptop, which can conduct simple conversations. At the same time, Windows Task Manager also shows valid calls to the NPU.
However, the current open source NPU acceleration library still has some shortcomings in functionality. It supports 8-bit quantization and FP16 precision, but does not yet support 4-bit quantization and BF16 precision. As well as advanced functions such as NPU/GPU hybrid computing, the relevant technical documentation has not yet been provided. However, Intel has promised to gradually expand its functions in the future, which is expected to double the existing functions, which will undoubtedly bring more convenience and possibilities to AI developers.
The above is the detailed content of Intel open source NPU acceleration library, Core Ultra processor AI PC can run lightweight large language models. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



IntelArrowLakeisexpectedtobebasedonthesameprocessorarchitectureasLunarLake,meaningthatIntel'sbrandnewLionCoveperformancecoreswillbecombinedwiththeeconomicalSkymontefficiencycores.WhileLunarLakeisonlyavailableasava

Beelink has launched a new mini PC, the EQi12. It's an upgraded version of the EQ12that the brand introduced last year, which came with the Intel N100. The newer computer can be equipped with up to the Intel Core i7 12650H, one of the higher-end chip

LG already offers the Gram 16 Pro with Intel Meteor Lake processors (curr. $1,699.99 on Amazon). However, the company has decided to switch out Intel's Meteor Lake architecture for Lunar Lake, which Intel showcased at IFA 2024 in Berlin last week. Th

IntelTXT is a hardware-assisted security technology launched by Intel. It can ensure the integrity and security of the server during startup by establishing a protected space between the CPU and BIOS. The full name of TXT is TrustedExecutionTechnology, which is Trusted Execution Technology. Simply put, TXT is a security technology that provides hardware-level protection to ensure that the server has not been modified by malicious programs or unauthorized software when it is started. this one

According to news on March 4, Intel recently released its NPU acceleration library on GitHub. This move enables AIPCs equipped with Core Ultra processors to more smoothly run lightweight large-scale language models such as TinyLlama and Gemma-2b. The Core Ultra series integrates the NPUAI engine for the first time. This engine can handle some lightweight AI inference tasks and work together with the CPU and GPU to meet the requirements of various AI applications. It is understood that although the NPU acceleration library released this time is mainly prepared for developers, users with certain programming experience can also try to use it. Intel software architect Tony Mongkolsmai demonstrates how to

Lenovo has now released the Yoga Slim 7i Aura Edition, less than a week after initially presenting the Intel Lunar Lake-based laptop. Please note that while the company revealed plenty of details about the laptop during IFA 2024 in Berlin, it elected

Intel's next-generation GPU architecture is expected to launch in September as part of Intel Lunar Lake. Like the Arc Alchemist iGPU from Meteor Lake, the iGPU has up to 8 Xe cores, but the new architecture is expected to achieve 50% higher performan

Thirteen is known to be bad lack. Superstition is not a virtue in the tech business though. For Lenovo, the 13th generation of the premium laptop is seemingly bound to be just as successful as the twelve predecessors. At IFA, the biggest PC manufactu
