
Intel unveils next-generation data center CPU design: chiplet design, performance per watt up 240%

Aug 30, 2023, 12:53 PM

Intel's next generation of chips is set to bring a large jump in performance.

The annual Hot Chips is one of the most important technical conferences in the semiconductor industry. Experts from across the chip field gather there, and chip makers from around the world often use it to unveil new products or explain their future direction.

On Monday local time, at Hot Chips 2023 at Stanford University, Intel revealed details of its new data center chip, "Sierra Forest", for the first time. Intel says it delivers 240% better performance per watt than the previous generation and expects to launch it next year.
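The 240% figure can be read two ways, and the article does not say which one is meant. As a rough sketch (the function names and the two "readings" are illustrative assumptions, not Intel data), the arithmetic for each interpretation looks like this:

```python
def perf_per_watt_multiplier(percent: float, reading: str) -> float:
    """Convert a quoted percentage into a multiplier over the baseline.

    reading="increase":    "240% better" means baseline plus 240% of baseline.
    reading="of_baseline": "240%" means 240% of the baseline value.
    """
    if reading == "increase":
        return 1.0 + percent / 100.0
    if reading == "of_baseline":
        return percent / 100.0
    raise ValueError(f"unknown reading: {reading}")


def relative_power_for_same_work(multiplier: float) -> float:
    """Fraction of baseline power needed to deliver the same throughput."""
    return 1.0 / multiplier


for reading in ("increase", "of_baseline"):
    m = perf_per_watt_multiplier(240.0, reading)
    share = relative_power_for_same_work(m)
    print(f"{reading}: {m:.1f}x perf/watt, {share:.0%} of baseline power for equal work")
```

Either way the chip would do the same work with well under half the power, but the gap between the two readings is large enough that the exact wording matters when comparing vendor claims.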
At the same time, Intel split its data center chips into two categories for the first time: Granite Rapids, which focuses on high performance at higher power consumption, and Sierra Forest, which focuses on energy efficiency.

Let’s look at the specific details of the two data center chips, Granite Rapids and Sierra Forest.

Overall, thanks to the introduction of area-efficient E-cores (efficiency cores), Granite Rapids and Sierra Forest are expected to be one of the most important updates to date in Intel's Xeon Scalable hardware ecosystem.

Let's first look at Sierra Forest, which is Intel's first E-core Xeon Scalable chip for data centers and the lead product for the EUV-based Intel 3 process. Intel said Sierra Forest is expected to be available in the first half of next year. Granite Rapids uses the same Intel 3 process.
In terms of design, both Granite Rapids and Sierra Forest are chiplet-based: a mix of compute and I/O chiplets packaged together using Intel's EMIB (Embedded Multi-Die Interconnect Bridge) technology. The approach is also distinctive in that it uses separate compute and I/O chiplets rather than packaging otherwise "complete" Xeon chiplets together.

This means that Granite Rapids and Sierra Forest can share a common I/O chiplet, which is built on the Intel 7 process.
In addition to sharing platform details, Intel also gave its first high-level overview of the architectures used by the E-cores and P-cores (performance cores). As has been the case for several generations of Xeon, Intel is using the same basic CPU architecture as in its consumer parts.

Thus, Granite and Sierra can be thought of as deconstructed Meteor Lake processors, with Granite featuring Redwood Cove P cores and Sierra featuring Crestmont E cores.


As mentioned before, this is Intel's first attempt at bringing E-cores to the Xeon market. For Intel, this means tuning the E-core design for data center workloads, a significant departure from the previous generation's consumer-focused E-core design.

Intel revealed that Crestmont offers a 6-wide instruction decode path and an 8-wide retirement backend. While not as powerful as Intel's P-core, the E-core is by no means a lightweight core, and Intel's design decisions reflect this. Still, it is much more efficient than the P-cores in Granite Rapids, both in die area and in power consumption.

Crestmont's L1 instruction cache (I-cache) will be 64KB, twice the size of the I-cache in earlier designs. Meanwhile, Crestmont cores can be packaged into 2- or 4-core clusters, unlike Gracemont, which currently only offers 4-core clusters. Finally, Sierra Forest/Crestmont will offer an instruction set as close as possible to that of Granite Rapids. This means BF16 data type support, as well as support for instruction set extensions such as AVX-IFMA and AVX-DOT-PROD-INT8.
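For readers unfamiliar with BF16: it is a 16-bit floating-point format that keeps the sign, the full 8-bit exponent, and the top 7 mantissa bits of an IEEE 754 single-precision value. The following is a minimal software sketch of that conversion, not a description of how the hardware implements it; the function names are made up for illustration, and round-to-nearest-even is an assumption about the rounding mode.

```python
import struct


def float32_to_bf16_bits(x: float) -> int:
    """Return the 16-bit BF16 pattern for x, rounding to nearest even.

    x must be representable as float32 (struct raises OverflowError otherwise).
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    exponent_all_ones = (bits & 0x7F800000) == 0x7F800000
    if exponent_all_ones and (bits & 0x007FFFFF):
        # NaN: truncate and force a mantissa bit so the result stays a NaN.
        return (bits >> 16) | 0x0040
    # Add a rounding bias so that ties go to the even 16-bit result.
    bits += 0x7FFF + ((bits >> 16) & 1)
    return (bits >> 16) & 0xFFFF


def bf16_bits_to_float(b: int) -> float:
    """Expand a BF16 bit pattern back to a Python float via float32."""
    return struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))[0]


pi_bits = float32_to_bf16_bits(3.14159265)
print(hex(pi_bits), bf16_bits_to_float(pi_bits))
```

Because the exponent width matches float32, BF16 covers the same numeric range with much coarser precision, which is why it is popular for AI workloads.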


Granite Rapids, meanwhile, uses the Redwood Cove P-core. Redwood Cove/Granite Rapids continues the traditional Xeon core lineage, so the changes are not as drastic for Intel as with Sierra Forest, but that doesn't mean there are no improvements.

In terms of microarchitecture, Redwood Cove gets the same 64KB I-cache as Crestmont, which is twice the capacity of its predecessor. Most notably, Intel managed to reduce the latency of floating-point multiplication from 4/5 cycles to just 3 cycles. Basic instruction latency improvements like this are rare, so they are always welcome.
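To see why a lower multiply latency matters, consider a chain of multiplications in which each one needs the previous result, so they cannot overlap. The sketch below uses a deliberately simplified model (latency only, ignoring throughput, other instructions, and out-of-order overlap with independent work), and the chain length is an arbitrary illustrative value:

```python
def dependent_chain_cycles(num_multiplies: int, latency_cycles: int) -> int:
    """Cycles for a chain where every multiply waits on the previous result."""
    return num_multiplies * latency_cycles


chain_length = 1000  # arbitrary illustrative value, not from the article
new_cycles = dependent_chain_cycles(chain_length, 3)
for old_latency in (4, 5):
    old_cycles = dependent_chain_cycles(chain_length, old_latency)
    saving = 1 - new_cycles / old_cycles
    print(f"{old_latency} -> 3 cycles: {old_cycles} vs {new_cycles} cycles "
          f"({saving:.0%} fewer)")
```

Real code rarely consists of a single dependency chain, so the end-to-end gain would normally be smaller than this upper bound.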

In addition, the Redwood Cove microarchitecture improves branch prediction and prefetching, both typical optimization targets for Intel. Anything Intel can do to improve branch prediction (and reduce the cost of the rare mispredictions) tends to pay relatively large dividends in performance.

Of particular relevance to the Xeon family, Redwood Cove's AMX matrix engine gains FP16 support. FP16 isn't used as heavily as the already supported BF16 and INT8, but it improves AMX's flexibility overall.

Support for memory encryption is also being improved. Redwood Cove in Granite Rapids will support 2048 256-bit memory keys, while Sapphire Rapids supports 128 keys.
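One way to put the key counts in perspective is to ask how many bits a key identifier would need to select one key. The article does not describe how keys are selected, so the binary key-ID field assumed here is purely illustrative:

```python
def key_id_bits(num_keys: int) -> int:
    """Minimum number of bits needed to give each key a distinct binary ID."""
    if num_keys < 1:
        raise ValueError("num_keys must be positive")
    return (num_keys - 1).bit_length()


for chip, keys in (("Sapphire Rapids", 128), ("Granite Rapids", 2048)):
    print(f"{chip}: {keys} keys -> {key_id_bits(keys)}-bit key ID")
```

Under that assumption, the move from 128 to 2048 keys is a 16x increase in the number of separately encrypted memory domains.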


While it's too early to talk about individual SKUs for Granite Rapids and Sierra Forest, Intel has made it clear that core counts are increasing overall. Granite Rapids chips will offer more CPU cores than Sapphire Rapids (60 for the SPR XCC). Sierra Forest, with 144 cores, will offer more still.

Between earlier Xeon delays and the long time it has taken to bring E-core Xeon Scalable chips to market, Intel no longer holds the dominant position in the data center market that it once did. Granite Rapids and Sierra Forest will therefore mark an important inflection point, setting the direction for Intel's future data center products.


The data centers that power the Internet and online services require massive computing power and consume a great deal of electricity. In recent years, with the rise of technologies such as AI, technology companies have faced the challenge of increasing computing power while reducing energy consumption, which has pushed chip companies to focus on power efficiency.

Currently, Intel’s share in the data center chip market is being eroded step by step by competitors such as AMD and Ampere (a startup founded by former Intel executive Renee James).

This year, Ampere and AMD have both launched their own high-efficiency cloud computing chips, and Arm presented its Neoverse V2 platform at Hot Chips 2023. As competition intensifies, Intel is bound to feel the pressure.

Reference links:
https://www.anandtech.com/show/20034/hot-chips-2023-intel-details-granite-rapids-and-sierra-forest-xeons
https://www.reuters.com/technology/intel-says-new-sierra-forest-chip-more-than-double-power-efficiency-2023-08-28/
