Japanese Fugaku, who has dominated the rankings for two years and completed four consecutive championships, finally fell from the "altar".
In the just-announced list of the top 500 global supercomputers in 2022, Frontier from the Oak Ridge National Laboratory (ORNL) in the United States won the first place. With an HPL score of 1.102 Exaflop/s, Frontier is not only the most powerful supercomputer in history, it is also the first true "Exascale supercomputer."
In contrast, China’s Sunway TaihuLight and Tianhe-2 dropped to sixth and ninth. Last year, the United States included Sunway TaihuLight's processor supplier Sunway on the entity list, which more or less affected the development of China's supercomputing.
The total computing power of the Top500’s 64-bit floating point calculations this time is 4.4 exaflops, an increase of 44.7% from the previous list six months ago.
So, how "unbelievable" has Frontier's performance been? ? ! To put it simply, even if the computing power of the 468 supercomputers behind it is added together, they cannot compare to one Frontier.
In addition to such dazzling computing power, Frontier is also one of the fastest supercomputers in the world for running AI. In the mixed precision performance test of the HPL-AI benchmark, Frontier ran a score of 6.88 ExaFlops.
In vernacular, this performance is equivalent to giving each neuron in an intelligent body composed of 86 billion neurons to issue 68 million instructions per second. Using this large model for training AI, it is like a rocket engine coupled with a Formula One car, which can take off directly on the track. The new Frontier covers an area of 372 square meters, consists of 74 HPE Cray EX cabinets, and can accommodate 9408 nodes.
Each node has an AMD Milan "Trento" 7A53 Epyc CPU, paired with 512GB DDR4 memory and 4 AMD Radeon Instinct MI250X GPUs. Overall, the system has 602,112 CPU cores connected to 4.6PB of DDR4 memory. Additionally, 37,888 AMD MI250X GPUs with 8,138,240 cores are paired with 4.6PB of HBM memory (128GB per GPU). The CPU and GPU are connected using the HPE Cray Slingshot-11 Ethernet-based network fabric.
The storage is connected to an extremely high-performance storage subsystem, with a capacity of 700PB, a throughput of 75TB/s and a performance of 15 billion IOPS. The metadata layer is spread across 480 NVMe SSDs, providing 10PB of overall capacity, while 5,400 NVMe SSDs provide 11.5PB of capacity for the main high-speed storage tier. At the same time, 47,700 PMR hard drives provide 679PB of capacity.
In terms of heat dissipation, the water-cooled system allows 6,000 gallons of water to be "moved" throughout the system by a 350-horsepower pump, thus controlling the temperature at around 85 degrees. . To put it into perspective, these pumps can fill an Olympic-sized swimming pool in 30 minutes. Frontier is not only the fastest supercomputer in the world, but also the most environmentally friendly supercomputer.
Frontier’s energy consumption ratio also ranks first on the Green500 list of global supercomputer environmental protection levels. The energy consumption efficiency of a single cabinet unit composed of the third-generation AMD Epyc processor and the Instinct MI250x accelerator is 62.68 gigaflops per watt of power operations.
However, it is not easy to assemble Frontier. ORNL must purchase 60 million parts with 685 different numbers. In addition, we had to deal with a shortage of chips during the construction process.
ORNL has 167 affected part numbers, which works out to 2 million parts. And AMD is missing 15 numbered parts for the MI200 GPU.
So ORNL worked with ASCR to obtain a Defense Priority and Allocation System (DPAS) rating for these parts, which means that the U.S. government can use the annual defense budget to purchase them at a better price than the market price. these parts. Soon, the "shortage" problem was solved.
After this wave of operations, can the so-called "doubts" in many people's minds be solved? The United States: For the sake of our national defense, we will increase our efforts to develop supercomputing!
Although it costs so much, it has a wide range of uses. In an era when rich men like Nvidia are building digital twins of the earth, supercomputers in official departments of various countries are becoming more and more important in developing AI that processes huge data, simulating climate and industrial applications. Even if Oak Forest can’t use up all the computing power, it can still rent or sell it. The industry is hungry for computing power.
And there are precedents for this in other countries: In an experiment to predict the material processing results of machine tools, it took 8 hours for Japan's DMG Mori Sei Machine Tools to have its own industrial computer, but Fugaku only needed 10 minutes to complete it.
In the new list, China and the United States are still the countries with the most entries.
In terms of the number of supercomputers on the list, China is still at 173, which is the same as the previous issue and continues to top the list. The United States dropped to 127 from 150 in the previous period, ranking second. Despite this, these two countries still account for 60% of the total number on the TOP 500 list. Japan, Germany, and France ranked 3-5 with 34, 31, and 22 units respectively.
Although Frontier has officially become the most powerful supercomputer in the world, it has not been compared with the two Chinese supercomputers that were exposed last year. However, neither system currently submits test data to Top500.
According to the disclosed data, many foreign media have inferred that the peak value of Sunway OceanLight supercomputer on the HPL benchmark is about 1.3 exaflops/1.05 exaflops, while Tianhe-3 can reach 1.7 exaflops peak value/1.3 exaflops sustained value. However, considering the various "stuck necks" in the United States on chips, the consequences of China's supercomputing rankings are likely to become a "target" for sanctions. Presumably, we will not see any relevant data in the short term. updated.
In terms of the number of entries on the list, China occupies the top spot with a proportion of 34.6%, and the United States ranks second with a proportion of 25.4%. China and the United States still compete for hegemony.
In the overall performance rankings, the United States’ advantage is still unshakable, firmly occupying the first place with 47.3% of the total computing power. It is not difficult to see that the number of people who have reached the top this time is Frontier has contributed a lot to the stable dominance of the United States.
In addition, this Top 500 list also counts the progress of supercomputing power since 1990, and conducts linear regression. The horizontal axis in the table is time, and the vertical axis is computing power (taken as a logarithm).
The green dot is the total computing power of the supercomputer on the list, the yellow triangle is the computing power of the top supercomputer on the list, and the blue square is the computing power of the last (500th) supercomputer on the list. The straight lines of corresponding colors are the results of linear regression. It can be seen that these three straight lines are roughly in line with Moore's Law's predictive judgment on the increase in computing power. It is said that Moore's Law is dead, but it seems that in the field of supercomputing, it is still alive and well.
The United States is not the only region that has achieved remarkable results. Europe also performed well. From the overall ranking, the United States, China, and Japan occupy the top three, and the remaining seven seats in the top ten are all occupied by European countries.
Among them, 31 German stations are on the list, 22 French stations are on the list, and 14 Italian stations are on the list. Finland's newly deployed LUMI supercomputer barely surpassed the United States' Summit with an FP64 performance of 151.9 petaflops, ranking third.
In addition, France’s GENCI-CINES’ Adastra almost won the 10th place with a performance of 46.1 petaflops. Although nowhere near as powerful as LUMI, Adastra still retains the distinction of being the second most powerful supercomputer in Europe. Systems such as LUMI and Adastra also illustrate another trend. Like Frontier, they're based on HPE's all-AMD Cray EX platform, using third-generation Epyc CPUs and Instinct GPUs.
For the first time in a long time, AMD is represented among the host CPUs in the system in the Top500 share, but Intel Xeon processors still dominate.
Since the beginning of this century, Intel has begun to develop its efforts in the field of high-performance computing. In just a few years, it has defeated several previous major players such as IBM and Hewlett-Packard. I was so squeezed out that I lost my vitality. From 2017 to 2020, Intel's market share in the field of high-performance computing reached its peak, once exceeding 90%.
Until now, Intel's share is still as high as 77.4%, a decrease of 4.2 percentage points, while AMD's share has just increased by this number. Epycs is expected to gain more momentum against the Xeon SP. Moreover, AMD has the momentum to catch up from behind. In the 2022 Global Supercomputing Top500 list, there are now 94 supercomputers relying on AMD products, an annual increase of 95%. Moreover, AMD’s Instinct MI200 accelerator is used in 7 of the supercomputers for the first time.
Another question is, why not use GPU accelerator? Maybe because it's expensive? As a computing engine, GPU accelerators can improve huge performance and cost-effectiveness. However, currently, accelerators do not occupy a dominant position among the supercomputing architectures on the Top500 list. These GPU accelerators are mainly from Nvidia. However, in addition to NVIDIA's own HPC center, which has a large number of applications that need to be ported to GPUs and is not short of money, thousands of other HPC centers around the world are still short of money. The code of these HPCs is explicitly written for CPUs.
On the current Top500 list, only 170 supercomputers use GPU accelerators - of course most of them are from NVIDIA. Another factor is that cheaper CPUs themselves are now adding accelerated computing functions similar to GPUs, such as vector and now matrix calculations.
This saves money and is easier to program than transferring CPU programs to GPU. We may never see a Top500 list consisting primarily of GPU-accelerated systems. In fact, in top-ranked supercomputing systems such as "Fuyue" and "TaihuLight", the CPU basically acts as a GPU with vector and matrix calculations.
The above is the detailed content of The United States regains the top spot in supercomputing in the world! Humanity has achieved exascale supercomputing, and Frontier has killed the top 500 companies in the world.. For more information, please follow other related articles on the PHP Chinese website!