


NVIDIA H100 dominates the authoritative AI performance test, completing large model training based on GPT-3 in 11 minutes
On Tuesday local time, MLCommons, an open industry consortium for machine learning and artificial intelligence, released the latest results from two MLPerf benchmarks. NVIDIA's H100 GPU set new records in every category of the AI training performance test, and it was the only hardware platform to run all of the tests.
(Source: NVIDIA, MLCommons)
MLCommons is an AI consortium made up of academia, research labs, and industry, and its MLPerf suite is an internationally recognized, authoritative benchmark for AI performance. Training v3.0 contains eight workloads: vision (image classification, biomedical image segmentation, and two object detection workloads), language (speech recognition, large language model, and natural language processing), and a recommendation system. Each result is the time a vendor's system takes to complete the benchmark task, so a shorter time means better performance.
(Training v3.0 training benchmark, source: MLCommons)
In the "large language model" training test that investors care most about, the results submitted by NVIDIA and the GPU cloud computing platform CoreWeave set a punishing bar for the industry. With 896 Intel Xeon 8462Y processors and 3,584 NVIDIA H100 chips working together, the system took only 10.94 minutes to complete the large language model training task based on GPT-3.
Apart from NVIDIA, only Intel submitted results for this test. A system built with 96 Xeon 8380 processors and 96 Habana Gaudi2 AI chips completed the same test in 311.94 minutes. For comparison, a platform with 768 H100 chips completed it in 45.6 minutes.
(The more chips, the better the data, source: NVIDIA)
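The two H100 results quoted above (768 chips in 45.6 minutes and 3,584 chips in 10.94 minutes) make it possible to estimate how close the larger system comes to linear scaling. The following is a minimal Python sketch of that arithmetic. The function names `chip_minutes` and `scaling_efficiency` are illustrative assumptions, not part of any MLPerf tooling, and the calculation is a rough reading of the published times rather than an official MLPerf metric.

```python
# Rough scaling arithmetic on the H100 figures quoted in the article.
# Function names are hypothetical; this is not an official MLPerf metric.

def chip_minutes(chips: int, minutes: float) -> float:
    """Total accelerator time consumed: chip count times wall-clock minutes."""
    return chips * minutes


def scaling_efficiency(small: tuple, large: tuple) -> float:
    """Measured speedup divided by the ideal (linear) speedup.

    Each argument is a (chip_count, minutes) pair. A value of 1.0 would
    mean that adding chips cut the training time proportionally.
    """
    n_small, t_small = small
    n_large, t_large = large
    measured_speedup = t_small / t_large
    ideal_speedup = n_large / n_small
    return measured_speedup / ideal_speedup


# Figures from the article: (H100 chip count, minutes to train).
H100_768 = (768, 45.6)
H100_3584 = (3584, 10.94)

if __name__ == "__main__":
    print("chip-minutes at 768 chips: ", chip_minutes(*H100_768))
    print("chip-minutes at 3584 chips:", chip_minutes(*H100_3584))
    print("scaling efficiency:", round(scaling_efficiency(H100_768, H100_3584), 3))
```

On these numbers, the 3,584-chip run reaches about 89% of the speedup that perfectly linear scaling from 768 chips would give. That supports the caption's point that more chips yield better times, though with some loss of efficiency at scale.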
Intel said of this result that there is still room for improvement. In theory, adding more chips will naturally produce faster results. Jordan Plawner, Intel's senior director of AI products, told the media that Habana's results will improve by 1.5 to 2 times. Plawner declined to disclose the price of Habana Gaudi2, saying only that the industry needs a second manufacturer of AI training chips and that the MLPerf data shows Intel is able to meet this demand.
In BERT-Large model training, which is more familiar to Chinese investors, NVIDIA and CoreWeave pushed the time down to just 0.13 minutes. With 64 cards, the time was 0.89 minutes. The Transformer architecture used in BERT is the foundation of today's mainstream large models.
Source: Financial Associated Press