Intel Gaudi2 Accelerator MLPerf Benchmarks Show A Viable AI Alternative To NVIDIA’s GPUs
International Conference on Nuclear Physics and Particle Physics Research Excellence
Before diving into the benchmarks, it is important to realize that the AI industry is predicted to grow exponentially in the coming years. Gartner, an American insight firm, has disclosed that the AI industry is set to reach a $53.4 billion valuation in 2023, which is an increment of 20.9% from the previous year. Moreover, the firm predicts that the industry could almost reach the $120 billion mark by 2027, which is a prime reason why companies like Intel and AMD are striving to make a dominant entry with their own AI solutions.
Based on the data released by Intel, Gaudi2 accelerators have seen a decent performance uplift, outperforming NVIDIA's A100 AI GPUs by a good margin. Intel says that their Gaudi2 accelerator offers much better "value" compared to their counterparts in the market, and they are indeed right given the inflated prices NVIDIA's AI GPUs are being sold at.
While the above-mentioned statistics are indeed impressive, Intel has left out some details to reveal such as relevant TDP and temperature stats. They don't matter in the longer run especially with the current state of the industry, since the H100s are facing an immense shortage, with the demand and supply chain disrupted due to an "influx" of huge orders. In light of this, Intel's Gaudi2 accelerators are turning out to be a strong alternative, however, work still needs to be done here.
Apart from Gaudi accelerators, Intel also revealed some benchmarks of their 4th Gen Intel Xeon Scalable and Xeon CPU Max, which have been lately thriving in the industry due to the performance value they bring onboard. These are the main result highlights:The 4th Gen Intel Xeon Scalable processor is ideal for building and deploying general-purpose AI workloads with the most popular AI frameworks and libraries. For the GPT-J 100-word summarization task of a news article of approximately 1,000 to 1,500 words, 4th Gen Intel Xeon processors summarized two paragraphs per second in offline mode and one paragraph per second in real-time server mode.
For the first time, Intel submitted MLPerf results for the Intel Xeon CPU Max Series, which provides up to 64 gigabytes (GB) of high-bandwidth memory. For GPT-J, it was the only CPU able to achieve 99.9% accuracy, which is critical for applications for which the highest accuracy is of paramount performance.
Zinsner has expressed the company's attention to adopting a more "balanced" approach in the AI industry, offering competitive products. The official has realized that fact that the company's Sapphire Rapids lineup has received more attention, while the AI accelerator department was left out. Zinsner claims that Intel has been focusing on its GPU segment for a while now, with a majority of the company's finances allocated to its development which catalyzed the "downfall" of data center revenue.
Intel is currently moving towards focusing on the AI industry since it has seen the wonders it has done with the likes of NVIDIA and SK Hynix. The approach of Team Blue with Gaudi AI accelerators in the future is hinting towards a positive upturn, and one of the indicators we have seen is the company's plans to integrate next-gen Falcon Shores chips with the Gaudi lineup, expanding its potential to a whole new level. However, work needs to be done here and the only way Intel could oust competition is through revamping its AI products to a whole new level.
Intel collaborated with its original equipment manufacturer (OEM) customers to deliver their own submissions, further showcasing AI performance scalability and wide availability of general-purpose servers powered by Intel Xeon processors that can meet customer service level agreements (SLAs).
#AIInnovation#GPUAlternative#IntelAI#AIHardwareBenchmark#MLPerfBenchmark
#AIVision#AIResearch#AcceleratedComputing#AIInfrastructure#IntelVsNVIDIA
#AICompute#DeepLearning#AIWorkloads
Comments
Post a Comment