Intel Unveils AI Chips for Enhanced Performance at Vision 2024 Event

Intel claims its new Gaudi 3 AI accelerators outpower Nvidia’s flagship GPUs “at a fraction of the cost” at Vision 2024

April 11, 2024

3 Min Read

A rectangular semiconductor led on a grey sheet illuminated by a purple light

Intel

Intel unveiled its latest AI chips, Gaudi 3, designed to offer increased performance training and running large language models at Intel Vision 2024 in Phoenix, Arizona this week.

The new hardware is designed to power cloud-based infrastructure for training and inferencing large language models.

Intel claims the new hardware is 50% faster at training and 30% faster in processing large language models compared to Nvidia’s H100 GPUs.

“Innovation is advancing at an unprecedented pace, all enabled by silicon – and every company is quickly becoming an AI company,” said Intel CEO Pat Gelsinger. “Intel is bringing AI everywhere across the enterprise, from the PC to the data center to the edge. Our latest Gaudi, Xeon and Core Ultra platforms are delivering a cohesive set of flexible solutions tailored to meet the changing needs of our customers and partners and capitalize on the immense opportunities ahead.”

A man in a grey jacket, blue jeans and a black t-shirt holding a stack of semiconductors up on stage in front of a blue background

Intel CEO Pat Gelsinger with eight interconnected Gaudi 3s | Credit: Intel

The new hardware boasts increased memory and networking bandwidth and provides four times the AI compute compared to the prior Gaudi 2s. The chips house tens of thousands of accelerators and can be interconnected via Ethernet.

Intel said the Gaudi 3 chips' cost-effectiveness makes them suitable for enterprises running experiments or scaling existing AI deployments. Enterprise customers can use the chips to scale their efforts from a single node to clusters to support their AI workloads.

“In the ever-evolving landscape of the AI market, a significant gap persists in the current offerings. Feedback from our customers and the broader market underscores a desire for increased choice,” said Justin Hotard, Intel executive vice president and general manager of its data center and AI group.

“Enterprises weigh considerations such as availability, scalability, performance, cost, and energy efficiency. Intel Gaudi 3 stands out as the generative AI alternative presenting a compelling combination of price performance, system scalability, and time-to-value advantage.”

South Korean internet provider Naver is among the early adopters. At Vision 2024, the company announced it is using the new Gaudi hardware to power new large language models for its global AI services.

Intel’s Gaudi 3 chips will be available in the second half of 2024 to hardware manufacturers including Dell Technologies, HPE and Lenovo before a general release later this year.

A PCIe add-in card for the Gaudi 3, which allows users to connect other high-speed input-output components like Wi-Fi cards or SSDs, will also launch in Q3 2024.

Alongside Gaudi 3, Intel provided a glimpse at its new Xeon processors.

The Xeon 6s, built to power generative AI workloads, can run in data centers for cloud and edge workloads.

Intel unveiled two Xeon 6 product lines, Sierra Forest and Granite Rapids, which will launch later this year.

A man in a grey jacket, blue jeans and a black t-shirt holding up two circular semiconductors on stage in front of a blue background

Gelsinger shows off the new Sierra Forest and Granite Rapids Xeon processes | Credit: Intel

The Xeon 6 processors will replace the 5th gen versions, which were only rolled out earlier this year.

Additionally, Intel unveiled Tiber, a suite of solutions businesses can use to ease generative AI software deployments.

Tiber compiles tools enterprise customers can use to securely roll out AI services in the cloud or at the edge. Tiber will be released in the coming months.

About the Author(s)

Ben Wodecki

Jr. Editor

Ben Wodecki is the Jr. Editor of AI Business, covering a wide range of AI content. Ben joined the team in March 2021 as assistant editor and was promoted to Jr. Editor. He has written for The New Statesman, Intellectual Property Magazine, and The Telegraph India, among others. He holds an MSc in Digital Journalism from Middlesex University.

See more from Ben Wodecki

Related Topics

Recent in ML

Related Topics

Recent in NLP

Related Topics

Recent in Data

Related Topics

Recent in Automation

Related Topics

Recent in Verticals

Related Topics

Recent in Responsible AI

Related Topics

Recent in Companies

Related Topics

Intel Unveils AI Chips for Enhanced Performance at Vision 2024 Event

About the Author(s)

Latest News

Trending articles