Microsoft Azure ND H100 v5 VM Preview Released: 8 to Thousands of NVIDIA H100 Tensor Core GPUs

According to an official release from Microsoft Azure, Microsoft is applying a decade of supercomputing experience, along with its experience supporting very large AI training workloads, to build high-performance AI infrastructure at scale. The Microsoft Azure cloud, specifically its virtual machines (VMs) accelerated with graphics processing units (GPUs), provides the foundation for generative AI development for Microsoft and its customers.

Microsoft is now introducing the ND H100 v5 VM, Azure's most powerful and highly scalable family of AI virtual machines to date. The VMs support on-demand configurations ranging from eight to thousands of NVIDIA H100 GPUs interconnected over Quantum-2 InfiniBand networks, enabling significantly higher performance for AI models. Compared to the previous generation of ND A100 v4 VMs, this virtual machine release includes the following innovations:

  • 8 NVIDIA H100 Tensor Core GPUs interconnected via next-generation NVSwitch and NVLink 4.0.
  • 400 Gb/s NVIDIA Quantum-2 CX7 InfiniBand per GPU, for 3.2 Tb/s per VM, in a non-blocking fat-tree network.
  • NVSwitch and NVLink 4.0 with 3.6 TB/s of bisectional bandwidth between the eight local GPUs in each VM.
  • 4th generation Intel Xeon Scalable processors.
  • PCIe Gen5 host-to-GPU interconnect with 64 GB/s of bandwidth per GPU.
  • 16 channels of 4800 MHz DDR5 memory.
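
The InfiniBand figures in the list follow from simple arithmetic: eight GPUs at 400 Gb/s each give 3.2 Tb/s per VM. The short Python sketch below works through that calculation and shows how many 8-GPU VMs a larger job would occupy. The function names are hypothetical and chosen only for illustration; the only inputs taken from the article are 8 GPUs per VM and 400 Gb/s per GPU.

```python
# Illustrative arithmetic only; function names are hypothetical.
GPUS_PER_VM = 8          # GPUs per ND H100 v5 VM, as listed above
IB_GBPS_PER_GPU = 400    # InfiniBand bandwidth per GPU in Gb/s, as listed above


def vm_infiniband_gbps(gpus=GPUS_PER_VM, per_gpu_gbps=IB_GBPS_PER_GPU):
    """Aggregate InfiniBand bandwidth of one VM, in gigabits per second."""
    return gpus * per_gpu_gbps


def gbps_to_gigabytes_per_s(gbps):
    """Convert gigabits per second to gigabytes per second (8 bits per byte)."""
    return gbps / 8


def vms_needed(total_gpus, gpus_per_vm=GPUS_PER_VM):
    """Number of VMs required to provide total_gpus GPUs (rounded up)."""
    return -(-total_gpus // gpus_per_vm)  # ceiling division


if __name__ == "__main__":
    per_vm = vm_infiniband_gbps()
    print(f"Per-VM InfiniBand: {per_vm} Gb/s = {per_vm / 1000} Tb/s")
    print(f"Same figure in bytes: {gbps_to_gigabytes_per_s(per_vm)} GB/s")
    print(f"VMs for 1000 GPUs: {vms_needed(1000)}")
```

Note the difference between bits (Gb/s, Tb/s) and bytes (GB/s, TB/s): network links are usually quoted in bits per second, so the 3.2 Tb/s per-VM figure corresponds to 400 GB/s.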

NVIDIA Quantum-2 uses the seventh-generation NVIDIA InfiniBand architecture to give AI developers and scientific researchers the network performance and features they need to solve challenging problems, including remote direct memory access (RDMA) and speeds of up to 400 Gb/s for advanced supercomputing data centers.

Microsoft says that large-scale AI is built into Azure's DNA. Early investments in large language model research, such as Turing, and milestones such as building the first AI supercomputers in the cloud prepared the ground for generative AI. The Azure OpenAI Service lets customers use large-scale generative AI models. "Scale" has always been one of the goals in optimizing Azure's AI infrastructure. Microsoft is now bringing supercomputing capabilities to startups and enterprises of all sizes without requiring significant investment in physical hardware or software.

The ND H100 v5 is now available in preview and will become a standard service in the Azure portfolio.

James Lopez
https://www.techgoing.com
James Lopez joined Techgoing as Senior News Editor in 2022. He's been a tech blogger since before the word was invented, and will never log off.
