Microsoft Azure ND H100 v5 VM Preview Released: 8 to Thousands of NVIDIA H100 Tensor Core GPUs

According to an official Microsoft Azure announcement, Microsoft is applying a decade of supercomputing experience, including support for very large AI training workloads, to build AI infrastructure that delivers high performance at scale. Azure, and specifically its virtual machines (VMs) accelerated with graphics processing units (GPUs), provides the foundation for generative AI development by Microsoft and its customers.

Microsoft is now introducing the ND H100 v5 VM, Azure's most powerful and most scalable family of AI virtual machines to date. The VMs are available on demand in sizes ranging from eight to thousands of NVIDIA H100 GPUs interconnected over NVIDIA Quantum-2 InfiniBand networking, which significantly speeds up AI model training and inference. Compared with the previous-generation ND A100 v4 VMs, this release includes the following innovations:

  • 8 NVIDIA H100 Tensor Core GPUs interconnected via next-generation NVSwitch and NVLink 4.0.
  • 400 Gb/s NVIDIA Quantum-2 CX7 InfiniBand per GPU, for 3.2 Tb/s per VM, in a non-blocking fat-tree network.
  • NVSwitch and NVLink 4.0 with 3.6 TB/s of bisection bandwidth between the eight local GPUs in each VM.
  • 4th generation Intel Xeon Scalable processors.
  • PCIe Gen5 host-to-GPU interconnect with 64 GB/s of bandwidth per GPU.
  • 16-channel 4800 MHz DDR5 memory.
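
The per-VM InfiniBand figure follows directly from the per-GPU figure in the list above. As a quick check, the arithmetic can be sketched in Python. The function names below are illustrative only and are not part of any Azure or NVIDIA API, and the cluster-sizing helper simply assumes that VMs are allocated whole, eight GPUs at a time.

```python
# Figures taken from the specification list above.
GPUS_PER_VM = 8
IB_GBIT_PER_GPU = 400  # InfiniBand bandwidth per GPU, in Gb/s


def vm_infiniband_gbit(gpus=GPUS_PER_VM, per_gpu_gbit=IB_GBIT_PER_GPU):
    """Aggregate InfiniBand bandwidth of one VM, in Gb/s."""
    return gpus * per_gpu_gbit


def gbit_to_gbyte(gbit):
    """Convert gigabits per second to gigabytes per second (8 bits per byte)."""
    return gbit / 8


def vms_needed(total_gpus, gpus_per_vm=GPUS_PER_VM):
    """Whole VMs required for a GPU count (assumption: VMs are not split)."""
    return -(-total_gpus // gpus_per_vm)  # ceiling division


if __name__ == "__main__":
    total = vm_infiniband_gbit()
    print(f"{total / 1000:.1f} Tb/s per VM")           # 8 x 400 Gb/s
    print(f"{gbit_to_gbyte(total):.0f} GB/s per VM")   # same figure in bytes
    print(f"{vms_needed(1000)} VMs for 1000 GPUs")
```

Eight GPUs at 400 Gb/s each give 3,200 Gb/s, which is the 3.2 Tb/s quoted per VM.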

NVIDIA Quantum-2 is built on the seventh generation of the NVIDIA InfiniBand architecture. It gives AI developers and scientific researchers high network performance and a rich feature set, including remote direct memory access (RDMA) and speeds of up to 400 Gb/s, to power advanced supercomputing data centers.

Microsoft says that large-scale AI is built into Azure's DNA. Its early investments in large language model research, such as Turing, and engineering milestones such as building the first AI supercomputer in the cloud, prepared the ground for generative AI. Azure OpenAI Service lets customers tap into the power of large-scale generative AI models. Scale has always been the guiding goal in optimizing Azure's AI infrastructure. Microsoft is now bringing supercomputing capabilities to startups and enterprises of all sizes, without requiring significant investment in physical hardware or software.

ND H100 v5 is now available in preview and will become a standard offering in the Azure portfolio.

