NVIDIA and Microsoft Create One of the Most Powerful Cloud Computers with AI

NVIDIA and Microsoft are embarking on a new project, namely the creation of one of the world’s most powerful AI supercomputers.

It will be powered by Microsoft’s Azure supercomputing infrastructure combined with NVIDIA GPUs, networking and a full suite of artificial intelligence software. Supercomputer will help enterprises train, deploy and scale artificial intelligence, including large, advanced models.

“Our collaboration with Microsoft will provide researchers and companies with state-of-the-art AI infrastructure and software to capitalize on the transformative power of AI.”

said Manuvir Das, vice president of enterprise computing at NVIDIA.

Azure’s cloud-based AI supercomputer includes powerful and scalable ND- and NC-series virtual machines optimized for AI distributed training and inference. It is the first public cloud to incorporate NVIDIA’s advanced AI stack, adding tens of thousands of NVIDIA A100 and H100 GPUs, NVIDIA Quantum-2 400Gb/s InfiniBand networking and the NVIDIA AI Enterprise software suite to its platform.

“AI is fueling the next wave of automation across enterprises and industrial computing, enabling organizations to do more with less as they navigate economic uncertainties,”
said Scott Guthrie, executive vice president of the Cloud + AI Group at Microsoft.

“Our collaboration with NVIDIA unlocks the world’s most scalable supercomputer platform, which delivers state-of-the-art AI capabilities for every enterprise on Microsoft Azure”,
he continued.

The platform will support a broad range of AI applications and services, including Microsoft DeepSpeed and the NVIDIA AI Enterprise software suite.

Microsoft DeepSpeed will leverage the NVIDIA H100 Transformer Engine to accelerate transformer-based models used for large language models, generative AI and writing computer code, among other applications. This technology applies 8-bit floating point precision capabilities to DeepSpeed to dramatically accelerate AI calculations for transformers – at twice the throughput of 16-bit operations.